If you asked the average Athenian strolling across the Agora some fine day in 420 B.C.,
“How do you like living in the Golden Age?” the answer might have been, “What Golden
Age? You think I don’t have troubles? And have you heard what that nut Pericles is up to 
now?” Residents in Golden Ages are not always aware of where they are. It may be that 
some readers of the Monthly are not aware that they are, right now, smack dab in the 
middle of the Golden Age of Mathematics. Evidence follows. 

The first year of Mathematical Reviews contained 400 pages, index included. Five years 
later, after the backlog had been worked off, the length was 334 pages. One month (August) 
of MR in 1993 consumed 580 pages. I don’t think anyone will argue that the quality of 
papers reviewed has deteriorated sharply since 1940. Theorems are being proved now that 
have never been proved before. One owing to Fermat springs to mind. 

In 1933, the Mathematical Association of America had, in the entire state of California, 
84 members. The current Combined Membership List has 61 columns of members in 
California, with each column containing 90-odd names. Forty years ago, the MAA published 
one journal. Now there are three large ones and two or three smaller ones. Then, the 
Association published a few Carus Monographs and Slaught Papers. Now, its catalog lists 
some 170 titles and is itself almost as long as a Slaught Paper. 

In 1990, 775,000 people were studying calculus in college. In 1993, 100,000 high-school 
students wrote the calculus advanced placement examination. It wasn’t too many years ago 
when there weren’t 100,000 people in the country who had ever had a course in calculus. 
Daniel Webster was not taught arithmetic in elementary school; now every member of 
Congress has been put through algebra. In 1990 there were more than 3,000,000 enrollments
in college mathematics; allowing for duplication, that is almost 10% of the population!
Mathematics and mathematicians command great prestige and respect, as is shown by
how people will apologize for not being good at algebra. They don’t apologize for ignorance 
of metallurgy, even though metals figure as largely in the life of the average citizen as do xs 
and ys. I’m sure that future historians will look back in wonder at the craze for mathematics 
that swept the United States in the second half of the 20th century and speculate on its 
causes. 

Of course, you hear grumblings about how students aren’t what they used to be. But they 
have never been what they used to be. In 1924, when the Leopold-Loeb case temporarily 
added the meaning “homosexual” to the word moron, a professor at the University of 
Chicago said he would need a new word for his students, and suggested osseocaput. 
Boneheads at the University of Chicago? In 1924? Ahmes the scribe probably complained in 
1650 B.C. that Egyptian schools no longer turned out graduates able to deal with 2/17 the
way they used to. United States students seem always to finish in the top three of the
Mathematical Olympiads. Students are all right. After all, we were students once, and look 
at us now. 

In 1992, the National Science Foundation spent $76.5 million for mathematical research. 
That’s $2800 for each and every member of the American Mathematical Society. When you 
add in support for education and from other sources, every member of the AMS and the 
MAA got more than $4000. If you belonged to both, you got more than $8000. How much 
income tax did you pay last year? 

We've got it all. The Golden Age is right now. We should realize it, appreciate it, and 
give thanks. 

— Underwood Dudley 


Introduction to Fermat’s Last Theorem 


David A. Cox* 


The announcement last summer of a proof of Fermat’s Last Theorem was an 
exciting event for the entire mathematics community. This article will discuss the 
mathematical history of Fermat’s Last Theorem (which we will abbreviate throughout
as FLT), broken up into the following periods:

1. Diophantus to Euler (250-1783 A.D.)

2. Euler to Frey (1783-1982 A.D.)

3. Frey to Wiles (1982-1993 A.D.)

We will give only an introduction to the story of Fermat’s Last Theorem, and our
account is by no means definitive. Many of the more technical terms are not 
defined completely and the few proofs that appear are only sketched. On the other 
hand, I hope that the article succeeds in conveying the flavor of this truly 
wonderful mathematics. 


1. DIOPHANTUS TO EULER. Our history of FLT starts around 250 A.D. with
Diophantus, whose Arithmetica considered many problems in elementary number
theory. Consider Problem 8 from Book II, which asks “to divide a given square
number into two squares” ([10], p. 144). Diophantus’ solution is as follows: Let the
given square be 16, let $x^2$ be one of the required squares and $(2x - 4)^2$ the other
square. Therefore, we must satisfy $x^2 + (2x - 4)^2 = 16$, which implies


$$x^2 + 4x^2 - 16x + 16 = 16 \;\Rightarrow\; 5x^2 = 16x \;\Rightarrow\; x = 16/5.$$


Hence the required squares are 256/25 and 144/25. 
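
The computation above is easy to replay with exact rational arithmetic. The following is only a sketch: the function name `split_square` and the `slope` parameter (Diophantus uses 2) are our own labels, not anything taken from the Arithmetica.

```python
from fractions import Fraction


def split_square(side, slope=2):
    """Write side**2 as x**2 + (slope*x - side)**2 with x rational.

    Expanding x^2 + (m*x - s)^2 = s^2 gives (1 + m^2) x^2 = 2*m*s*x,
    so the nonzero solution is x = 2*m*s / (1 + m^2).
    """
    x = Fraction(2 * slope * side, 1 + slope * slope)
    other = slope * x - side
    return x * x, other * other


first, second = split_square(4)   # the given square is 4**2 = 16
print(first, second)              # 256/25 144/25
print(first + second)             # 16
```

Changing `slope` produces other rational decompositions of the same square, which fits the remark that Diophantus is content with one solution.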

We can observe two things about this problem. First, solutions are presumed to 
be rational. We neither restrict to only integer solutions nor generalize to real 
solutions. Second, we care only about finding one solution to a given problem; if 
we find one, we are happy and move on. 

The Arithmetica was one of the last Greek mathematical works translated into 
Latin; this occurred in 1575. Fermat (1601-1665) had a copy of Bachet’s translation
of 1621 and made a series of intriguing annotations in its margins. Sometime in the 
late 1630’s, while thinking about the problem given above, he added the famous 
words in the margin: 


“On the other hand, it is impossible to separate a cube into two cubes, or a
biquadrate into two biquadrates, or generally any power except a square into
two powers with the same exponent. I have discovered a truly marvellous
proof of this, which however the margin is not large enough to contain.”
([10], pp. 144-145)


*This article is based on a lecture given at the 1993 Smith College Regional Geometry Institute.
This audience included high school teachers, undergraduates, graduate students and researchers in
discrete and computational geometry. I would like to thank Thomas Colthurst for transcribing the
lecture, and I am grateful to my colleagues who pointed out errors in earlier versions of the manuscript.


Hence the basic claim of Fermat’s Last Theorem is that the equation $x^n + y^n = z^n$
has no solutions when x, y, z are nonzero integers and n > 2. Generations
of mathematical historians have debated whether Fermat really did have a
proof, though many experts doubt that he did. For one thing, the equation
$x^n + y^n = z^n$ was atypical for Fermat: the vast majority of the other equations he
studied dealt with exponents $\le 4$. Also, in his correspondence, he only stated FLT
for the exponent n = 3. As for Fermat’s “marvellous proof,” it probably used the
technique of infinite descent. His descent argument for n = 4 is actually known: it
can be found in Fermat’s proof that the area of a right triangle with integral sides
cannot be a square. This proof is given in one of his marginal notes, although even
here, Fermat complains that there isn’t enough room to give the proof “with all
detail” ([10], p. 293). It seems likely that Fermat thought that his proofs for n = 3
and 4 generalized, and they almost certainly didn’t. 

So, what happened after Fermat? In 1670, his marginal notes were published by 
his son, and some of his letters appeared in Wallis’ Opera Mathematica. In 1729, 
Goldbach wrote Euler and mentioned some of Fermat’s results. This got Euler, 
only 22 at the time, thinking about number theory. Three years later, Euler wrote 
his first paper on number theory, disproving a conjecture of Fermat’s on primes of 
the form $2^{2^n} + 1$. For the next fifty years, Euler proved many of Fermat’s
conjectures and in so doing, transformed number theory from a collection of 
miscellaneous facts and results into an organized field at the very center of 
mathematics. 

Here is an example of what Euler did. In Problem 17 of Book VI of the 
Arithmetica (Problem 19 in Bachet’s numbering), Fermat had written in the 
margin, “Can one find in whole numbers a square different from 25, when
increased by 2, becomes a cube?... [The answer involves] the doctrine of whole
numbers, which is assuredly very beautiful and very subtle...” ([6], p. 269). In
modern terms, Fermat is claiming that the only integer solutions to $x^3 = y^2 + 2$
are given by $(x, y) = (3, \pm 5)$. You can see how the emphasis is different from
Diophantus—Fermat is looking for all solutions, and he recognizes that asking for 
integer solutions (rather than rational ones) is a question of independent interest. 

To prove Fermat’s claim, Euler ([4], Part II, §193) uses numbers of the form
$a + b\sqrt{-2}$, with a, b integers. First observe


$$x^3 = y^2 + 2 = (y + \sqrt{-2})(y - \sqrt{-2}).$$


One can show that $y + \sqrt{-2}$ and $y - \sqrt{-2}$ are relatively prime, and since their
product is a cube, each of them must also be a cube. Thus there is a number
$p + q\sqrt{-2}$ such that


$$y + \sqrt{-2} = (p + q\sqrt{-2})^3 = p^3 - 6pq^2 + (3p^2 q - 2q^3)\sqrt{-2}$$


$$\Rightarrow\; 1 = 3p^2 q - 2q^3 = q(3p^2 - 2q^2).$$


The last equation implies $p = \pm 1$ and $q = 1$. Substituting this in, we get
$y = p^3 - 6pq^2 = \pm 5$ and $x = 3$, as claimed.
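
Both steps of this argument can be checked numerically. The sketch below is not part of Euler's proof; it searches a finite range for integer solutions of $x^3 = y^2 + 2$ and lists the small integer solutions of $q(3p^2 - 2q^2) = 1$. The function name and the search bounds are our own choices.

```python
from math import isqrt


def cube_minus_two_squares(x_max):
    """All integer pairs (x, y) with x**3 == y**2 + 2 and 1 <= x <= x_max."""
    hits = []
    for x in range(1, x_max + 1):
        target = x ** 3 - 2
        if target < 0:
            continue
        y = isqrt(target)
        if y * y == target:
            hits.append((x, y))
            if y != 0:
                hits.append((x, -y))
    return hits


print(cube_minus_two_squares(10_000))   # [(3, 5), (3, -5)]

# Integer solutions of q * (3p^2 - 2q^2) = 1 in a small box.
small_pq = [(p, q) for p in range(-5, 6) for q in range(-5, 6)
            if q * (3 * p * p - 2 * q * q) == 1]
print(small_pq)                                         # [(-1, 1), (1, 1)]
print([p ** 3 - 6 * p * q * q for p, q in small_pq])    # [5, -5]
```

A finite search proves nothing, of course, but it agrees with the conclusion that $(3, \pm 5)$ are the only solutions.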

This proof, while elegant, is incomplete, for we do not know that numbers of the
form $a + b\sqrt{-2}$ have unique factorization, or even for that matter, primes
(although it is relatively easy to prove that the numbers $a + b\sqrt{-2}$ have these
properties). There are several reasons why this example is important:


• First, it reminds us that there are lots of diophantine equations besides just
FLT, and what we really want is a method for dealing with as many of them as
possible.

• Second, it shows that properties of integers (such as unique factorization) can
apply in more general situations, and it illustrates how a result in one context
(the integers) can be proved by working in a more general context (numbers of
the form $a + b\sqrt{-2}$).

• Finally, the equation $y^2 = x^3 - 2$ is an example of an elliptic curve. Elliptic
curves will play a crucial role in the final proof of FLT.


2. EULER TO FREY. This section is only a sketch of more than two hundred 
years of beautiful and wonderful number theory. For more information on the 
work on FLT done during this period, we warmly recommend both Edwards’ 
Fermat’s Last Theorem [3] and Ribenboim’s 13 Lectures on Fermat’s Last Theorem 
[21]. (Precise references for results mentioned in this section can be found in these 
books.) 

Before we begin, first observe that it suffices to prove FLT for n = 4 (done by 
Fermat) and for n an odd prime (since we can factor the exponent). We can also 
assume that x, y, z are nonzero relatively prime integers (because we can cancel 
common factors). That being said, here are some of the highlights of the 19th 
century work on FLT: 


• By the early 1800s, all of Fermat’s problems were solved except for FLT (thus
justifying the name, Fermat’s Last Theorem).

• 1816: The French Academy announces a prize for a solution to FLT.

• In the 1820’s Sophie Germain shows that if p and 2p + 1 are prime, then
$x^p + y^p = z^p$ has no solution with $p \nmid xyz$. This is the so-called Case I of
FLT. (Case II is where $p \mid xyz$ and is usually regarded as being much harder.)

• 1825: Dirichlet and Legendre prove FLT for n = 5.

• 1832: Dirichlet, after trying to prove it for n = 7, proves FLT for n = 14.

• 1839: Lamé proves FLT for n = 7.

• 1847: Lamé and Cauchy present false proofs of FLT for general n.

• 1844-1847: Kummer’s work on FLT.


Let us describe Kummer’s work on FLT in more detail. Kummer (and Cauchy
and Lamé) started, à la Euler, by factoring the right hand side of the FLT equation
as


$$x^p = z^p - y^p = (z - y)(z - \zeta y)(z - \zeta^2 y) \cdots (z - \zeta^{p-1} y),$$


where $\zeta = e^{2\pi i/p} = \cos(2\pi/p) + i\sin(2\pi/p)$ is a pth root of unity and satisfies
$\zeta^p = 1$. In general, working with roots of unity will require us to use numbers of
the form


$$a_0 + a_1 \zeta + \cdots + a_{p-1} \zeta^{p-1}, \qquad a_0, \ldots, a_{p-1} \in \mathbb{Z},$$


which are called cyclotomic integers. But a problem arises when unique factorization,
one of our main tools, fails for the cyclotomic integers. As Kummer discovered
in 1844, this first occurs for p = 23 (and in fact, unique factorization fails for
all bigger primes as well).
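
The factorization over the pth roots of unity can be sanity-checked in floating point. This is a minimal sketch; the function name `cyclotomic_product` and the sample values z = 3, y = 2, p = 5 are our own, and the comparison is only up to rounding error.

```python
import cmath


def cyclotomic_product(z, y, p):
    """Multiply out (z - y)(z - zeta*y)...(z - zeta**(p-1)*y), zeta = exp(2*pi*i/p)."""
    zeta = cmath.exp(2j * cmath.pi / p)
    product = 1
    for k in range(p):
        product *= z - zeta ** k * y
    return product


value = cyclotomic_product(3, 2, 5)
print(value)            # a complex number very close to 211
print(3 ** 5 - 2 ** 5)  # 211
```

The product is real up to rounding, as it must be, since it equals $z^p - y^p$.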

Kummer’s solution to this was twofold. First, he introduced a generalization of
cyclotomic integers, called ideal numbers, which make up for the lack of unique
factorization. Second, he defined the class number h, which measures how badly
unique factorization fails.
Here is a summary of Kummer’s results:


• 1847: Theorem. FLT holds for p if $p \nmid h$ (such p are called regular primes).
• 1847: Theorem. p is regular iff p doesn’t divide the numerators of the
Bernoulli numbers $B_2, B_4, \ldots, B_{p-3}$.
We can define the Bernoulli numbers by the power series

$$\frac{x}{e^x - 1} = \sum_{n=0}^{\infty} B_n \frac{x^n}{n!}.$$

A corollary of this result is that for p < 100, only 37, 59 and 67 are irregular.

• 1850: The French Academy offers a second prize for a solution to FLT.

• 1856: At Cauchy’s suggestion, the Academy withdraws the prize and then
awards a medal to Kummer.

• 1857: Kummer develops complicated criteria for proving FLT for certain
irregular primes. There are some gaps in his proofs which are later filled in by
Vandiver in the 1920s. These results establish FLT for p < 100.
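
Kummer's Bernoulli criterion is easy to test by machine. The sketch below assumes the usual convention $x/(e^x - 1) = \sum B_n x^n/n!$ (so $B_1 = -1/2$) and computes the $B_n$ from the standard recurrence $\sum_{k=0}^{n} \binom{n+1}{k} B_k = 0$ for $n \ge 1$; the function names are ours.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(limit):
    """Return [B_0, ..., B_limit] as exact fractions (convention B_1 = -1/2)."""
    B = [Fraction(1)]
    for n in range(1, limit + 1):
        # Solve sum_{k=0}^{n} C(n+1, k) * B_k = 0 for B_n.
        partial = sum(comb(n + 1, k) * B[k] for k in range(n))
        B.append(-partial / (n + 1))
    return B


def is_prime(n):
    """Trial division; adequate for the small ranges used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def irregular_primes(bound):
    """Odd primes p < bound dividing a numerator of B_2, B_4, ..., B_{p-3}."""
    B = bernoulli_numbers(bound)
    return [
        p for p in range(3, bound)
        if is_prime(p)
        and any(B[k].numerator % p == 0 for k in range(2, p - 2, 2))
    ]


print(irregular_primes(100))   # [37, 59, 67]
```

Exact fractions are used because the numerators grow quickly and floating point would lose the divisibility information.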


The above history makes a wonderful story about how FLT inspired one of the 
greatest inventions in number theory, but the story is unfortunately false. Kummer 
was actually not trying to prove FLT, but something called a reciprocity theorem. 
Reciprocity theorems have their origins in Fermat’s study of equations like 
$p = x^2 + y^2$ and $p = x^2 + 2y^2$, where p is a prime. In trying to understand these
results, Euler, Lagrange, Legendre and Gauss created the theory of quadratic 
forms and proved the law of quadratic reciprocity. Later, Gauss, Abel and Jacobi 
formulated versions of cubic and biquadratic reciprocity, and Kummer and 
Eisenstein made the first attempts at higher reciprocity laws. Cyclotomic integers 
and ideal numbers came about primarily from Kummer’s attempts to prove these 
higher reciprocity laws. In turn, these concepts not only had something interesting 
to say about FLT, but they also made significant contributions toward the development
of class field theory and abstract algebra (we use the terminology “ideal of a
ring” because of Kummer’s “ideal numbers”).

Here are some highlights of the history of FLT after Kummer: 


• 1908: The Wolfskehl prize for a solution to FLT is announced. Later
inflation in the German mark reduces the value of this prize considerably, but
does not reduce the flow of crank solutions submitted.

• 1909: Wieferich proves that if $x^p + y^p = z^p$ and $p \nmid xyz$ (Case I of FLT), then
$2^{p-1} \equiv 1 \bmod p^2$. This is a strong congruence which is particularly easy to
check on a computer.

• 1953: Inkeri proves that if $x^p + y^p = z^p$ and $x < y < z$, then
$x > ((2p^3 + p)/\log(3p))^p$ in Case I and $x > p^{3p-4}$ in Case II.

• 1971: Brillhart, Tonascia and Weinberger show that Case I of FLT is true for
all primes less than $3 \cdot 10^9$.

• 1976: Wagstaff shows that FLT is true for all primes less than 125,000.
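
To see how restrictive Wieferich's congruence $2^{p-1} \equiv 1 \bmod p^2$ is, one can simply search for primes satisfying it. This is a small sketch with our own function names and an arbitrary search bound of 10,000.

```python
def is_prime(n):
    """Trial division; adequate for the small ranges used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def wieferich_primes(bound):
    """Primes p < bound with 2**(p-1) congruent to 1 modulo p**2."""
    return [p for p in range(2, bound)
            if is_prime(p) and pow(2, p - 1, p * p) == 1]


print(wieferich_primes(10_000))   # [1093, 3511]
```

The three-argument `pow` does modular exponentiation directly, so each test costs only about $\log p$ multiplications. For every other prime in this range, Case I of FLT is already excluded by Wieferich's theorem.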


These results imply that any counterexample to FLT must involve $p \ge 125{,}003$
and $z > y > x > (125{,}003)^{375{,}005} \approx 4.5 \cdot 10^{1{,}911{,}370}$ (in 1992, as a byproduct of
other computations, the lower bound on the exponent was raised to p > 4,000,000;
see [2]).
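
The size of this lower bound can be estimated with logarithms rather than by writing out the number. The sketch below assumes the Case II bound $x > p^{3p-4}$ with $p = 125{,}003$; the helper name is ours, and the mantissa is only as accurate as double-precision arithmetic allows.

```python
from math import floor, log10


def power_in_scientific_notation(base, exponent):
    """Return (mantissa, power_of_ten) with base**exponent ~ mantissa * 10**power_of_ten."""
    log_value = exponent * log10(base)
    power_of_ten = floor(log_value)
    mantissa = 10 ** (log_value - power_of_ten)
    return mantissa, power_of_ten


p = 125_003
mantissa, power_of_ten = power_in_scientific_notation(p, 3 * p - 4)
print(3 * p - 4)                        # 375005
print(round(mantissa, 1), power_of_ten)
```

The result has nearly two million decimal digits, in line with the order of magnitude quoted above.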
We should also mention that the Fermat equation $x^n + y^n = z^n$ has been
studied in many other contexts, including polynomials, entire functions and matrices
(see [21] and, for a recent proof of the polynomial case, [18]).


3. FREY TO WILES. In 1983, Faltings [4] proved the Mordell Conjecture, which
implies that a polynomial equation with rational coefficients O(x, y) = 0 has only
finitely many rational solutions when the curve has genus $\ge 2$ (for a definition of
genus, see the sidebar “The genus of an algebraic curve”). Since $x^n + y^n = 1$ has
genus $\ge 2$ for $n \ge 4$, there are only finitely many rational solutions by the Mordell
Conjecture. Then, clearing denominators, it follows easily that $x^n + y^n = z^n$ has
only finitely many relatively prime integer solutions.


The genus of an algebraic curve 


The genus of a curve given by a polynomial equation p(x, y) = 0 of degree n can
be defined in a variety of ways. When the equation is sufficiently smooth (which is
true for the Fermat curve $x^n + y^n = 1$), then the genus is $g = (n - 1)(n - 2)/2$.
This is $\ge 2$ when $n \ge 4$.
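
As a quick illustration of the degree formula, here is a small table of genera for the Fermat curves of low degree. The function name is ours, and the formula is applied only in the smooth case described above.

```python
def smooth_plane_curve_genus(degree):
    """Genus (n - 1)(n - 2)/2 of a smooth plane curve of degree n."""
    return (degree - 1) * (degree - 2) // 2


for n in range(3, 7):
    print(n, smooth_plane_curve_genus(n))
# 3 1
# 4 3
# 5 6
# 6 10
```

Degree 3 gives genus 1 (an elliptic curve), and every degree from 4 on gives genus at least 2, which is where Faltings' theorem applies.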


Topologically, the solutions of p(x, y) = 0 over the complex numbers form a 
compact Riemann surface minus a finite set of points, and then the genus is just 
the usual genus of this compact real 2-dimensional manifold. 


Analytically, a Riemann surface is a compact complex 1-dimensional manifold, and 
one can define the notion of a holomorphic 1-form. Then the genus is the 
maximum number of linearly independent holomorphic 1-forms on the surface. 


For example, the Riemann sphere has genus zero, so that there are no
holomorphic 1-forms, while the elliptic curve $y^2 = Ax^3 + Bx^2 + Cx + D$ has
genus 1, and up to a constant, $dx/y$ is the only holomorphic 1-form.


This may not seem so useful, since we want to show that the number of 
solutions is actually zero. But Granville [9] and Heath-Brown [11], aided by an 
observation of Filaseta, used the above finiteness result to show that FLT holds for 
“most” exponents, in the sense that if you look at all exponents, prime and
composite, from 3 to n, the percentage where FLT could fail approaches zero as
n increases (see [28] for the details). Also, Adleman and Heath-Brown [1] showed
that Case I of FLT was true for infinitely many prime exponents.

For us, the Mordell Conjecture is interesting because it shows how a general 
conjecture in number theory can have some consequences concerning FLT. Also, 
in proving the Mordell Conjecture, Faltings used the machinery of modern 
algebraic geometry, which had been developing since the 1950’s. 

By the end of the 1980’s, there were several conjectures in number theory 
which, if proved, would imply FLT, though sometimes only for sufficiently large 
exponents (see the sidebar “Conjectures that imply Fermat’s Last Theorem”). This
showed that FLT was not an isolated oddity, but rather was intimately connected
to other parts of number theory. People were especially excited in 1988, when
Miyaoka gave a lecture in Bonn in which he stated one of these conjectures, the 
arithmetic Bogomolov-Miyaoka-Yau inequality, as a theorem. This would have 
proved FLT for all large primes p (without saying explicitly what “large” meant). 
In the days following his lecture, there was much fanfare in the press, and it was 
rather disappointing when a week later an error was found in the argument. 
Conjectures that imply Fermat’s Last Theorem


By the late 1980’s, there were several conjectures in number theory which, if 
proved, would imply FLT, at least for large exponents: 


Diophantine Geometry. The following conjectures in diophantine geometry would 
imply FLT for all sufficiently large exponents: 


• The abc Conjecture states that if a, b, c are relatively prime integers with
a + b = c, then max(|a|, |b|, |c|) is bounded in terms of the primes dividing abc.

• Szpiro’s Conjecture relates the minimal discriminant to the conductor of an
elliptic curve. These terms are discussed in the sidebar entitled “Invariants of
the Frey curve”.

• Vojta’s Conjecture concerns heights of points (relative to the canonical class) of
a curve defined over the integers.


Precise statements of these conjectures (and the relations among them) can be 
found in Lang’s survey article [18]. 
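
To get a feel for the abc Conjecture, one can compare c with the product of the distinct primes dividing abc (the radical). The sketch below assumes the usual formulation, which is not spelled out in this sidebar: for every $\varepsilon > 0$ there is a constant $K_\varepsilon$ with $c \le K_\varepsilon \, \mathrm{rad}(abc)^{1+\varepsilon}$ for positive relatively prime a, b with a + b = c. The names `radical` and `abc_quality` are ours.

```python
from math import gcd, log


def radical(n):
    """Product of the distinct primes dividing n (for n != 0)."""
    n = abs(n)
    rad = 1
    d = 2
    while d * d <= n:
        if n % d == 0:
            rad *= d
            while n % d == 0:
                n //= d
        d += 1
    if n > 1:
        rad *= n
    return rad


def abc_quality(a, b):
    """log(c) / log(rad(abc)) for c = a + b, with a, b positive and coprime."""
    if gcd(a, b) != 1:
        raise ValueError("a and b must be relatively prime")
    c = a + b
    return log(c) / log(radical(a * b * c))


for a, b in [(1, 8), (3, 125), (5, 7)]:
    print(a, b, a + b, radical(a * b * (a + b)), round(abc_quality(a, b), 4))
```

Triples such as 1 + 8 = 9 and 3 + 125 = 128 have quality above 1, meaning c exceeds the radical; in the formulation assumed here, the conjecture says that quality above $1 + \varepsilon$ occurs only finitely often.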


Arithmetic Surfaces. The Bogomolov-Miyaoka-Yau inequality for arithmetic surfaces
relates various invariants of a curve defined over the integers. This inequality
is an arithmetic analog of a well known inequality for complex surfaces. By Parshin
[20] and Vojta (Appendix to [16]), this conjecture implies versions of the above 
diophantine conjectures strong enough to prove FLT for all large exponents. 


Elliptic Curves. The Taniyama-Shimura Conjecture states that all elliptic curves 
over the rational numbers are modular (a more precise statement of the conjecture 
is in the body of the article). As we will explain, the work of Frey, Serre and Ribet 
shows that this conjecture implies FLT for all exponents. 


Of these conjectures, the one ultimately most important for FLT is the 
Taniyama-Shimura Conjecture, which asserts that all elliptic curves over Q are 
modular (this term will be defined below). The full story of this conjecture goes 
back to Jacobi and Riemann, but we will begin our account with the work of 
Gerhard Frey from 1982 to 1986 (see [7] and [8]). Frey showed that nontrivial 
solutions to FLT give rise to very special elliptic curves, which we shall call Frey 
curves. His basic insight was that Frey curves were so special that they couldn’t be 
modular. Hence, if the Taniyama-Shimura Conjecture were true, Frey curves 
couldn’t exist, and FLT would follow. 

If $a^p + b^p = c^p$ is a solution to FLT, then the associated Frey curve is


$$y^2 = x(x - a^p)(x + b^p).$$


As usual, we assume a, b, c are nonzero relatively prime integers and p is an odd
prime. This is an elliptic curve over the rational numbers Q, similar to the equation
$y^2 = x^3 - 2$ considered by Fermat. In general, an elliptic curve over Q is given by
an equation of the form


$$y^2 = Ax^3 + Bx^2 + Cx + D,$$


where A,B,C, D are rational and the cubic polynomial in x on the right hand 
side of the equation has distinct roots. Elliptic curves are a large and important 
part of modern number theory. 
Actually, we have to be a bit careful when constructing the Frey curve. A
solution $a^p + b^p = c^p$ gives rise to solutions $b^p + a^p = c^p$ and
$a^p + (-c)^p = (-b)^p$ (since p is odd). From here it is easy to rearrange the solution so that b is
even and $a \equiv -1 \bmod 4$. This is needed in order that the Frey curve be semistable
(this concept will be discussed below). For technical reasons, we will also assume
that p > 3.

Although Frey had published a paper about Frey curves in 1982 [8], things 
didn’t get really interesting until 1985, when Frey tried to prove that the 
Taniyama-Shimura Conjecture implies FLT. But his proof had some serious gaps. 
Several people tried to fix Frey’s argument, and it was Jean-Pierre Serre [24] who 
saw that a special version of a conjecture he made on level reduction for modular 
Galois representations would fill the gap. Hence we may credit Frey and Serre 
with showing that FLT follows from Taniyama-Shimura and the special level 
reduction conjecture made by Serre. Versions of this argument can be found in [7] 
and [22] and one should also consult Serre’s article [25]. 

Then, in 1986, Ken Ribet made significant progress along this route to FLT by 
proving this version of Serre’s conjecture, and his proof eventually appeared in 
[22]. Thus FLT (for all primes p) was now a consequence of the Taniyama-Shimura 
Conjecture! Inspired by this development, Andrew Wiles began to work on 
Taniyama-Shimura, and seven years later, he presented a proof on June 23, 1993 
that the conjecture is true for semistable elliptic curves, which (as we will see 
below) is good enough to prove FLT. Wiles’ argument is not easy—the manuscript 
containing the proof is over 200 pages long. But many people in the mathematical 
community are confident that the proof will hold up under careful scrutiny. For a 
broad outline of Wiles’ argument, see Ribet’s article [23] (this article also has some 
useful references). 

One interesting observation is that Frey was not the first to discover the Frey 
curve. On page 262 of [12], Hellegouarch writes down the Frey curve for a solution 
to FLT of exponent $2p^n$. Frey curves also appear implicitly as part of the
correspondence between Fermat curves and modular curves considered by Kubert 
and Lang [15]. But Frey was clearly the first to suspect that the Frey curve couldn’t 
exist because of the Taniyama-Shimura Conjecture. 

To explain the Taniyama-Shimura Conjecture, we first need to define the 
concept of modular function. 


Definition. A function f(z) on the upper half plane {z = x + iy: y > 0} is a
modular function of level N if f(z) is meromorphic, even at the cusps (see the
sidebar “The modular curve $X_0(N)$”), and for all integers a, b, c, d with $ad - bc = 1$
and $N \mid c$, we have


$$f\left(\frac{az + b}{cz + d}\right) = f(z).$$


Taniyama-Shimura Conjecture. Given an elliptic curve $y^2 = Ax^3 + Bx^2 + Cx + D$
over Q, there are nonconstant modular functions f(z), g(z) of the same level N such
that

$$f(z)^2 = A g(z)^3 + B g(z)^2 + C g(z) + D.$$


Thus the Taniyama-Shimura Conjecture says that an elliptic curve over Q can 
be parameterized by modular functions, or, as Mazur says in [19], it has a 
“hyperbolic uniformization.” Such an elliptic curve is said to be modular. Wiles 
The modular curve $X_0(N)$


In the text, we considered the transformations $z \mapsto (az + b)/(cz + d)$ of the
upper half plane $\mathfrak{h} = \{z = x + iy : y > 0\}$ associated to the group of matrices


$$\Gamma_0(N) = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} : a, b, c, d \in \mathbb{Z},\ ad - bc = 1,\ N \mid c \right\}.$$


This group acts on $\mathfrak{h}$ with quotient $\mathfrak{h}/\Gamma_0(N)$, and there is a compact Riemann
surface $X_0(N)$ such that


$$\mathfrak{h}/\Gamma_0(N) = X_0(N) - \{\text{finite set of points}\}.$$


These points are the cusps and $X_0(N)$ is a modular curve of level N. (Other
modular curves come from using different groups of matrices).


A function f(z) is invariant under $\Gamma_0(N)$ iff it descends to a function on $\mathfrak{h}/\Gamma_0(N)$.


Thus the definition of modular function means that we have a meromorphic
function on $X_0(N)$. Furthermore, the Taniyama-Shimura Conjecture asserts that if
E is an elliptic curve over Q, then there is a surjective holomorphic map
$X_0(N) \to E$.


As suggested in the text, a holomorphic 1-form on $X_0(N)$ is written F(z) dz,
where F(z) is a cusp form of weight 2 and level N. It follows that the genus of
$X_0(N)$ (see the sidebar “The genus of an algebraic curve”) equals the dimension of
the space of these cusp forms.


It is known that $X_0(2)$ has genus zero (this follows by looking at the fundamental
domain of $\Gamma_0(2)$ acting on $\mathfrak{h}$). Hence there are no cusp forms of weight 2 and level
2. This fact is used in the proof of FLT.


The modular curves $X_0(N)$ play an important role in the theory of elliptic curves.
Some basic facts about modular curves can be found in [14] and [26] (and other 
references can be found in these books). 


proved this conjecture for semistable elliptic curves. We should mention that our 
statement of the conjecture is very naive—some work is needed to show it is 
equivalent to the usual formulation (see the technical appendix to [19]). For a 
discussion of the conjecture and some of its history, see pages 130-135 of Lang’s 
book [17]. At a more elementary level, Mazur’s article “Number theory as gadfly” 
[19] gives a lovely introduction to the Taniyama-Shimura Conjecture. 

Besides modular functions, we also need to know about modular forms of 
weight 2. The easiest way to see how these arise is through elliptic integrals. An 
elliptic integral is an integral of the form 


$$\int \frac{dx}{\sqrt{Ax^3 + Bx^2 + Cx + D}}.$$


(Strictly speaking, this is only an elliptic integral of the first kind; there are many
other types of elliptic integrals.) If $y^2 = Ax^3 + Bx^2 + Cx + D$, then this integral is
simply $\int dx/y$. For a modular elliptic curve, we have $x = g(z)$, $y = f(z)$, and then

$$\frac{dx}{y} = \frac{dg}{f} = \frac{g'(z)\,dz}{f(z)} = F(z)\,dz.$$
One can show that for a, b, c, d as in the definition of modular function, the above
function F(z) transforms via the rule


$$F\left(\frac{az + b}{cz + d}\right) = (cz + d)^2 F(z).$$


We call F(z) a modular form of weight 2 and level N. When the modular
parametrization is chosen correctly, the function F(z) has some remarkable
properties. It is holomorphic and vanishes at the cusps, and for this reason is called
a cusp form. In addition, F(z) is an eigenform for the action of a certain Hecke
algebra on the space of all cusp forms. So F(z) is a rather sophisticated object.

The miracle is that F(z) is intimately connected to the curve
$y^2 = Ax^3 + Bx^2 + Cx + D$. Roughly speaking, one can reconstruct F(z) simply by knowing the
number of solutions of the congruences $y^2 \equiv Ax^3 + Bx^2 + Cx + D \bmod p$ for all
primes p. Then the fact that F(z) is a cusp form of weight 2 and level N tells us
some profound things about the elliptic curve. This is one reason why Taniyama-Shimura
is such a wonderful conjecture: number theorists would be excited by its
proof even if there were no connection to FLT.

We can now sketch the argument of Frey and Serre which shows why FLT
follows from Taniyama-Shimura and the level reduction conjecture of Serre. We
begin with the FLT solution $a^p + b^p = c^p$. As above, we will assume p > 3 is
prime and a, b, c are relatively prime with b even and $a \equiv -1 \bmod 4$. We then get
the Frey curve $y^2 = x(x - a^p)(x + b^p)$.

The discriminant of a polynomial is the product of the squares of the differences
of its roots. For the cubic $x(x - a^p)(x + b^p)$, the discriminant equals


$$(a^p - 0)^2(-b^p - 0)^2(a^p - (-b^p))^2 = a^{2p} b^{2p} c^{2p}$$


Invariants of the Frey curve 


Suppose that we have the Frey curve $y^2 = x(x - a^p)(x + b^p)$ with our usual
assumptions on a, b, c. Then we get the following invariants:


• Besides the discriminant defined in the text, an elliptic
curve over Q has a more subtle invariant called the
minimal discriminant. The minimal discriminant of the
Frey curve is $\Delta = 2^{-8} a^{2p} b^{2p} c^{2p}$. Since b is even and
$p \ge 5$, this is still an integer. This differs from the
discriminant because the discriminant depends on the
particular equation defining the curve, while the minimal
discriminant is intrinsic to the curve.

• The conductor of the Frey curve is $N = \prod_{l \mid abc} l$. The
conductor is the most subtle of the invariants associated
to an elliptic curve over Q. One can show that a modular
elliptic curve is parametrized by modular functions whose
level N equals the conductor of the curve.

• The j-invariant of the Frey curve is
$j = 2^8 (a^{2p} + b^{2p} + a^p b^p)^3 / (a^{2p} b^{2p} c^{2p})$. The j-invariant classifies the curve up
to isomorphism over the complex numbers.


Precise definitions of these invariants can be found in Silverman’s book [26] (see 
pp. 48, 224 and 361). The calculations for the Frey curve can be found in [7] and 
[25]. 
since a, b, c is a solution to FLT. It is unusual for a discriminant to be a pure $2p$th
power; this is our first hint that the Frey curve is very special.
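
The discriminant computation depends only on the shape of the cubic, so it can be checked with stand-in integers. In the sketch below, A and B play the roles of $a^p$ and $b^p$ and A + B plays the role of $c^p$; they are arbitrary sample values, not an actual solution to FLT, and the function names are ours.

```python
from itertools import combinations


def discriminant_from_roots(roots):
    """Product of the squared differences of the given roots."""
    disc = 1
    for r, s in combinations(roots, 2):
        disc *= (r - s) ** 2
    return disc


def frey_type_discriminant(A, B):
    """Discriminant of the cubic x(x - A)(x + B), whose roots are 0, A and -B."""
    return discriminant_from_roots([0, A, -B])


A, B = 3, 4
print(frey_type_discriminant(A, B))   # 7056
print((A * B * (A + B)) ** 2)         # 7056
```

The discriminant is always the square of A·B·(A + B). What makes the Frey curve special is that each of the three factors would itself be a pth power.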

Besides the discriminant of its equation, an elliptic curve over Q has a variety of 
invariants, including its minimal discriminant $\Delta$, conductor N and j-invariant j. For
the Frey curve, these invariants are given in the sidebar “Invariants of the Frey
curve.” In general, $\Delta$, N and j give useful information about the elliptic curve. For
instance, when the curve is modular, one can find a modular parametrization using 
modular functions of level N, where N is the conductor of the curve. This fact will 
play an important role in the proof below. 

We then have the following results about the Frey curve: 


Lemma 1. The Frey curve is semistable. 


Proof: We first need to define what semistable means. When a prime l divides the
discriminant, two or possibly all three of the roots become congruent modulo l.
Roughly speaking, an elliptic curve is semistable if for all such primes l, only two
roots become congruent mod l (the definition is more complicated for the primes 2
and 3). Thus, for primes bigger than 3, the Frey curve is semistable since the
discriminant is $a^{2p} b^{2p} c^{2p}$ and the roots are 0, $a^p$ and $-b^p$, where $a^p$ and $b^p$ are
relatively prime. More work is required to check semistability at l = 2 or 3, and
when l = 2, the conditions b even, $a \equiv -1 \bmod 4$ and p > 3 are needed. For
more details, see [7] and [25]. Q.E.D.


Corollary (Wiles). The Frey curve is modular. 


Lemma 2. For every odd prime $l$ dividing $N$, the j-invariant of the Frey curve can be 
written as $j = l^{-mp} \cdot q$, where $m$ is a positive integer and $q$ is a fraction not involving 
$l$. (We say that the j-invariant is exactly divisible by $l^{-mp}$ in this case.) 


Proof: The j-invariant of the Frey curve is 

$$j = \frac{2^8(a^{2p} + b^{2p} + a^p b^p)^3}{a^{2p}b^{2p}c^{2p}} = \frac{2^8(c^{2p} - a^p b^p)^3}{(abc)^{2p}}.$$

The power of $l$ dividing the denominator is obviously a multiple of $p$, and since 
$a, b, c$ are relatively prime, one sees that $(c^{2p} - a^p b^p)^3$ and $(abc)^{2p}$ are relatively 
prime. Since $N$ is the product of the primes dividing $abc$, the lemma follows easily. 
The lemma fails for $l = 2$ because of the factor of $2^8$ in the numerator. Q.E.D. 
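
The two ingredients used in this proof can be spelled out in a short derivation. This is only a sketch of the routine algebra, assuming (as in the text) that $a^p + b^p = c^p$ and that $a, b, c$ are pairwise relatively prime:

```latex
\begin{align*}
c^{2p} &= (a^p + b^p)^2 = a^{2p} + 2a^p b^p + b^{2p}, \\
a^{2p} + b^{2p} + a^p b^p &= c^{2p} - a^p b^p .
\end{align*}
% Coprimality of c^{2p} - a^p b^p and abc:
% if a prime l divides a or b, then l divides a^p b^p but not c^{2p},
% so l does not divide c^{2p} - a^p b^p;
% if l divides c, then l divides c^{2p} but not a^p b^p,
% and again l does not divide c^{2p} - a^p b^p.
```

So an odd prime $l$ dividing $abc$ appears in $j$ only through the denominator $(abc)^{2p}$, and its exponent there is a multiple of $p$.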


In the context of these three results (semistable modular elliptic curves whose 
j-invariants are exactly divisible by $l^{-(\text{multiple of } p)}$ for odd primes $l$ dividing $N$), the 
level reduction conjecture of Serre applies for all odd primes dividing $N$ (see [6] 
and [25] for the details of how this works). Serre’s conjecture involves Galois 
representations and is rather technical (see [22] for a precise statement), though 
we will discuss its implications below. 

We can now prove Fermat’s Last Theorem: 


Theorem. The equation $x^p + y^p = z^p$ has no integer solutions $x = a$, $y = b$, $z = c$ with 
$a, b, c$ nonzero for $p$ an odd prime. 
Proof: Suppose there were a solution $a^p + b^p = c^p$, with our usual assumptions 
about $p$ and $a, b, c$. Then we have a Frey curve, which by the above corollary has a 
cusp form $F$ of weight 2 and level $N$, where $N$ is the conductor. The Frey curve 
also has a Galois representation $\rho$ on the points of order $p$ on the curve (we won’t 
define precisely what this means). The cusp form $F$ is linked to the representation 
$\rho$ in an especially nice way. 

Serre’s level reduction conjecture deals with the pair $(\rho, F)$, and as we observed 
above, the hypotheses of the conjecture are satisfied for all odd primes $l$ dividing 
$N$. In such a case, the conjecture asserts that there is a cusp form $F'$ of weight 2 
and level $N/l$ with 

$$F' \equiv F \bmod p$$

and $F'$ is also an eigenform for the appropriate Hecke algebra (it takes some 
work to define what it means for modular forms to be congruent modulo $p$). This 
congruence means that $F'$ is linked to $\rho$ in the same way $F$ was, except that $F'$ 
has smaller level $N/l$. But then, if $l'$ is another odd prime dividing $N$, we can 
apply the level reduction conjecture to the pair $(\rho, F')$ and get a cusp form $F''$ 
with even smaller level $N/ll'$, and then apply it again to $(\rho, F'')$, etc. Eventually we 
get a cusp form of weight 2 and level 2. (Note that 2 divides the conductor $N$ 
since $b$ is even.) Here is a diagram of the argument so far: 


solution of FLT → Frey curve → cusp form of level $N$ 
→ cusp form of level $N/l$ 
→ cusp form of level $N/ll'$ 
⋮ 
→ cusp form of level 2 


But it is well known that there are no cusp forms of weight 2 and level 2 (see the 
sidebar “The modular curve $X_0(N)$”). Hence the above diagram self-destructs, 
and Fermat’s Last Theorem is proved! Q.E.D. 
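
The last step can be checked numerically, because the space of weight 2 cusp forms of level $N$ has dimension equal to the genus of $X_0(N)$. The sketch below is illustrative only: it assumes the standard genus formula $g = 1 + \mu/12 - \nu_2/4 - \nu_3/3 - \nu_\infty/2$, and the names `genus_x0` and `level_chain` are hypothetical helpers, not anything from the article. The level 210 is an arbitrary even squarefree number used to show the bookkeeping; it is not the conductor of any actual Frey curve.

```python
from fractions import Fraction
from math import gcd


def prime_factors(n):
    """Return the sorted list of distinct primes dividing n (trial division)."""
    primes, d = [], 2
    while d * d <= n:
        if n % d == 0:
            primes.append(d)
            while n % d == 0:
                n //= d
        d += 1
    if n > 1:
        primes.append(n)
    return primes


def euler_phi(n):
    """Euler's totient function."""
    result = n
    for p in prime_factors(n):
        result -= result // p
    return result


def genus_x0(N):
    """Genus of X_0(N), assuming the standard genus formula."""
    primes = prime_factors(N)
    mu = Fraction(N)  # index term: N * product of (1 + 1/p) over p | N
    for p in primes:
        mu *= Fraction(p + 1, p)
    # Elliptic points of order 2 and 3 (zero when 4 | N, respectively 9 | N).
    nu2 = 0 if N % 4 == 0 else 1
    nu3 = 0 if N % 9 == 0 else 1
    for p in primes:
        nu2 *= 1 if p == 2 else (2 if p % 4 == 1 else 0)
        nu3 *= 1 if p == 3 else (2 if p % 3 == 1 else 0)
    # Number of cusps: sum of phi(gcd(d, N/d)) over the divisors d of N.
    cusps = sum(euler_phi(gcd(d, N // d))
                for d in range(1, N + 1) if N % d == 0)
    g = 1 + mu / 12 - Fraction(nu2, 4) - Fraction(nu3, 3) - Fraction(cusps, 2)
    assert g.denominator == 1
    return int(g)


def level_chain(N):
    """Levels visited when the odd primes of N are removed one at a time."""
    chain = [N]
    for l in prime_factors(N):
        if l != 2:
            chain.append(chain[-1] // l)
    return chain


print(level_chain(210))  # bookkeeping of the descent N, N/l, N/ll', ...
print(genus_x0(2))       # genus of X_0(2), i.e. the dimension at level 2
```

Under these assumptions the chain for 210 ends at level 2, where the genus is 0, so there is no nonzero cusp form of weight 2 for the descent to land on.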


This brings us to the end of the article, but certainly not to the end of the story. 
One thing missing from this account of Fermat’s Last Theorem is the work of the 
many mathematicians who created the theories of elliptic curves, modular forms 
and Galois representations, and searched out the amazing connections between 
them. There is a lot more to say about the mathematics involved in the proof of 
Fermat’s Last Theorem! 

For an introduction to some of this wonderful material, we recommend the 
books by Koblitz [13], Knapp [14] and Silverman [26]. At the undergraduate level, 
the recent book by Silverman and Tate [27] discusses elliptic curves and introduces 
the idea of a Galois representation. 


NOTE ADDED IN PROOF: As of December 1993, Wiles’s manuscript has not yet been released. 
Ken Ribet notes that a delay like this is relatively normal in connection with a long manuscript. Most 
experts continue to believe in the fundamental correctness of the proof. 
Future Elementary Teachers: 
The Neglected Constituency* 


Thomas W. Hungerford 


“We have met the enemy and he is us.” 
—Pogo Possum 


WHERE WE ARE NOW. Complaints about the mathematical preparation of 
incoming students are endemic in both college and high school mathematics 
departments. College professors tend to blame the high school teachers and they, 
in turn, blame the elementary school teachers for the sad state of affairs. Although 
this linear model of blame may be comforting to those at the top, a circular model 
(with blame for all) is a much better reflection of reality because elementary 
teachers are “trained” by the same college professors who complain about incom- 
ing students. In the words of J. R. C. Leitzel [3], 


The mathematical preparation of elementary school teachers is perhaps the 
weakest link in our nation’s entire system of mathematics education. 


Colleges and universities have done little to improve the situation because 
neither the students (prospective elementary teachers) nor their instructors have 
any immediate reason to change. Many of the students have weak mathematical 
backgrounds and a high level of mathematical anxiety. The last thing they want is 
to take more mathematics courses or “harder” ones. 

Many instructors who teach courses for elementary teachers do so unwillingly (I 
avoided teaching them for almost twenty years). If they can’t teach an advanced 
course, they would rather teach calculus or even precalculus—courses with some 
modicum of “real mathematics.” It’s no wonder they feel this way. With few 
exceptions, the typical “mathematics for elementary teachers” sequence is taught 
from one of several isomorphic textbooks of the same title and purports to “cover” 
all of the mathematics in grades K—8 in approximately six semester hours. Every 
mathematical term that might possibly appear in elementary school is mentioned, 
with depth of coverage being inversely proportional to this breadth. 

Apparently, many instructors take the path of least resistance and follow the 
textbook, bad as it may be. From one point of view, this is not particularly 


*This article arises from my experiences at a major research university and a comprehensive urban 
university. In both departments, the mathematics courses for elementary teachers are routinely taught 
by mathematicians whose primary interests are not in mathematics education. Nevertheless, I suspect 
that the comments here apply equally well (with occasional minor changes) to all but a handful of the 
colleges and universities in this country that offer special courses for prospective elementary teachers. 
surprising. Innovative teaching requires time, and time given to teaching is time 
taken from research. With the reward system being what it is in many departments, 
there is little incentive for faculty members to attempt improvements. 

This cannot be the sole explanation, however. Despite similar pressures, calcu- 
lus reform is thriving and a great deal is being done to change the teaching of 
linear algebra, differential equations and a variety of other courses. Nor is reform 
restricted to courses taken by math majors (in which all of us, understandably, have 
a vested interest). For instance, at the 1992 Joint Mathematics Meetings in 
Baltimore there were two sessions devoted to innovations in mathematics courses 
for business students and three sessions to liberal arts mathematics courses. 

Considering what is being done to improve other courses, the lack of attention 
being paid to mathematics courses for prospective elementary teachers is astound- 
ing. Of more than 150 “math education” papers presented at the Baltimore 
meetings, only one specifically mentioned elementary teachers in its title. Anecdo- 
tal evidence suggests that this proportion accurately reflects the larger mathemati- 
cal community’s interest in future elementary teachers as well. 

The operative word here is “future.” There are a number of mathematics 
programs for in-service elementary teachers run by various universities and urban 
mathematics collaboratives, with funding from NSF, Exxon, and other foundations. 
Similar innovative projects for future teachers are extremely rare. Programs for 
in-service teachers are absolutely essential and will continue to be so for some 
time, but they have no direct effect on future teachers—those now enrolled in 
“math for elementary teachers.” This investment in the present generation of 
teachers will be wasted if we continue to ignore the generation that will eventually 
replace them. 


WHERE WE ARE HEADED. Meeting society’s increasing need for a mathemati- 
cally literate work force clearly requires more students to choose careers in 
mathematics and science. The professional well-being of those who teach mathe- 
matics depends in significant part on having a sufficient number of math majors to 
enable our departments to offer a variety of interesting courses and to obtain 
adequate support from the administration. Building and sustaining a political 
climate sympathetic to mathematical research, which will encourage government to 
provide more funding for researchers, depends on the public’s having at least a 
minimal appreciation of what mathematics is and what mathematicians do. 

There is a growing body of evidence to suggest that students’ attitudes toward 
mathematics and science are well established by the time they enter high school. If 
they haven’t developed positive attitudes by then, the chance of their considering 
a career (or college major) in math or science is practically nil, as is the likelihood 
of their later supporting increased funding for mathematical research. Obviously, 
the elementary teacher plays a key role in developing a student’s appreciation of 
mathematics. 

Elementary teachers who don’t know much mathematics, who have little inter- 
est in what it means to do mathematics, and who are afraid of mathematics, are 
not likely to engender positive attitudes toward mathematics in their students. Yet 
these are the kinds of teachers that the current system is geared to produce. The 
inescapable corollary is that continued neglect of the mathematical education of 
prospective elementary school teachers courts disaster, both for the mathematical 
profession and the larger society. 


16 FUTURE ELEMENTARY TEACHERS [January 


WHERE WE OUGHT TO GO INSTEAD. Making the changes needed to turn 
things around will require both curricular reform and attitude adjustment. As to 
the first, there is no secret about what ought to be done. A general framework is 
given in the M.A.A. report A Call For Change: Recommendations For The 
Mathematical Preparation Of Teachers Of Mathematics [4], the first half of which 
deals with elementary teachers, and in pages 132-139 of Curriculum and Evalua- 
tion Standards for School Mathematics (National Council of Teachers of Mathe- 
matics) [5]. Contrary to much current practice, the recommendations assume that 
mathematics departments must 


take seriously the needs of future elementary teachers; 
offer them courses that are adequate in terms of both breadth and depth; and 
expect them to achieve at a reasonable level. 


Both A Call for Change and the relevant pages of the NCTM Standards can 
easily be read at one sitting, so there is no need to paraphrase them here. It may 
be helpful, however, to consider what implementing these recommendations will 
require. 


Level. The current prerequisites for “Math for Elementary Teachers” are usually 
the absolute minimum allowable (two years of college preparatory high school 
math is typical). The course work recommended in A Call For Change presumes 
three years of college preparatory high school mathematics for prospective teach- 
ers in grades K—4 and four years for those in grades 5—8. Consequently, students 
without this preparation (most of them at present) may have to take additional 
courses before they begin the teachers’ sequence. 


Length. At most colleges and universities prospective elementary teachers are 
currently required to take approximately 6 semester hours (or less) of mathematics. 
The recommended minimums are 9 semester hours for teachers in grades K—4 and 
15 semester hours in grades 5-8. 


Content. In terms of breadth, the recommendations include virtually all of the 
major topics mentioned in current texts, plus some that are not (such as algebraic 
structures and concepts of calculus). However, since 50% to 150% more time is to 
be devoted to this material than at present, the coverage will have considerably 
more depth. 


Flavor. With few exceptions, the additional credit hours will require not simply 
more courses from those already offered for other audiences, but some new 
courses specifically designed for this audience. In addition to mathematical con- 
tent, these students should learn what it means to “do mathematics”—that doing 
mathematics is much more than just getting the right answers to computational 
problems. 


Pedagogy. In common with almost all current reform efforts in collegiate mathe- 
matics, the recommendations call for a change in how mathematics is taught to 
prospective teachers. It won’t be enough simply to design new courses or select 
new texts. Instructors in these (and other) courses must be encouraged to adopt 
teaching styles other than the conventional lecture. Ideally, the passive model used 
in most current courses (I tell you what mathematics you need) will be replaced by 
an active one (with my assistance, you explore, discover, or construct the 
mathematics you need). 


Technology. Future teachers will need to be familiar and comfortable with a 
variety of technological tools, including ordinary calculators, graphing calculators, 
and computers, as well as appropriate geometric and computational software. It 
will take time, effort, and money for mathematics departments to develop and/or 
purchase suitable materials and to incorporate them into mathematics courses. 


WHAT IT WILL TAKE TO GET THERE. Obviously, full implementation of the 
M.A.A. recommendations will take considerable time and work. Those who do it 
may have to take on not only their departmental colleagues, but also the college of 
education, the university bureaucracy, and possibly the state department of educa- 
tion. Such restructuring is unlikely to happen, however, unless there is a significant 
change in attitude on the part of individuals, departments, and institutions. 

To begin with, mathematicians will have to abandon the belief that mathematics 
education in general, and certainly anything involving elementary teachers, is not 
their business. Such an attitude is reminiscent of the weapons scientist in the Tom 
Lehrer song (“once the rockets are up, who cares where they come down; that’s 
not my department...”) and equally absurd. No one else has as great a personal 
stake in the mathematical education of future elementary teachers as we do. If we 
don’t make their education our business, the rockets will come down on us. 

It isn’t necessary, or even desirable, for every mathematician to be involved in 
reforming the mathematics program for future teachers. But it is essential that 
those who have the intelligence, interest and talent to do this well be encouraged 
to do so. H. Clemens [1] has addressed these issues in a broader context, but his 
remarks are equally applicable here: 


This will demand...that research mathematicians show more respect for the 
enormous difficulties of mathematics education in the setting of a mass 
nonelite audience, and more respect for the intelligence and precious talent 
of educators who manage to achieve success in that setting. 


Such respect must be more than just lip service; it must include rewards, in 
terms of salary, promotion, teaching loads, etc., comparable to those given good 
researchers. This does not mean that any work with elementary teachers is worthy 
of reward—mediocrity there is no more deserving than mediocrity in research. But 
it does mean that people doing first-rate work in either area ought to be 
adequately rewarded. Encouraging appropriate individuals to undertake the neces- 
sary work and rewarding those who do it well are primarily departmental responsi- 
bilities. Carrying out these responsibilities successfully will require leadership from 
the chairperson and senior members of the department, and possibly a significant 
change in the departmental ethos. 

A new attitude in the department is necessary, but far from sufficient. Effective 
change won’t be possible unless the education college requires its elementary 
majors to participate in the reformed program. Such action cannot be taken for 
granted. Many education faculty members do not value mathematics to the same 
degree that we do and are satisfied with the present situation. If mathematicians 
must be convinced of the real need for change, we can hardly expect education 
faculty to be any different. Most of us are unaccustomed to having to “sell” 
mathematics to others, but in this case it is essential. 

Concerns about academic turf may further complicate the reform process. 
Within a university, requiring more courses in one area may generate pressure to 
require fewer in other areas. At many institutions, the amount of mathematics 
currently required of elementary education majors is a reflection of the minimum 
state certification requirements for elementary teachers. Because of real or per- 
ceived fears of losing students to competitive institutions, the certification require- 
ments may be used as an argument against increasing the amount of mathematics 
required for future teachers (if we do more than the state requires, students will 
go to schools that don’t). Depending on where you live, this could be a very serious 
problem. Of 54 states and territories* that certify teachers, 


More than one-fourth require no mathematics at all for elementary certification! 
Furthermore, as best I can determine, 


At most 10% require more than 6 semester hours of mathematics for elemen- 
tary certification.** 


These figures are depressing enough when “elementary” is defined as K—5 or K-6, 
but in many states elementary certification allows a person to teach seventh and 
eighth grade as well. Thus, revitalizing the mathematical education of future 
elementary teachers may require not only the consent of the mathematics depart- 
ment and the cooperation of the college of education, but also institutional 
support at the highest level. 


WHAT TO DO UNTIL THE DOCTOR COMES. It would be unrealistic to expect 
large numbers of mathematicians to make reform of the program for elementary 
teachers their first priority. Nevertheless, in all but the smallest departments, there 
are likely to be some capable people who would be willing to do that. Those not 
directly involved can certainly encourage and support their efforts. 

If you have never taught the course, you might consider doing so. Teaching the 
course even once will give you a much better picture of what the problems are and 
why change is necessary. In any case, it will take time to bring about change. In the 
meantime, there are many small things that any instructor can do to improve the 
present course. Here, in no particular order, are a few possibilities based on my 
own experience. 

Make your students aware of the mathematical resources available to them, 
particularly journal articles at their level. I routinely require my students to write an 
“article report” on something they have read in The Arithmetic Teacher or other 
expository journals. Until they go looking for such an article, they are unaware of 


* Fifty states, the District of Columbia, Puerto Rico, Guam, and the Virgin Islands. 

**Ten states do not specify minimum numbers of credit hours, but approve an institution’s entire 
program for teachers. In Ohio (one of the ten), the vast majority of institutions require no more than 6 
semester hours of mathematics. It seems likely that the same is true of the other nine. 
the vast amount of mathematical literature designed for classroom teachers and of 
the practical help such articles can provide. 

If possible, have your students attend a meeting of the state or local affiliate of the 
National Council of Teachers of Mathematics. Recently, the annual meeting of the 
Ohio Council of Teachers of Mathematics was held here, so I required my students 
to register ($5) and attend at least one session. Most of them were amazed to 
discover how much is going on in elementary school mathematics. They were also 
impressed with the exhibits by publishers and school-supply organizations—re- 
sources most of them had been unaware of. 

Try to wean yourself from the traditional lecture method, at least for part of each 
class. I have found that a relatively painless way to do this is to have the students 
spend the first 10-15 minutes of class in groups of 3 or 4 discussing current and 
past homework while I circulate from group to group, giving suggestions or 
clarifying what needs to be done. I don’t work the problems for them, but 
encourage them to “talk about it with the group.” At first some students are 
unhappy that I won’t just give them the answer, but they soon find that the vast 
majority of their questions and difficulties can be resolved in the group, or by 
consultation with another group. I do answer for the class those few questions that 
none of the groups have been able to handle. 

Talk to your friends at other institutions about the course. Despite the lack of 
public discussion and major reforms, there are a variety of innovative ideas being 
tried by instructors all over the map. In two conversations at the math meetings, 
for example, I learned of a “lab manual” and a plastic device for visualizing 
reflections in the plane that I will be able to adapt to my own course. 

Don’t let your students (or yourself) get bored. In my experience this is quite 
likely to happen in the early part of the course, which deals with integer and whole 
number arithmetic, topics that form the bulk of the mathematics in the primary 
grades. Even students with weak mathematical backgrounds (often the majority of 
the class) know the mechanics of addition, subtraction, multiplication, and division. 
Having mastered rote procedures for calculating, they pay little attention to why 
the various algorithms work or how they will explain them to children. 

I have found that presenting this material in base five, without telling the 
students that it is base five, keeps me awake and jars the students out of 
complacency. The key is to present the material, not as a “translation exercise” 
from base ten, but as a “new” arithmetic in which you count 1, 2, 3, 4, 10 (fen), 11 
(fenone), 12 (fentwo),..., 21 (twofen-one),..., 100 (one fundred), etc. I tell them 
to forget all the math they learned in the past; we are going to recreate a 
child’s mathematical experiences in the primary grades. This new “fen arithmetic” 
is just similar enough to the “old” one that students aren’t overly anxious and just 
different enough that they can’t rely on what they learned by rote in the past; they 
now must think about what’s going on. For full details, including student reactions, 
see [2]. 
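
For instructors who want to generate examples or check answers in this arithmetic, here is a minimal sketch of counting and column addition with a carry at five. The function names `to_base_five` and `add_base_five` are illustrative choices, not taken from [2], and the code handles only non-negative whole numbers.

```python
def to_base_five(n):
    """Write a non-negative integer as a base-five numeral string."""
    if n == 0:
        return "0"
    digits = []
    while n > 0:
        n, r = divmod(n, 5)
        digits.append(str(r))
    return "".join(reversed(digits))


def add_base_five(x, y):
    """Add two base-five numerals column by column, carrying at five."""
    width = max(len(x), len(y))
    x, y = x.zfill(width), y.zfill(width)
    carry, out = 0, []
    for dx, dy in zip(reversed(x), reversed(y)):
        carry, digit = divmod(int(dx) + int(dy) + carry, 5)
        out.append(str(digit))
    if carry:
        out.append(str(carry))
    return "".join(reversed(out))


# Counting the way the class does: 1, 2, 3, 4, 10, 11, ...
print([to_base_five(n) for n in range(1, 12)])
# "fentwo" plus four, worked in columns with a carry.
print(add_base_five("12", "4"))
```

The addition routine never converts to base ten, so it mirrors the column-by-column reasoning the students are asked to do by hand.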

Undoubtedly, others who have taught the course can contribute other or better 
suggestions—and I certainly hope they will. 
LETTER TO THE EDITOR 


The theorem of Abel, given an elegant new proof by Grosof 
and Taiani in the June-July issue of v. 100 (1993) (575-576), 
could also be proved by the Lagrange Interpolation formula 
applied to $P(r)$ at the points $\{r_1, \dots, r_n\}$, by equating the 
coefficient of $r^{n-1}$ on both sides. The Lagrange Interpolation 
formula, in turn, relies on the fact that a polynomial of degree $n$ 
is uniquely determined by its values at $n + 1$ distinct points, 
which is the “standard” application of Vandermonde’s determinant 
mentioned in the first paragraph. 


Doron Zeilberger 
Department of Mathematics 
Temple University 
Philadelphia, PA 19122 
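
The uniqueness fact behind the letter can be illustrated with exact rational arithmetic. The following is a minimal sketch of Lagrange interpolation, not the argument of Grosof and Taiani; `poly_mul` and `lagrange_coefficients` are hypothetical helper names, and the sample polynomial is an arbitrary choice.

```python
from fractions import Fraction


def poly_mul(p, q):
    """Multiply polynomials given as coefficient lists (constant term first)."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def lagrange_coefficients(points):
    """Coefficients (constant term first) of the polynomial of degree
    less than len(points) passing through the given distinct points."""
    n = len(points)
    total = [Fraction(0)] * n
    for i, (xi, yi) in enumerate(points):
        basis = [Fraction(1)]   # becomes the product of (x - xj), j != i
        denom = Fraction(1)     # becomes the product of (xi - xj), j != i
        for j, (xj, _) in enumerate(points):
            if j != i:
                basis = poly_mul(basis, [Fraction(-xj), Fraction(1)])
                denom *= xi - xj
        for k, c in enumerate(basis):
            total[k] += Fraction(yi) * c / denom
    return total


# Three values of x^2 + 1 determine it: a degree-2 polynomial from 3 points.
print(lagrange_coefficients([(0, 1), (1, 2), (2, 5)]))
```

The top entry of the returned list is the sum of $y_i / \prod_{j \ne i}(x_i - x_j)$, which is the kind of leading-coefficient comparison the letter alludes to.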
Galois Theory for Beginners 


John Stillwell 


Galois theory is rightly regarded as the peak of undergraduate algebra, and the 
modern algebra syllabus is designed to lead to its summit, usually taken to be the 
unsolvability of the general quintic equation. I fully agree with this goal, but I 
would like to point out that most of the equipment supplied—in particular normal 
extensions, irreducible polynomials, splitting fields and a lot of group theory—is 
unnecessary. The biggest encumbrance is the so-called “fundamental theorem of 
Galois theory.” This theorem, interesting though it is, has little to do with 
polynomial equations. It relates the subfield structure of a normal extension to the 
subgroup structure of its group, and can be proved without use of polynomials 
(see, e.g., the appendix to Tignol [6]). Conversely, one can prove the unsolvability 
of polynomial equations without knowing about normality of field extensions or the 
Galois correspondence between subfields and subgroups. 

The aim of this paper is to prove the unsolvability by radicals of the quintic (in 
fact of the general $n$th degree equation for $n \ge 5$) using just the fundamentals of 
groups, rings and fields from a standard first course in algebra. The main fact it 
will be necessary to know is that if $\varphi$ is a homomorphism of group $G$ onto group 
$G'$ then $G' \cong G/\ker\varphi$, and conversely, if $G/H \cong G'$ then $H$ is the kernel of a 
homomorphism of $G$ onto $G'$. The concept of Galois group, which guides the 
whole proof, will be defined when it comes up. With this background, a proof of 
unsolvability by radicals can be constructed from just three basic ideas, which will 
be explained more fully below: 


1. Fields containing $n$ indeterminates can be “symmetrized”. 
2. The Galois group of a radical extension is solvable. 
3. The symmetric group $S_n$ is not solvable. 


When one considers the number of mathematicians who have worked on Galois 
theory, it is not possible to believe this proof is really new. In fact, all proofs seem 
to contain steps similar to the three just listed. Nevertheless, most of the standard 
approach had to be stripped away before the present proof became visible. I read 
the books of Edwards [2], Tignol [6], Artin [1], Kaplansky [3], MacLane and 
Birkhoff [5] and Lang [4], taught a course in Galois theory, and then discarded 
90% of what I had learned. 

I wish to thank my students, particularly Mark Kisin, for helpful suggestions and 
discussions which led to the writing of this paper. I am also grateful to the referee 
for several improvements. 


THE GENERAL EQUATION OF DEGREE n. The goal of classical algebra was to 
express the roots of the general nth degree equation 


(*) $x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0 = 0$ 

in terms of the coefficients $a_0, \dots, a_{n-1}$, using a finite number of operations 
$+, -, \times, \div$ and radicals $\sqrt{\ }, \sqrt[3]{\ }, \dots$. For example, the roots $x_1, x_2$ of the 
general quadratic equation 

$x^2 + a_1 x + a_0 = 0$ 

are expressed by the formula 

$x_1, x_2 = \dfrac{-a_1 \pm \sqrt{a_1^2 - 4a_0}}{2}$. 


Formulas for the roots of general cubic and quartic equations are also known, 
using cube roots as well as square roots. We say that these equations are solvable 
by radicals. 


The set of elements obtainable from $a_0, \dots, a_{n-1}$ by $+, -, \times, \div$ is the field 
$\mathbb{Q}(a_0, \dots, a_{n-1})$. If we denote the roots of (*) by $x_1, \dots, x_n$, so that 

$(x - x_1) \cdots (x - x_n) = x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0$ 

then $a_0, \dots, a_{n-1}$ are polynomial functions of $x_1, \dots, x_n$ called the elementary 
symmetric functions: 

$a_0 = (-1)^n x_1 x_2 \cdots x_n, \quad \dots, \quad a_{n-1} = -(x_1 + x_2 + \cdots + x_n)$. 

The goal of solution by radicals is then to extend $\mathbb{Q}(a_0, \dots, a_{n-1})$ by adjoining 
radicals until a field containing the roots $x_1, \dots, x_n$ is obtained. For example, the 
roots $x_1, x_2$ of the quadratic equation lie in the extension of 
$\mathbb{Q}(a_0, a_1) = \mathbb{Q}(x_1 x_2, x_1 + x_2)$ by the radical 

$\sqrt{a_1^2 - 4a_0} = \sqrt{(x_1 + x_2)^2 - 4x_1 x_2} = \sqrt{(x_1 - x_2)^2} = \pm(x_1 - x_2)$. 

In this case we get $\mathbb{Q}(x_1, x_2)$ itself as the radical extension $\mathbb{Q}(a_0, a_1, \sqrt{a_1^2 - 4a_0})$, 
though in other cases a radical extension of $\mathbb{Q}(a_0, \dots, a_{n-1})$ containing $x_1, \dots, x_n$ 
is larger than $\mathbb{Q}(x_1, \dots, x_n)$. In particular, the solution of the cubic equation gives 
a radical extension of $\mathbb{Q}(a_0, a_1, a_2)$ which includes imaginary cube roots of unity as 
well as $x_1, x_2, x_3$. 
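
The quadratic case can be checked with exact arithmetic. This is a small sketch, assuming nothing beyond the formulas above: `coefficients_from_roots` (a hypothetical helper name) expands $(x - x_1)\cdots(x - x_n)$, and the discriminant $a_1^2 - 4a_0$ is then compared with $(x_1 - x_2)^2$ for sample rational roots.

```python
from fractions import Fraction


def coefficients_from_roots(roots):
    """Return [a_0, ..., a_{n-1}] of the monic polynomial with these roots."""
    poly = [Fraction(1)]  # coefficient list, constant term first
    for r in roots:
        # Multiply the current polynomial by (x - r).
        shifted = [Fraction(0)] + poly
        scaled = [-Fraction(r) * c for c in poly] + [Fraction(0)]
        poly = [s + t for s, t in zip(shifted, scaled)]
    return poly[:-1]  # drop the leading coefficient, which is 1


def quadratic_discriminant(x1, x2):
    """a_1^2 - 4 a_0 for the monic quadratic with roots x1 and x2."""
    a0, a1 = coefficients_from_roots([x1, x2])
    return a1 * a1 - 4 * a0


x1, x2 = Fraction(1, 3), Fraction(-7, 2)
print(coefficients_from_roots([x1, x2]))            # a_0 = x1*x2, a_1 = -(x1 + x2)
print(quadratic_discriminant(x1, x2) == (x1 - x2) ** 2)
```

A numerical check at sample values is evidence rather than proof, but it makes the identity under the square root easy to experiment with.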

In general, adjoining an element $\alpha$ to a field $F$ means forming the closure of 
$F \cup \{\alpha\}$ under $+, -, \times, \div$ (by a non-zero element), i.e., taking the intersection of 
all fields containing $F \cup \{\alpha\}$. The adjunction is called radical if some positive 
integer power $\alpha^m$ of $\alpha$ equals an element $f \in F$, in which case $\alpha$ may be 
represented by the radical expression $\sqrt[m]{f}$. The result $F(\alpha_1)(\alpha_2)\cdots(\alpha_k)$ of 
successive adjunctions is denoted by $F(\alpha_1, \dots, \alpha_k)$ and if each adjunction is radical we 
say $F(\alpha_1, \dots, \alpha_k)$ is a radical extension of $F$. 

It is clear from these definitions that a radical extension $E$ of $\mathbb{Q}(a_0, \dots, a_{n-1})$ 
containing $x_1, \dots, x_n$ is also a radical extension of $\mathbb{Q}(x_1, \dots, x_n)$, since 
$a_0, \dots, a_{n-1} \in \mathbb{Q}(x_1, \dots, x_n)$. Thus we also have to study radical extensions of 
$\mathbb{Q}(x_1, \dots, x_n)$. The most important property of $\mathbb{Q}(x_1, \dots, x_n)$ is that it is symmetric 
with respect to $x_1, \dots, x_n$, in the sense that any permutation $\sigma$ of $x_1, \dots, x_n$ 
extends to a bijection $\sigma$ of $\mathbb{Q}(x_1, \dots, x_n)$ defined by 

$\sigma f(x_1, \dots, x_n) = f(\sigma x_1, \dots, \sigma x_n)$ 

for each rational function $f$ of $x_1, \dots, x_n$. Moreover, this bijection $\sigma$ obviously 
satisfies 

$\sigma(f + g) = \sigma f + \sigma g$, 
$\sigma(fg) = \sigma f \cdot \sigma g$, 

and hence is an automorphism of $\mathbb{Q}(x_1, \dots, x_n)$. 

A radical extension $E$ of $\mathbb{Q}(x_1, \dots, x_n)$ is not necessarily symmetric in this 
sense. For example, $\mathbb{Q}(x_1, \dots, x_n, \sqrt{x_1})$ contains a square root of $x_1$, but not of $x_2$, 
hence there is no automorphism exchanging $x_1$ and $x_2$. However, we can restore 
symmetry by adjoining $\sqrt{x_2}, \dots, \sqrt{x_n}$ as well. The obvious generalization of this 
idea gives a way to “symmetrize” any radical extension $E$ of $\mathbb{Q}(x_1, \dots, x_n)$: 

Theorem 1. For each radical extension $E$ of $\mathbb{Q}(x_1, \dots, x_n)$ there is a radical 
extension $\bar{E} \supseteq E$ with automorphisms $\sigma$ extending all permutations of $x_1, \dots, x_n$. 

Proof: For each adjoined element, represented by radical expression $e(x_1, \dots, x_n)$, 
and each permutation $\sigma$ of $x_1, \dots, x_n$, adjoin the element $e(\sigma x_1, \dots, \sigma x_n)$. Since 
there are only finitely many permutations $\sigma$, the resulting field $\bar{E} \supseteq E$ is also a 
radical extension of $\mathbb{Q}(x_1, \dots, x_n)$. 
This gives a bijection (also called $\sigma$) of $\bar{E}$ sending each $f(x_1, \dots, x_n) \in \bar{E}$ (a 
rational function of $x_1, \dots, x_n$ and the adjoined radicals) to $f(\sigma x_1, \dots, \sigma x_n)$, and 
this bijection is obviously an automorphism of $\bar{E}$, extending the permutation $\sigma$. ∎ 


The reason for wanting an automorphism $\sigma$ extending each permutation of 
$x_1, \dots, x_n$ is that $a_0, \dots, a_{n-1}$ are fixed by such permutations, and hence so is 
every element of the field $\mathbb{Q}(a_0, \dots, a_{n-1})$. If $E \supseteq F$ are any fields, the automorphisms 
$\sigma$ of $E$ fixing all elements of $F$ form what is called the Galois group of $E$ 
over $F$, $\mathrm{Gal}(E/F)$. This concept alerts us to the following corollary of Theorem 1: 


Corollary. If E is a radical extension of Q(ag,...,@, 1) containing x,,...,x,, then 
there is a further radical extension E > E such that Gal(E/Q(ap,...,a,_ )) in- 
cludes automorphisms o extending all permutations of x,,..., X- 


Proof: This is immediate from Theorem 1 and the fact that a radical extension of 
Q(a,,...,4a,—,) containing x,,..., x, is also a radical extension of Q(x,,..., X,,). 
| 


n 


THE STRUCTURE OF RADICAL EXTENSIONS. So far we know that a solution
by radicals of the general $n$th degree equation (*) entails a radical extension of
$\mathbb{Q}(a_0,\ldots,a_{n-1})$ containing $x_1,\ldots,x_n$, and hence a radical extension $\bar{E}$ with the
symmetry described in the corollary above. This opens a route to prove
non-existence of such a solution by learning enough about $\mathrm{Gal}(\bar{E}/\mathbb{Q}(a_0,\ldots,a_{n-1}))$ to
show that such symmetry is lacking, at least for $n \ge 5$. In the present section we
shall show that the Galois group $\mathrm{Gal}(F(\alpha_1,\ldots,\alpha_k)/F)$ of any radical extension
has a special structure, called solvability, inherited from the structure of
$F(\alpha_1,\ldots,\alpha_k)$. Then in the next section we shall show that this structure is indeed
incompatible with the symmetry described in the corollary. To simplify the
derivation of this structure, we shall show that certain assumptions about the adjunction
of radicals $\alpha_i$ can be made without loss of generality.

First, we can assume that each radical $\alpha_i$ adjoined is a $p$th root for some prime
$p$. E.g., instead of adjoining $\sqrt[6]{\alpha}$ we can adjoin first $\sqrt{\alpha} = \beta$, then $\sqrt[3]{\beta}$. Second, if $\alpha_i$
is a $p$th root we can assume that $F(\alpha_1,\ldots,\alpha_i)$ contains no $p$th roots of unity not
in $F(\alpha_1,\ldots,\alpha_{i-1})$ unless $\alpha_i$ itself is a $p$th root of unity. If this is not the case
initially we simply adjoin a $p$th root of unity $\zeta \ne 1$ to $F(\alpha_1,\ldots,\alpha_{i-1})$ before
adjoining $\alpha_i$ (in which case $F(\alpha_1,\ldots,\alpha_{i-1},\zeta)$ contains all the $p$th roots of unity:
$1,\zeta,\zeta^2,\ldots,\zeta^{p-1}$). With both these modifications the final field $F(\alpha_1,\ldots,\alpha_k)$ is
the same, and it remains the same if the newly adjoined roots $\zeta$ are included in the
list $\alpha_1,\ldots,\alpha_k$. Hence we have:


Any radical extension $F(\alpha_1,\ldots,\alpha_k)$ is the union of an ascending tower of fields

$$F = F_0 \subset F_1 \subset \cdots \subset F_k = F(\alpha_1,\ldots,\alpha_k)$$

where each $F_i = F_{i-1}(\alpha_i)$, $\alpha_i$ is the $p_i$-th root of an element in $F_{i-1}$, $p_i$ is prime, and
$F_i$ contains no $p_i$-th roots of unity not in $F_{i-1}$ unless $\alpha_i$ is itself a $p_i$-th root of unity.


Corresponding to this tower of fields we have a descending tower of groups

$$\mathrm{Gal}(F_k/F_0) = G_0 \supseteq G_1 \supseteq \cdots \supseteq G_k = \mathrm{Gal}(F_k/F_k) = \{1\}$$

where $G_i = \mathrm{Gal}(F_k/F_i) = \mathrm{Gal}(F_k/F_{i-1}(\alpha_i))$ and 1 denotes the identity
automorphism. The containments are immediate from the definition of $\mathrm{Gal}(E/B)$, for any
fields $E \supseteq B$, as the group of automorphisms of $E$ fixing each element of $B$. As $B$
increases to $E$, $\mathrm{Gal}(E/B)$ must decrease to $\{1\}$. The important point is that the
step from $G_{i-1}$ to its subgroup $G_i$, reflecting the adjunction of the $p_i$-th root $\alpha_i$ to
$F_{i-1}$, is "small" enough to be describable in group-theoretic terms: $G_i$ is a normal
subgroup of $G_{i-1}$, and $G_{i-1}/G_i$ is abelian, as we shall now show.
To simplify notation further we set

$$E = F_k, \quad B = F_{i-1}, \quad \alpha = \alpha_i, \quad p = p_i,$$

so the theorem we want is:

Theorem 2. If $E \supseteq B(\alpha) \supseteq B$ are fields with $\alpha^p \in B$ for some prime $p$, and if $B(\alpha)$
contains no $p$th roots of unity not in $B$ unless $\alpha$ itself is a $p$th root of unity, then
$\mathrm{Gal}(E/B(\alpha))$ is a normal subgroup of $\mathrm{Gal}(E/B)$ and $\mathrm{Gal}(E/B)/\mathrm{Gal}(E/B(\alpha))$ is
abelian.


Proof: By the homomorphism theorem for groups, it suffices to find a
homomorphism of $\mathrm{Gal}(E/B)$, with kernel $\mathrm{Gal}(E/B(\alpha))$, into an abelian group (i.e., onto a
subgroup of an abelian group, which of course is also abelian). The obvious map
with kernel $\mathrm{Gal}(E/B(\alpha))$ is restriction to $B(\alpha)$, $|_{B(\alpha)}$, since by definition

$$\sigma \in \mathrm{Gal}(E/B(\alpha)) \iff \sigma|_{B(\alpha)} \text{ is the identity map.}$$

The homomorphism property,

$$\sigma'\sigma|_{B(\alpha)} = \sigma'|_{B(\alpha)}\,\sigma|_{B(\alpha)} \quad \text{for all } \sigma', \sigma \in \mathrm{Gal}(E/B),$$

is automatic provided $\sigma|_{B(\alpha)}(b) \in B(\alpha)$ for each $b \in B(\alpha)$, i.e. provided $B(\alpha)$ is
closed under each $\sigma \in \mathrm{Gal}(E/B)$.

Since $\sigma$ fixes $B$, $\sigma|_{B(\alpha)}$ is completely determined by the value $\sigma(\alpha)$. If $\alpha$ is a
$p$th root of unity $\zeta$ then

$$(\sigma(\alpha))^p = \sigma(\alpha^p) = \sigma(\zeta^p) = \sigma(1) = 1,$$

hence $\sigma(\alpha) = \zeta^i = \alpha^i \in B(\alpha)$, since each $p$th root of unity is some $\zeta^i$. If $\alpha$ is not a
root of unity then

$$(\sigma(\alpha))^p = \sigma(\alpha^p) = \alpha^p \quad \text{since } \alpha^p \in B,$$

hence $\sigma(\alpha) = \zeta^j\alpha$ for some $p$th root of unity $\zeta$, and $\zeta \in B$ by hypothesis, so again
$\sigma(\alpha) \in B(\alpha)$. Thus $B(\alpha)$ is closed as required.

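This also implies that $|_{B(\alpha)}$ maps $\mathrm{Gal}(E/B)$ into $\mathrm{Gal}(B(\alpha)/B)$, so it now
remains to check that $\mathrm{Gal}(B(\alpha)/B)$ is abelian. If $\alpha$ is a root of unity then, as we
have just seen, each $\sigma|_{B(\alpha)} \in \mathrm{Gal}(B(\alpha)/B)$ is of the form $\sigma_i$, where $\sigma_i(\alpha) = \alpha^i$,
hence

$$\sigma_i\sigma_j(\alpha) = \sigma_i(\alpha^j) = \alpha^{ij} = \sigma_j\sigma_i(\alpha).$$

Likewise, if $\alpha$ is not a root of unity then each $\sigma|_{B(\alpha)} \in \mathrm{Gal}(B(\alpha)/B)$ is of the form
$\sigma_i$ where $\sigma_i(\alpha) = \zeta^i\alpha$, hence

$$\sigma_i\sigma_j(\alpha) = \sigma_i(\zeta^j\alpha) = \zeta^{i+j}\alpha = \sigma_j\sigma_i(\alpha)$$

since $\zeta \in B$ and therefore $\zeta$ is fixed. Hence in either case $\mathrm{Gal}(B(\alpha)/B)$ is abelian. ∎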

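The property of $\mathrm{Gal}(F(\alpha_1,\ldots,\alpha_k)/F)$ implied by this theorem, that it has
subgroups $\mathrm{Gal}(F(\alpha_1,\ldots,\alpha_k)/F) = G_0 \supseteq G_1 \supseteq \cdots \supseteq G_k = \{1\}$ with each $G_i$
normal in $G_{i-1}$ and $G_{i-1}/G_i$ abelian, is called solvability of $\mathrm{Gal}(F(\alpha_1,\ldots,\alpha_k)/F)$.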


NON-EXISTENCE OF SOLUTIONS BY RADICALS WHEN $n \ge 5$. As we have
said, this amounts to proving that a radical extension of $\mathbb{Q}(a_0,\ldots,a_{n-1})$ does not
contain $x_1,\ldots,x_n$ or, equivalently, $\mathbb{Q}(x_1,\ldots,x_n)$. We have now reduced this
problem to proving that the symmetry of the hypothetical extension $\bar{E}$ containing
$x_1,\ldots,x_n$, given by the corollary to Theorem 1, is incompatible with the solvability
of $\mathrm{Gal}(\bar{E}/\mathbb{Q}(a_0,\ldots,a_{n-1}))$, given by Theorem 2. Our proof looks only at the effect
of the hypothetical automorphisms of $\bar{E}$ on $x_1,\ldots,x_n$, and hence it is really about
the symmetric group $S_n$ of all permutations of $x_1,\ldots,x_n$. In fact, we are adapting a
standard proof that $S_n$ is not a solvable group, given by Milgram in his appendix to
Artin [1].


Theorem 3. A radical extension of $\mathbb{Q}(a_0,\ldots,a_{n-1})$ does not contain $\mathbb{Q}(x_1,\ldots,x_n)$
when $n \ge 5$.

Proof: Suppose on the contrary that $E$ is a radical extension of $\mathbb{Q}(a_0,\ldots,a_{n-1})$
which contains $\mathbb{Q}(x_1,\ldots,x_n)$. Then $E$ is also a radical extension of $\mathbb{Q}(x_1,\ldots,x_n)$
and by the corollary to Theorem 1 there is a radical extension $\bar{E} \supseteq E$ such that
$G_0 = \mathrm{Gal}(\bar{E}/\mathbb{Q}(a_0,\ldots,a_{n-1}))$ includes automorphisms $\sigma$ extending all
permutations of $x_1,\ldots,x_n$.


By Theorem 2, $G_0$ has a decomposition

$$G_0 \supseteq G_1 \supseteq \cdots \supseteq G_k = \{1\}$$

where each $G_i$ is a normal subgroup of $G_{i-1}$ and $G_{i-1}/G_i$ is abelian. We now
show that this is contrary to the existence of the automorphisms $\sigma$.
Since $G_{i-1}/G_i$ is abelian, $G_i$ is the kernel of a homomorphism of $G_{i-1}$ onto an
abelian group, and therefore

$$\sigma, \tau \in G_{i-1} \implies \sigma^{-1}\tau^{-1}\sigma\tau \in G_i.$$

We use this fact to prove by induction on $i$ that, if $n \ge 5$, each $G_i$ contains
automorphisms $\sigma$ extending all 3-cycles $(x_a,x_b,x_c)$. This is true for $G_0$ by
hypothesis, and when $n \ge 5$ the property persists from $G_{i-1}$ to $G_i$ because

$$(x_a,x_b,x_c) = (x_d,x_a,x_c)^{-1}(x_c,x_e,x_b)^{-1}(x_d,x_a,x_c)(x_c,x_e,x_b)$$

where $a,b,c,d,e$ are distinct. Thus if there are at least five indeterminates $x_j$
there are $\sigma$ in each $G_i$ which extend arbitrary 3-cycles $(x_a,x_b,x_c)$, and this means
in particular that $G_k \ne \{1\}$. This contradiction shows that $\mathbb{Q}(x_1,\ldots,x_n)$ is not
contained in any radical extension of $\mathbb{Q}(a_0,\ldots,a_{n-1})$ when $n \ge 5$. ∎
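
The 3-cycle identity that drives the induction can be checked mechanically. The sketch below is an illustration, not part of the original argument: the helper names are hypothetical, the indeterminates are represented by the indices 0 to 4, and products are read right to left (rightmost factor applied first), which is an assumption about the convention, since the text does not state it.

```python
from itertools import permutations

def three_cycle(n, a, b, c):
    """Permutation of range(n), as a tuple, sending a -> b -> c -> a."""
    images = list(range(n))
    images[a], images[b], images[c] = b, c, a
    return tuple(images)

def inverse(p):
    """Inverse permutation: if p sends x to y, the inverse sends y to x."""
    inv = [0] * len(p)
    for x, y in enumerate(p):
        inv[y] = x
    return tuple(inv)

def compose(*perms):
    """Product of permutations, with the rightmost factor applied first."""
    result = tuple(range(len(perms[0])))
    for p in reversed(perms):
        result = tuple(p[y] for y in result)
    return result

def commutator_identity_holds(a, b, c, d, e, n=5):
    """Check (a,b,c) == (d,a,c)^-1 (c,e,b)^-1 (d,a,c) (c,e,b)."""
    sigma = three_cycle(n, d, a, c)
    tau = three_cycle(n, c, e, b)
    lhs = three_cycle(n, a, b, c)
    rhs = compose(inverse(sigma), inverse(tau), sigma, tau)
    return lhs == rhs

# Try every assignment of distinct labels a, b, c, d, e among five symbols.
all_ok = all(commutator_identity_holds(*labels)
             for labels in permutations(range(5)))
print(all_ok)
```

With the opposite (left to right) reading of products the same expression does not return $(x_a,x_b,x_c)$, so the convention matters when checking the identity by hand.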
PICTURE PUZZLE
(from the collection of Paul Halmos)


[Photograph]


This famous topologist was usually 
considered more scary than scared. 
(see page 86.) 
A Family of Invertible Matrices


10214 [1992, 362]. Proposed by Stephen Penrice, Emory University, Atlanta, GA.


For all integers $n > 1$, let $f(n)$ denote the largest real number such that, for
any set of non-negative real numbers satisfying $a_1 + \cdots + a_n < f(n)$, the $n$ by $n$
matrix with $a_1,\ldots,a_n$ along the main diagonal and $-1$ in all other positions is
invertible. Show that $f(n)$ is well-defined, and obtain an explicit formula for it.


Solution by Dirk P. Laurie, Potchefstroom University for Christian Higher
Education, Vanderbijlpark, South Africa. The matrix concerned can be written as
$M = D - ee^T$, where $D = \mathrm{diag}(1 + a_i)$ and $e$ is a column of ones. By the
Sherman-Morrison formula (see Alston S. Householder, The Theory of Matrices in
Numerical Analysis, Dover, 1964, sect. 5.1),

$$M^{-1} = D^{-1} + (1 - e^T D^{-1} e)^{-1} D^{-1} e e^T D^{-1};$$

hence $M$ is invertible whenever $e^T D^{-1} e \ne 1$. Using the fact that the arithmetic
mean of positive numbers is not less than their harmonic mean, we obtain

$$e^T D^{-1} e = \sum_{i=1}^{n} \frac{1}{1 + a_i} \ge \frac{n^2}{n + \sum_{i=1}^{n} a_i} > \frac{n^2}{n + f(n)} \ge 1 \quad \text{if } f(n) \le n^2 - n.$$

The function $f(n) = n^2 - n$ is best possible, since if all $a_i = n - 1$, then $M$ is not
invertible because $Me = 0$.
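
The criterion can be tried on small cases with exact rational arithmetic. The following is a minimal sketch with hypothetical helper names; it relies on the matrix determinant lemma, $\det(D - ee^T) = \det(D)\,(1 - e^T D^{-1} e)$, which is a standard fact brought in here for the check and is not quoted in the solution itself.

```python
from fractions import Fraction

def build_matrix(a):
    """n-by-n matrix with a[i] on the diagonal and -1 in every other position."""
    n = len(a)
    return [[Fraction(a[i]) if i == j else Fraction(-1) for j in range(n)]
            for i in range(n)]

def determinant(rows):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [row[:] for row in rows]
    n = len(m)
    det = Fraction(1)
    for col in range(n):
        # Find a row at or below the diagonal with a nonzero entry in this column.
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det  # a row swap flips the sign
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return det

def criterion(a):
    """e^T D^{-1} e = sum of 1/(1 + a_i), with D = diag(1 + a_i)."""
    return sum(Fraction(1) / (1 + Fraction(x)) for x in a)

n = 4
extremal = [n - 1] * n   # sum equals n^2 - n: the boundary case
below = [3, 3, 3, 2]     # sum is 11, just under n^2 - n = 12
for a in (extremal, below):
    print(sum(a), criterion(a), determinant(build_matrix(a)))
```

In the boundary case the criterion equals 1 and the determinant vanishes; below the boundary the criterion exceeds 1 and the determinant is nonzero.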


Editorial comment. Many solvers noted that the same result holds if the
condition $a_i \ge 0$ is weakened to $a_i > -1$.


Solved also by J. C. Binz (Switzerland), B. W. Brock, F. Brulois and T. Shore and D. B. Tyler, D. 
Callan, R. J. Chapman (U.K.), M. Dindos (Slovakia), Z. Franco, K. S. Kedlaya (student), N. Komanda, 
O. Krafft (Germany), O. P. Lossers (The Netherlands), M. Mocsy (Hungary), A. Nijenhuis, A. Pedersen 
(Denmark), E. Schmeichel, R. Stong, M. Tsatsomeros (Canada), and the proposer. 


Collaborating editors: David F. Appleyard, Paul T. Bateman, Duane M. Broline, 
Barry W. Brunson, Frank S. Cater, Gulbank D. Chakerian, Underwood Dudley, 
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. 
Griggs, Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, 
Daniel J. Kleitman, Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer, 
Stephen L. Portnoy, J. O. Shallit, John Henry Steelman, Kenneth B. Stolarsky, 
David E. Tepper, Douglas B. Tyler, Daniel Ullman, and William E. Watkins. 


Answer to Picture Puzzle 
(p. 27) 


Solomon Lefschetz. 
Into The Hourglass: Reflections on the
Forces Acting on a Granular Material 


E. Bruce Pitman 


The storage and flow of granular materials is common to many industries. For 
example, coal is stored in large bunkers at electric generating plants, to be used 
later in fueling the plant furnaces. A very different kind of material storage occurs 
at cereal manufacturing facilities; here the corn flakes are baked, stored in large 
bins temporarily, and then packaged. A natural question to ask is How strong do 
the storage bins have to be? Engineers would like to be sure that the coal bunkers 
do not collapse under the weight of the coal (as sometimes happens, often with 
loss of life). At the same time, the cereal manufacturer wants to be sure that he 
doesn’t crush the corn flakes into tiny bits before packaging. How does one model 
the forces acting in a granular medium? What kind of pressures build up inside a 
vessel storing granular materials? These are the questions we address in this 
article. 

It would be natural to examine each particle individually, resolve all the forces 
acting on each particle, and then solve Newton’s law Force = Mass - Acceleration 
for each particle. Needless to say, with perhaps 10° or more particles in a bin, such 
a procedure would be prohibitively difficult to carry out. We make a simplification 
here. In trying to determine forces inside the bin, we forget the particle nature of 
the material and treat it as a continuum. Once we make this simplification, we are 
faced with the issue of describing the forces in a continuum, in particular of 
describing the constitutive behavior of the material [that is, explaining the stress- 
strain relationship which models the continuum]. We examine a simple constitutive 
model later on. 

Before looking at the pressures inside a bin of granular material, it would be 
useful to recall the analogous question for fluids. How much pressure is there at 
the bottom of a tall water tower? A look at your basic physics textbook will tell you 
that the pressure at a point at the bottom of a water tower equals the weight of all 
the water above that point. The density of water is 1 gram/cm³; you should
calculate the pressure at the base of a water tower comprised of a cylindrical 
tower, perhaps 20 meters in diameter and 30 meters high, with a spherical top of 
perhaps 20 meters radius. 

In order to estimate the pressure inside a bin of granular material, such as 
illustrated in Figure 2, we need to introduce the concept of stress; this is the
subject of the next section. We then introduce a generalization of sliding friction 
laws as a constitutive model for granular material. Finally we look at the forces 
inside a cylindrical bin and a converging hopper. 

In the sections to follow, we restrict attention to two dimensions; an analysis can 
be carried out in three dimensions, but the 2-D case is a little easier to work out. 
Figure 1. A schematic diagram of a watertower (dimension labels in the drawing: 20 meters, 30 meters, 20 meters).


Figure 2. A typical bin, with a cylindrical part and a converging hopper.


WHAT IS STRESS? In this section we introduce the concept of stress, and 
generalize the idea of sliding friction to model the deformation of granular 


materials. Let us begin with stress. We may take as a working definition of stress 
the following: 


Definition. Stress is the internal force in a continuous medium acting across a 
boundary. 


1994] INTO THE HOURGLASS 29 


To explain this definition a bit, imagine a region $\Omega \subset \mathbb{R}^2$. If $F^i$ is the sum of all
external forces in the $i$th direction acting on $\Omega$, per unit volume, then we write

$$\int_{\Omega} F^i \, dV = \int_{\partial\Omega} \sigma^i(x,\nu) \, ds. \tag{1}$$

Here $dV$ is the volume element for $\Omega$ and $ds$ is the area element on $\partial\Omega$, the
boundary of $\Omega$. The integrand $\sigma^i(x,\nu)$ is called the surface traction in the $i$th
direction; it depends on position $x$ and also upon the outward normal $\nu$ to the
boundary.

Cauchy showed that the dependence of the traction on $\nu$ is linear; that is,

$$\sigma^i(x,\nu) = \sigma^{ij}(x)\,\nu^j(x),$$

with summation over the repeated index $j$ understood. The stress $\sigma^{ij}$ gives the force
per unit area acting in the $i$th direction across a surface whose normal is in the $j$th
direction. Using this relation, we substitute into Eq. (1) to get

$$\int_{\Omega} F^i \, dV = \int_{\partial\Omega} \sigma^{ij}(x)\,\nu^j \, ds.$$

Then applying the Gauss Theorem, we have

$$\int_{\Omega} F^i \, dV = \int_{\Omega} \frac{\partial}{\partial x_j}\sigma^{ij}(x) \, dV.$$

Since the region $\Omega$ is arbitrary, the integrands must be equal:

$$F^i = \frac{\partial}{\partial x_j}\sigma^{ij}. \tag{2}$$

Equation 2 expresses the balance of momentum, or force equilibrium; it says that
the forces acting in the $i$th direction are balanced by the gradient of the stress.
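The stresses $\sigma^{ij}$ are elements of a matrix, $\Sigma$. If we call the 1-direction $x$ and the
2-direction $y$, then

$$\Sigma = \begin{pmatrix} \sigma^{xx} & \sigma^{xy} \\ \sigma^{xy} & \sigma^{yy} \end{pmatrix}.$$

In writing this matrix, we have used the fact that $\sigma^{xy} = \sigma^{yx}$, which arises from
conservation of angular momentum.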

You are already familiar with certain kinds of stress. For example, air pressure is
a stress, albeit a stress of a simple form. Air pressure acts with equal magnitude in
all directions simultaneously; as a matrix, it looks like a multiple of the identity 
matrix. In our examination of granular materials, the off-diagonal entries will be 
important. 

It is common in granular mechanics to consider compressive stresses as positive. 
This is natural for granular materials, since any tensile stress would pull the 
material apart causing it to disintegrate. This convention will introduce an extra 
minus sign later on that we must be aware of. In other branches of mechanics, the 
convention is different. 

We now need to model the deformation of a granular material. If the deforma- 
tion process is slow, then as the particles within the medium move, they slide over 
each other, one particle remaining in contact with another for an extended period 
of time. A simple model of such behavior is a generalization of sliding friction. 
Recall that the normal and tangential forces, $N$ and $T$, acting on a block which sits
on an inclined plane are related by $T \le \mu N$. The block slides (i.e. deformation
occurs) only when equality holds. The friction coefficient $\mu$ is often related to the
angle $\psi$ which the plane makes with the horizontal: $\mu = \tan(\psi)$. How can we
generate a model for deformation of a granular material based on similar ideas?
We must generalize the notions of tangential and normal force.

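Since $\Sigma$ is a $2 \times 2$ matrix, it has two eigenvalues. Let us call these $\sigma^1$ and $\sigma^2$, with
$\sigma^1 > \sigma^2$. Recall from Linear Algebra that the determinant and the trace of two
similar matrices are the same. [The trace of a matrix is the sum of its diagonal
elements.] Using this fact, let us decompose $\Sigma$ as follows:

$$\Sigma = \frac{\sigma^{xx} + \sigma^{yy}}{2}\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} + \begin{pmatrix} \dfrac{\sigma^{xx} - \sigma^{yy}}{2} & \sigma^{xy} \\ \sigma^{xy} & -\dfrac{\sigma^{xx} - \sigma^{yy}}{2} \end{pmatrix}. \tag{3}$$

Now make a similarity transformation, conjugating $\Sigma$ by a change of basis matrix,
call it $P$, whose columns are the right eigenvectors associated to the eigenvalues
$\sigma^1$ and $\sigma^2$. We can again decompose this new matrix into the form

$$P^{-1}\Sigma P = \frac{\sigma^1 + \sigma^2}{2}\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} + \frac{\sigma^1 - \sigma^2}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{4}$$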


The pressure-like term $(\sigma^1 + \sigma^2)/2$ plays the role of the normal force. The
tangential force is given by the term $(\sigma^1 - \sigma^2)/2$. Sliding friction can then be
generalized to read

$$\frac{\sigma^1 - \sigma^2}{2} \le \mu\,\frac{\sigma^1 + \sigma^2}{2}. \tag{5}$$

Equation (5) is written in terms of the eigenvalues $\sigma^1$ and $\sigma^2$; as an exercise,
express this equation in terms of $\sigma^{xx}$, $\sigma^{yy}$, and $\sigma^{xy}$.

What should we use as the friction coefficient $\mu$? A simple solution suggests
itself if we think about everyone's favorite granular material, sand. When you were
younger, perhaps you had a sandbox in your backyard. You might have tried to
build castles and forts in the sand. If so, you undoubtedly noticed that you can
make a heap of sand, the sides of which make a certain angle with the horizontal.
If you tried to make the heap steeper, you couldn't. A sandslide ensued and the
heap retained its characteristic angle. That angle is called the angle of internal
friction, often denoted by $\phi$, and plays a role analogous to the inclined plane angle
$\psi$. Figure 3 illustrates this relation.
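
For the exercise posed after Equation (5), one possible route uses only the invariance of trace and determinant recalled earlier. The lines below are a sketch of that computation, not the author's own solution.

```latex
% Trace and determinant are unchanged by the similarity transformation:
\sigma^1 + \sigma^2 = \sigma^{xx} + \sigma^{yy}, \qquad
\sigma^1 \sigma^2 = \sigma^{xx}\sigma^{yy} - (\sigma^{xy})^2 .
% Hence
(\sigma^1 - \sigma^2)^2 = (\sigma^1 + \sigma^2)^2 - 4\sigma^1\sigma^2
                        = (\sigma^{xx} - \sigma^{yy})^2 + 4(\sigma^{xy})^2 ,
% and, since \sigma^1 > \sigma^2, Equation (5) becomes
\sqrt{\left(\frac{\sigma^{xx} - \sigma^{yy}}{2}\right)^{2} + (\sigma^{xy})^{2}}
  \;\le\; \mu \, \frac{\sigma^{xx} + \sigma^{yy}}{2} .
```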
Figure 3. Panels: sliding friction on an inclined plane; internal friction in a granular material.
When a granular material deforms, the particles can slide along any plane they
wish and are not restricted to slide along a particular plane. Thus sliding will occur
along whichever plane first mobilizes sufficient forces to be in the ratio given by
Equation (5). In Figure 4, we have drawn a diagram showing Normal and
Tangential forces, and a circle whose center is at $(\sigma^1 + \sigma^2)/2$ and whose radius is
$(\sigma^1 - \sigma^2)/2$. This so-called Mohr Circle defines the directions in which Equation
5 first allows frictional sliding. Drawing the angle $\phi$, the usual constraint of sliding
friction imposes the condition $T = \mu N$. Simple trigonometry then gives a relation
between the eigenvalues of $\Sigma$:

$$\sin(\phi) = \frac{\sigma^1 - \sigma^2}{\sigma^1 + \sigma^2}. \tag{6}$$

Thus the friction coefficient $\mu$ in (5) is equal to $\sin(\phi)$.


Figure 4. Resolution of Normal and Tangential forces acting in a granular material. 


Before leaving this section, we remark that one critical difference between
granular materials and fluids is the internal friction angle. For a material like sand,
$\phi = 35^\circ$. However, you cannot make a heap of fluid like you can with sand; fluids
have an angle of internal friction of $0^\circ$. This difference dramatically alters the
stress distribution set up within a sample of fluid or granular material.


FORCES IN A BIN. We are now in a position to compute the forces that act on a 
bin holding granular material. We will begin by considering the simpler case of 
forces inside a cylindrical vessel, and then compute the forces in a converging 
hopper. In the following, we consider the static forces built up when a granular 
material rests inside a bin; inclusion of dynamic effects adds significant complexity 
to the problem. 


Cylindrical bin. Consider the forces shown in Figure 5, where $\rho$ is the density of
the material and $G$ is the acceleration of gravity. In the diagram, we have not
explicitly shown the internal stresses. Assume the origin of a Cartesian coordinate
system lies along the centerline of the bin along the line marked $y = 0$. Instead of
a complete analysis, we content ourselves with an approximate analysis, examining
the average stress

$$\bar\sigma(y) = \frac{1}{D}\int_{-D/2}^{D/2} \sigma^{yy}(x,y)\,dx.$$

Figure 5. Force balance for a cylindrical bin.


Let $\tau$ be the frictional force mobilized along the bin wall. Then force balance
requires

$$-D\,[\bar\sigma(y + \Delta y) - \bar\sigma(y)] + 2\tau\,\Delta y = \rho G\,\Delta y\,D.$$

The extra minus sign on the first term comes from the convention of treating
compressive stresses as positive. If we divide through by $D\,\Delta y$ and let $\Delta y \to 0$ we
have the equilibrium equation

$$-\frac{\partial\bar\sigma}{\partial y} + \frac{2\tau}{D} = \rho G. \tag{7}$$

We now must model the wall force $\tau$. Again our fundamental idea comes from
simple sliding friction. The stress $\tau$ is a shearing force, akin to the tangential force
in the inclined plane figure; in our stress notation, $\tau$ is the $\sigma^{xy}$ stress. We assume
this stress is equal to a friction coefficient times the "normal" stress $\sigma^{xx}$:

$$\sigma^{xy}(D/2,y) = \eta\,\sigma^{xx}(D/2,y).$$

From the sliding friction model, we assume the wall friction coefficient $\eta =
\tan(\phi_w)$, where $\phi_w$ is the wall-material friction angle. Also, we assume that the
eigendirection corresponding to $\sigma^1$ is parallel to the x-axis, and the eigendirection
corresponding to $\sigma^2$ is parallel to the y-axis. Then $\sigma^{xx}$ and $\sigma^{yy}$ are the
eigenvalues, and

$$\sigma^{xx}(x,y) = K\,\sigma^{yy}(x,y)$$

where $K = (1 + \sin(\phi))/(1 - \sin(\phi))$. This convention expresses the idea that, if
we push harder in the x-direction, material will move along the y-direction.


Combining these assumptions, we finally have the governing differential
equation

$$\frac{\partial\bar\sigma}{\partial y} - \frac{2\eta K}{D}\,\bar\sigma = -\rho G. \tag{8}$$

Solving, and using the condition that $\bar\sigma \to 0$ as $y \to H$, gives

$$\bar\sigma = \frac{\rho G D}{2\eta K}\left(1 - \exp\left(-\frac{2\eta K}{D}(H - y)\right)\right). \tag{9}$$

The important point to notice is that, as $y \to 0$, the average stress in a granular
material stored in a bin approaches an asymptotic value $\rho G D/2\eta K$, independent
of how tall the bin is. This behavior is much different than we found for a
liquid-filled bin.
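
The step from the governing equation to its solution is a standard integrating-factor computation. The sketch below introduces the shorthand $c = 2\eta K/D$, which is not in the original, and assumes the equation has the linear form $\partial\bar\sigma/\partial y - c\,\bar\sigma = -\rho G$ stated above.

```latex
% Multiply by the integrating factor e^{-cy}, with c = 2\eta K / D:
\frac{\partial}{\partial y}\left(e^{-cy}\,\bar\sigma\right) = -\rho G\, e^{-cy}
\quad\Longrightarrow\quad
\bar\sigma(y) = \frac{\rho G}{c} + C\, e^{cy} .
% The condition \bar\sigma(H) = 0 fixes the constant:
C = -\frac{\rho G}{c}\, e^{-cH}
\quad\Longrightarrow\quad
\bar\sigma(y) = \frac{\rho G D}{2\eta K}\left(1 - e^{-c\,(H - y)}\right) .
% For H - y several multiples of D/(2\eta K), the exponential is negligible
% and \bar\sigma is close to \rho G D / (2\eta K), whatever the value of H.
```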
To give you some idea of the numbers involved, here are some typical values for
a sand stored in a bin.

• The mass density is about 1.3 g/cm³.

• A typical bin diameter might be 5 meters.

• The internal friction angle $\phi = 35^\circ$.

• The wall friction angle $\phi_w = 15^\circ$.

• The height of fill in a bin is many times the diameter usually; $H = 5D$ is a
reasonable guess.


These values give an asymptotic value of the stress of $\bar\sigma \approx 400G$. By way of
contrast, if the bin had been filled with water (whose density is 1 g/cm³) to the
same height, the pressure at the bottom would be about $10^3 G$. The reason for this
large difference is the internal friction. In effect, the internal friction transfers
much of the weight of a granular medium to the walls; each bit of the bin wall
holds up its share of the material inside. In the case of a fluid like water, all the
weight is supported by the material below; thus the bottom of the bin is made to
support all the weight, leading to high pressures.
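
The comparison can be reproduced numerically. The sketch below is illustrative only: the function names are hypothetical, lengths are taken in centimeters and densities in g/cm³ so that results are in units of $G$, and the stress formula is the asymptotic expression $\rho G D/2\eta K$ together with the exponential profile derived from the governing equation.

```python
import math

def stress_parameters(phi_deg, phi_wall_deg):
    """Return (K, eta) from the internal and wall friction angles in degrees."""
    s = math.sin(math.radians(phi_deg))
    K = (1 + s) / (1 - s)
    eta = math.tan(math.radians(phi_wall_deg))
    return K, eta

def asymptotic_stress(rho, D, phi_deg, phi_wall_deg):
    """rho * D / (2 * eta * K): the limiting average stress, in units of G."""
    K, eta = stress_parameters(phi_deg, phi_wall_deg)
    return rho * D / (2 * eta * K)

def average_stress(y, H, rho, D, phi_deg, phi_wall_deg):
    """Average stress at height y for fill height H, in units of G."""
    K, eta = stress_parameters(phi_deg, phi_wall_deg)
    limit = rho * D / (2 * eta * K)
    return limit * (1 - math.exp(-(2 * eta * K / D) * (H - y)))

rho_sand = 1.3   # g/cm^3
D = 500.0        # bin diameter: 5 meters, in cm
H = 5 * D        # fill height, H = 5D

sand_limit = asymptotic_stress(rho_sand, D, 35.0, 15.0)
sand_bottom = average_stress(0.0, H, rho_sand, D, 35.0, 15.0)
water_bottom = 1.0 * H   # hydrostatic: density times depth, in units of G

print(round(sand_limit), round(sand_bottom), round(water_bottom))
```

With these inputs the sand value comes out at a few hundred $G$, against 2500 $G$ for water at the same depth, and the value at the bottom of the bin is already indistinguishable from the asymptotic one.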


Converging hopper. Figure 6 shows the forces in a converging hopper. Mimicking
the ideas we just used, we derive the differential equation for the average stress

$$\frac{\partial}{\partial y}\bigl(A(y)\,\bar\sigma\bigr) - 2\eta K\,\bar\sigma = -\rho G\,A(y). \tag{10}$$
Figure 6. Force balance for a converging hopper (labels in the drawing: $A(y + \Delta y)$, $y + \Delta y$).

The major difference between Equations 8 and 10 is the variable width $A(y)$ in
(10), instead of the constant width $D$ in (8). To model the tangential force $\tau$, we
have again assumed that the x and y-axes are parallel to the eigendirections. This
assumption is not valid for converging bins, but is a reasonable approximation if
the walls are steep enough. Using a little trigonometry to find $A(y)$, we have


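$$\frac{\partial\bar\sigma}{\partial y} + \frac{1}{y}\left(1 - \frac{2\eta K}{\sin(\theta_w)}\right)\bar\sigma = -\rho G. \tag{11}$$

Solving out we find

$$\bar\sigma = \frac{\sigma_0}{y^{1-\alpha}} - \frac{\rho G\,y}{2 - \alpha} \tag{12}$$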


where $\alpha = 2\eta K/\sin(\theta_w)$. Now we see the assumption of steep walls coming into
play. If $\theta_w$ is small enough, $\alpha > 1$ so the first term of Equation 12 looks like $y^{|1-\alpha|}$.
That is, both terms decay to zero as $y \to 0$. Again, this result for granular
materials is in sharp contrast to the result for typical fluids. The constant of
integration, $\sigma_0$, can be used to match the stress at the top of the hopper.

A more precise force diagram for a converging hopper would use polar
coordinates with the origin at the (virtual) vertex of the hopper, and stresses $\sigma^{rr}$,
$\sigma^{r\theta}$, and $\sigma^{\theta\theta}$. The eigendirections approximately align with the radial and
circumferential directions; the misalignment is due to the fact that gravity acts vertically.
If one carries out such an analysis, one finds $\sigma^{rr} = \rho G f(\theta)\,r$, where the function
$f(\theta)$ is 1 at the centerline of the hopper and decays slowly as $\theta \to \theta_w$.


DISCUSSION. The previous two sections have introduced the concept of stress 
and used force diagrams and reasonable simplifying assumptions to describe the 
pressures that may build up inside a bin containing granular material. We have 
found that granular material behaves quite differently than typical fluids. The 
principal reason for this different behavior is the concept of internal friction. In a
granular material, each particle resists deformation because of frictional contact 
with neighboring particles. In a bin, this frictional effect transfers the weight of a 
slab of material to the wall, reducing the overall pressure inside the bin. 

We have not discussed the effect of dynamics in this article. Dynamic effects 
introduce some additional complexity into the modeling, but some simple problems 
can be formulated and solved. One such problem is addressed in Savage’s paper 
[2]. The paper by Jenike [1] examines more completely the deformation of granular 
material inside a hopper in a quasi-static setting. 
Orderly Currencies


John Dewey Jones 


Despite the proliferation of credit cards, most of us take part in at least one cash 
transaction a day. In these transactions, we often hand over a sum larger than we 
owe, expecting to receive the right change. In 1991, the average American spent 
12.3 hours making or waiting for change.* 

Under these circumstances, it is clearly of the highest importance to develop a 
robust and efficient change-giving algorithm. An acceptable algorithm must also 
use the smallest number of bills or coins to give the correct change. Giving change 
in pennies is mathematically straightforward, but places an unacceptable burden 
on the nation’s pockets and purses. 

To develop a general algorithm, that is, one that will work in all currency 
systems, is impossibly difficult. But there is an algorithm that will work in any 
currency system meeting certain conditions. In this note, we set out the algorithm 
and establish the necessary and sufficient conditions that the currency system must 
meet for it to yield the right change. By observing these conditions when devising 
changes to the currency system, Treasury officials and dictators of newly created 
states can greatly increase the efficiency of cash transactions within their domains. 


Definition. A currency is an ordered set of rationals, $d_1,\ldots,d_n$, with $d_1 < d_2 <
\cdots < d_n$. We refer to the $d_i$ as denominations.


Remark. The extension to the rationals is necessary to take into account fractional 
denominations, such as the half-penny, the farthing, and the groat. The use of 
fractional denominations, though picturesque, greatly complicates financial trans- 
actions, and it is our recommendation that they be eliminated. We can thus restrict 
the $d_i$ to the integers.

We are not aware of any currency having negative denominations. Without 
wishing to prejudge the advantages of such a scheme, we shall, in the remainder of 
this paper, restrict the $d_i$ to the natural numbers.


Definition. A currency is inflated if the set $d_i$ has a highest common factor greater
than unity. 


Remark. Due to inflation or other historical reasons, some currencies have all 
their denominations being multiples of some integer greater than unity. For 
example, the base unit of Kenyan currency is the cent, approximate value $US0.001, 
but there is no 1-cent coin, only 5-cent and 10-cent coins. In the remainder of this 
paper, we shall consider that all currencies have been deflated by dividing their 
denominations by their highest common factor. 


*All statistical data in this note have been made up in order to support my argument. 
Definition. For a given currency, a payout of value $N$ is an ordered set $b_1,\ldots,b_n$
where the $b_i$ are non-negative integers and

$$\sum_{i=1}^{n} b_i d_i = N.$$


Definition. The bulk of a payout is the sum $B_N = \sum_{i=1}^{n} b_i$.


Remark. It might be argued that our aim should be to minimise the total mass of 
change given, rather than the total number of coins. This, however, would make 
the discussion unduly complicated. Instead, we offer the recommendation that all 
denominations should have a mass in proportion to their value. If this recommen- 
dation is adopted, the mass of a given payout becomes fixed. 


Definition. The Change Algorithm is a method used to generate a payout, as
follows:


    Target = N
    For i = n To 1 Step -1
        b_i = int(Target / d_i)
        Target = Target - b_i * d_i
    Next i
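
The Change Algorithm above translates directly into a few lines of Python. This is a minimal sketch: the function name and the choice to return the unpaid remainder alongside the counts are illustrative assumptions, made so that a failure such as a payout of 7 in the currency (2, 3) is visible to the caller.

```python
def change_algorithm(value, denominations):
    """Greedy payout for `value`.

    `denominations` must be sorted in ascending order (d_1 < ... < d_n).
    Returns (counts, remainder), where counts[i] is the number of coins of
    denominations[i] and remainder is whatever could not be paid out.
    """
    target = value
    counts = [0] * len(denominations)
    # Work from the largest denomination down to the smallest.
    for i in range(len(denominations) - 1, -1, -1):
        counts[i] = target // denominations[i]
        target -= counts[i] * denominations[i]
    return counts, target

print(change_algorithm(68, [1, 5, 10, 25]))
print(change_algorithm(7, [2, 3]))
```

The first call pays 68 as two 25s, one 10, one 5 and three 1s with nothing left over; the second leaves a remainder of 1, which is the failure discussed for the currency (2, 3).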


Definition. The Change Series is the unique series $b_1 \ldots b_n$ generated for a given
sum $N$ and a given currency $d_1 \ldots d_n$. It can be characterised by the relationships

$$b_n d_n \le N < (b_n + 1)\,d_n$$

$$b_{n-1} d_{n-1} \le N - b_n d_n < (b_{n-1} + 1)\,d_{n-1}$$

$$\vdots$$


Remark. The definitions given so far are not adequate to ensure that the Change 
Algorithm will always work. For example, consider the currency (2,3), which fails 
when trying to make a payout of 7. A sufficient condition to ensure that the 
Change Algorithm does always work is that $d_1 = 1$. In the remainder of the paper,
we shall consider only cases where this condition is met. 


Definition. A currency is orderly if the bulk of the payout generated by the 
Change Algorithm is less than or equal to the bulk of any other payout of the same 
value. 


Theorem. A necessary and sufficient condition for a currency to be orderly is that it
meets the condition

$$d_j \ge 2d_{j-1} - d_{j-2}, \qquad 3 \le j \le n.$$


Proof: Suppose the theorem holds for currencies having up to $m$ denominations.
(We note that it holds for a currency having just one denomination.) To prove
sufficiency, we start off by supposing $b_{m+1} = \mathrm{int}(N/d_{m+1})$. If this really is the best
value for $b_{m+1}$, we have established the inductive hypothesis. If it isn't, then $b_{m+1}$
must be smaller than this, since it can't very well be bigger. But if we reduce $b_{m+1}$
by 1, we've got an extra $d_{m+1}$ to take care of, and since $d_{m+1} \ge 2d_m - d_{m-1}$ the
best we can hope for is to increase $b_m$ by 1 and reduce $b_{m-1}$ by 1, which doesn't
reduce the sum of the $b_i$. So putting $b_{m+1} = \mathrm{int}(N/d_{m+1})$ must be the right thing
to do, confirming the inductive hypothesis.

Suppose on the other hand that $d_{m+1} < 2d_m - d_{m-1}$. Then $d_{m+1} = 2d_m -
d_{m-1} - k$ for some positive integer $k$. We can represent this $k$ as a sum $\sum c_i d_i$.
Now every time we reduce $b_{m+1}$ by 1, we increase $b_m$ by 1, reduce $b_{m-1}$ by 1, and
reduce each of the $b_i$ by $c_i$, which obviously leads to a net reduction in bulk and
thus refutes the inductive hypothesis. So we may conclude that the condition is
both necessary and sufficient.
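
Orderliness can also be examined by direct search on small examples. The sketch below is an illustration with hypothetical function names: it compares the bulk of the greedy payout with the true minimum bulk, found by dynamic programming, for every value up to a chosen limit, and separately evaluates the condition $d_j \ge 2d_{j-1} - d_{j-2}$. It assumes that 1 is among the denominations, and a search up to a finite limit can only exhibit a failure, not prove orderliness in general.

```python
def greedy_bulk(value, denominations):
    """Number of coins used by the Change Algorithm (assumes 1 is a denomination)."""
    bulk = 0
    for d in sorted(denominations, reverse=True):
        bulk += value // d
        value %= d
    return bulk

def minimum_bulks(limit, denominations):
    """best[v] = fewest coins with total value v, for v = 0..limit."""
    best = [0] * (limit + 1)
    for v in range(1, limit + 1):
        best[v] = 1 + min(best[v - d] for d in denominations if d <= v)
    return best

def first_failure(denominations, limit):
    """Smallest value up to `limit` where the greedy payout is not minimal, else None."""
    best = minimum_bulks(limit, denominations)
    for v in range(1, limit + 1):
        if greedy_bulk(v, denominations) != best[v]:
            return v
    return None

def meets_condition(denominations):
    """Check d_j >= 2*d_(j-1) - d_(j-2) for every j from the third denomination on."""
    d = sorted(denominations)
    return all(d[j] >= 2 * d[j - 1] - d[j - 2] for j in range(2, len(d)))

for currency in ([1, 5, 10, 25], [1, 3, 4], [1, 10, 25]):
    print(currency, meets_condition(currency), first_failure(currency, 200))
```

For (1, 5, 10, 25) the condition holds and no failure is found; for (1, 3, 4) the condition fails and the greedy payout of 6 is not minimal. The last currency is a case worth examining by hand: the condition holds, yet the search reports 30, which three 10s pay out in fewer coins than 25 followed by five 1s.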


RELATED WORK. Exercise 2 on p. 191 of Fundamentals of Computer Algorithms
by Horowitz and Sahni discusses a similar problem. They call the ‘Changegivers
Algorithm’ the ‘greedy solution’, observe that it only works for certain currencies,
and claim that any currency in which the lowest denomination is 1 and higher
denominations are successive powers of any integer will be orderly, in our sense.
This is a stronger condition than the one offered here, and while sufficient, is not
necessary.


SUGGESTIONS FOR FUTURE RESEARCH. We might consider the case where 
the $d_i$ are functions rather than constants. In countries suffering hyperinflation,
for example, it is often necessary to re-issue all bank-notes every few months, 
adding a zero to the denomination each time. This involves considerable expense, 
waste of forestry resources, etc. To obviate this, the currency could be issued 
bearing a simple formula, such as ‘$10°°4"!°””, in place of the conventional 
denomination. Such a currency would appreciate with time, thus encouraging 
saving. 

Alternatively, this formula could be programmed into a microprocessor incorpo- 
rated in the note. The processor would then automatically update an LCD display 
on the face of the note. 
An Interior Fixed Point Property
of the Disc 


Robert F. Brown and Robert E. Greene 


1. STIRRING THE COFFEE. At the end of this evening’s meal, stir your cup of
coffee and watch the motion of the top of the coffee. You will probably notice that
there seem to be points where the coffee is not moving. The Brouwer Fixed Point
Theorem implies that such points must always exist. To be precise, suppose you
could set the whole top surface of the coffee in motion at once. Then after a
moment, every point of the surface would have moved. We could think of this
motion as defining a map, that is, a continuous function, $f: D \to D$ of the closed
disc (the top surface) to itself. But Brouwer proved that the disc has the fixed point
property, that is, that every map from $D$ to itself has a fixed point: a point $p \in D$
such that $f(p) = p$. This contradiction shows that there is a point on the surface
where the coffee is not moving.

You will probably observe some fixed points, but they may turn out to be 
difficult to detect. Since Brouwer’s Theorem says nothing about the location of the 
fixed points, the fixed points might lie only on the boundary of the disc. If the way 
you stir your coffee produces a map with fixed points of this kind, it may be hard to 
convince you that Brouwer’s Theorem is true because the entire (interior) surface 
of the coffee would be in motion, with fixed points only where the coffee touches 
the cup. 

If a map $f: D \to D$ moves every point on the boundary of $D$, which we shall
call $S$, then $f$ must have a fixed point in $\mathrm{int}(D)$, the interior of $D$, since $f$ must
have a fixed point somewhere on $D$. But when $f$ does have a fixed point on $S$,
then $f$ may or may not have additional fixed points in $\mathrm{int}(D)$. It is natural to ask
what one can conclude about the interior fixed points just from knowing the
restriction of $f$ to the boundary $S$, which we denote by $f|S$. We will be concerned
with the case where $f|S$ takes $S$ to itself. If $f|S: S \to S$ has at least one fixed
point, then there is a map $G: D \to D$ with no interior fixed points such that
$G|S = f|S$, as we will show in the next section. What is surprising is that, even if
$f|S: S \to S$ is smooth, that is, continuously differentiable, there may not be any
smooth map $G: D \to D$ with $G|S = f|S$ that has no interior fixed points. In joint
research with Helga Schirmer [1], we investigated when smooth extensions of maps
on the boundary of a compact differentiable manifold must have interior fixed
points. Our goal in this article is to explain these ideas in a concrete and
easily-visualized setting, the 2-dimensional disc.

Most of the things that can be done continuously in topology can be done 
smoothly also, because of general results about smooth approximation of continu- 
ous functions. But there are some exceptions, usually surprising, to that general 
principle. We will tell you about a new surprising exception. 
2. CONTINUOUS EXTENSIONS WITHOUT FIXED POINTS. We will think of
the points of the plane as complex numbers $z = x + iy$ and by $D$ we mean the unit
disc where $|z| = \sqrt{x^2 + y^2} \le 1$. When the point is on the boundary $S$, we will
emphasize this by using the corresponding Greek letter $\zeta$ instead.

Now we will present the general construction that we promised in the introduc- 
tion. 


Theorem 1. Given a map $f: S \to S$ with at least one fixed point, there exists a map
$G: D \to D$ such that $G|S = f$ and $G$ has no fixed points on $\mathrm{int}(D)$.


Proof: Choose a fixed point $\omega$ of $f$. For each point $z \in D$ other than $\omega$, let $\zeta$
denote the point where the ray from $\omega$ through $z$ intersects $S$ (see Figure 1). Since
$z$ lies on the line segment with end points $\omega$ and $\zeta$, we may write

$$z = (1 - r)\omega + r\zeta$$

for some $r$, with $0 \le r \le 1$. We define $G: D \to D$ by setting

$$G(z) = (1 - r^2)\omega + r^2 f(\zeta).$$

Then $G$ has no fixed points on $\mathrm{int}(D)$ because $G(z)$ is proportionately closer to $\omega$
on the line segment between $\omega$ and $f(\zeta)$ than $z$ was on the original line segment,
as we indicate in Figure 1.
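
The construction in the proof can be tried numerically. The sketch below is an illustration under stated assumptions, not the authors' code: it takes $f$ to be the squaring map on $S$ with fixed point $\omega = 1$, represents points as Python complex numbers, and uses hypothetical helper names. It finds $\zeta$ by intersecting the ray from $\omega$ through $z$ with the unit circle, recovers $r$, and evaluates $G(z) = (1 - r^2)\omega + r^2 f(\zeta)$.

```python
def boundary_point(z: complex, omega: complex) -> complex:
    """Point other than omega where the ray from omega through z meets |z| = 1."""
    v = z - omega
    # Solve |omega + t*v| = 1 for t > 0, using |omega| = 1.
    t = -2.0 * (omega.conjugate() * v).real / abs(v) ** 2
    return omega + t * v


def extend(f, omega: complex):
    """Return G(z) = (1 - r^2) * omega + r^2 * f(zeta), following the proof."""
    def G(z: complex) -> complex:
        if z == omega:
            return omega
        zeta = boundary_point(z, omega)
        r = abs(z - omega) / abs(zeta - omega)
        return (1 - r * r) * omega + r * r * f(zeta)
    return G


omega = 1 + 0j
G = extend(lambda zeta: zeta ** 2, omega)

# Sample interior points on a coarse grid and record how far G moves each one.
grid = [complex(i / 10, j / 10) for i in range(-9, 10) for j in range(-9, 10)]
interior = [z for z in grid if abs(z) < 1]
smallest_move = min(abs(G(z) - z) for z in interior)
```

On the boundary the sketch reproduces $f$ (for example `G(1j)` is approximately $-1 = i^2$), and `smallest_move` stays strictly positive on the sampled grid, as Theorem 1 predicts.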


Figure 1


It is not difficult to believe that if f is smooth, the map G we constructed in 
Theorem 1 is smooth at every point other than w, but we will next see that, in 
many cases, no map with the properties of G could be smooth on all of D. 


3. THE INTERIOR FIXED POINT PROPERTY. A smooth map of $S$ to itself
which does not extend to a smooth map on $D$ with no interior fixed points is the complex
squaring map $s(\zeta) = \zeta^2$. If we define $s: D \to D$ by $s(z) = z^2$, then $s$ has an
interior fixed point since $s(0) = 0$, as well as one fixed point $\omega = 1$ on $S$. You
might expect that we could modify $s$ to eliminate the interior fixed point without
giving up smoothness. But that is impossible, according to the following result.
Theorem 2 (Interior Fixed Point Property). Suppose $f: D \to D$ is a smooth map
such that for some integer $k \ge 2$ we have $f(\zeta) = \zeta^k$ for all $\zeta \in S$. Then $f(z) = z$
for some $z \in \mathrm{int}(D)$.


Thus by the $k = 2$ case of the Property, if we wish to eliminate the interior fixed
point of the squaring map $s(z) = z^2$, without changing the map on $S$, we can hope
to do it only if we are willing to give up the smoothness of the map. That is what
will happen if we apply the construction of $G$ of the previous section; it gives us a
map that fails to be smooth at $\omega = 1$.

If $f: D \to D$ is a smooth map such that $f(\zeta) = \zeta^k$ for $\zeta \in S$ where $k < 2$, must
there still be fixed points in the interior of $D$? That is certainly not the case if
$k = 0$ since we can just let $f(z) = z^0 = 1$ for all $z \in D$.

We can also find a smooth map $f$ which is the identity on $S$, that is $f(\zeta) = \zeta^1$,
but which has no interior fixed points. Identify the complex number $z = x + iy$
with the pair $(x, y)$ and let


$$f(z) = f(x, y) = \bigl(x + \tfrac{1}{2}(1 - x^2 - y^2),\ y\bigr).$$


The map $f$ moves all the points in $\mathrm{int}(D)$ by a horizontal motion to the right. It is
an elementary exercise to show that the map really takes $D$ to itself, that is, when
a point of $\mathrm{int}(D)$ moves to the right it doesn’t go outside the unit disc. We can see
that $f$ is smooth because its coordinate functions are polynomials. We can also use
the same kind of horizontal motion construction for the case $k = -1$. Just set


$$f(z) = f(x, y) = \bigl(x + \tfrac{1}{2}(1 - x^2 - y^2),\ -y\bigr);$$


then for $\zeta = (x, y) \in S$ we have


$$f(\zeta) = (x, -y) = \zeta^{-1}.$$
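
The "elementary exercise" can be checked numerically before it is proved. The sketch below assumes the coefficient of $(1 - x^2 - y^2)$ is $\tfrac{1}{2}$ (any coefficient in $(0, \tfrac{1}{2}]$ keeps the image inside the disc by the same argument); the function names are hypothetical. It samples the closed disc, confirms that both maps stay inside it and move every sampled interior point, and checks the boundary behavior for $k = 1$ and $k = -1$.

```python
import math


def shift_identity(x: float, y: float):
    """Horizontal shift that restricts to the identity (k = 1) on the unit circle."""
    return (x + 0.5 * (1 - x * x - y * y), y)


def shift_conjugate(x: float, y: float):
    """Horizontal shift followed by reflection; restricts to zeta**-1 on the unit circle."""
    return (x + 0.5 * (1 - x * x - y * y), -y)


samples = [(i / 20, j / 20) for i in range(-20, 21) for j in range(-20, 21)
           if (i / 20) ** 2 + (j / 20) ** 2 <= 1.0]
interior = [(x, y) for (x, y) in samples if x * x + y * y < 1.0]

# Largest distance from the origin reached by any image point.
max_norm = max(math.hypot(*g(x, y))
               for g in (shift_identity, shift_conjugate)
               for (x, y) in samples)

# Smallest displacement of an interior sample point under either map.
min_move = min(math.hypot(g(x, y)[0] - x, g(x, y)[1] - y)
               for g in (shift_identity, shift_conjugate)
               for (x, y) in interior)
```

Here `max_norm` does not exceed 1 (up to rounding) and `min_move` is positive, consistent with both maps taking $D$ to itself without interior fixed points.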


When $k \le -2$, it is still true that the map $f: S \to S$ defined by $f(\zeta) = \zeta^k$ can
be extended to a smooth map $f: D \to D$ without fixed points on $\mathrm{int}(D)$, as we just
did for $k = 1, 0, -1$. However, this is carried out by means of a general construction,
rather than by writing down a formula for $f(z)$; see [1], Corollary 7.4.


4. THE FIXED POINT INDEX. Although the Interior Fixed Point Property is a
“global” result, a statement about the behavior of a map on all of $\mathrm{int}(D)$, we will
see that it depends on the local behavior of a smooth map near its fixed points.
The fixed point index, the principal tool of fixed point theory for studying the 
behavior of a map in a neighborhood of a fixed point, was introduced by Heinz 
Hopf in the 1920s. We will present Hopf’s theory in the setting in which we need 
it: for maps of subsets of the plane with smooth boundary. 

Let $M$ be the closure of an open connected subset of the plane, viewed as the
complex numbers $\mathbf{C}$, whose boundary is a union of smooth simple closed curves.
There is a neighborhood $U$ of $M$ such that for each $z \in U - M$ there is a unique
point $\zeta$ on the boundary of $M$ that is closest to $z$. (See Figure 2.) For the unit
disc, we view $M = D$ as the closure of $\mathrm{int}(D)$; then $U$ can be any open disc
containing $D$, and $\zeta = z/|z|$.

Let $f: M \to M$ be a map whose set of fixed points $\mathrm{Fix}(f) = \{p \in M \mid f(p) = p\}$
is finite. Define $F: U \to \mathbf{C}$ by


$$F(z) = \begin{cases} z - f(z), & \text{if } z \in M \\ z - f(\zeta), & \text{if } z \in U - M. \end{cases}$$
Figure 2


For any $p \in \mathrm{Fix}(f)$ we can find a circle $\Gamma$ in $U$ about $p$ with the property that $p$ is
the only zero of $F$ in the disc bounded by $\Gamma$. The index $i(f, p)$ of $f$ at $p$ is defined
to be the winding number (as in complex analysis) of the closed curve $\gamma = F(\Gamma)$
around the origin 0. We recall that, intuitively, the winding number measures the
total number of times $\gamma$ goes around 0, where counterclockwise rotation counts
positively and clockwise negatively. Compare the examples in Figure 3. For a
rigorous treatment of the winding number, you can refer to, for instance, [3], pages
114-116.


Figure 3. Three closed curves, with winding number 0, winding number +1, and winding number −2 around the origin.
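
The definition of the index lends itself to a numerical experiment. The sketch below is an illustration, not part of the original argument: it approximates the winding number by summing the small changes of argument along a sampled circle, and applies it to the map $F$ built from the squaring map $s(z) = z^2$ of Section 3. The function names, the radius 0.1, and the sample count are all assumptions chosen for the example.

```python
import cmath
import math


def winding_number(points) -> int:
    """Winding number about 0 of the closed polygon through the given nonzero points."""
    total = 0.0
    n = len(points)
    for k in range(n):
        # Argument change between consecutive samples, taken in (-pi, pi].
        total += cmath.phase(points[(k + 1) % n] / points[k])
    return round(total / (2 * math.pi))


def F_squaring(z: complex) -> complex:
    """F(z) = z - s(z) on the closed unit disc and z - s(z/|z|) outside it."""
    if abs(z) <= 1:
        return z - z * z
    zeta = z / abs(z)
    return z - zeta * zeta


def index_at(F, p: complex, radius: float = 0.1, samples: int = 2000) -> int:
    """Index at p: winding number of F around 0 along a small circle about p."""
    circle = [p + radius * cmath.exp(2j * math.pi * k / samples)
              for k in range(samples)]
    return winding_number([F(z) for z in circle])


index_origin = index_at(F_squaring, 0j)        # interior fixed point 0
index_boundary = index_at(F_squaring, 1 + 0j)  # boundary fixed point 1
```

For the squaring map the two fixed points are 0 and 1; the sketch gives index 1 at the origin and index 0 at the boundary point, so the indices sum to 1.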


An important property of the winding number is that it is homotopy invariant.
In general, a homotopy is a map $H: X \times [0,1] \to Y$, which may also be thought of
as a continuous family of maps $h_t: X \to Y$ defined by $h_t(x) = H(x, t)$ for $0 \le t \le 1$.
The map $h_0$ is said to be homotopic to $h_1$. In particular, the closed curve $\gamma = F(\Gamma)$
is determined by a map $F|\Gamma: \Gamma \to \mathbf{C} - 0$ so a homotopy $H: \Gamma \times [0,1] \to \mathbf{C} - 0$
defines a family of closed curves $\gamma_t = h_t(\Gamma)$. The homotopy invariance of the winding
number states that the winding number of $\gamma_0$ around the origin equals the winding
number of the homotopic curve $\gamma_1$ around the origin.

Although the fixed point index is defined locally, Hopf proved an important 
global result which, in our setting, has the following statement. 


Theorem 3 (Hopf [2]). If $M$ is the closure of an open, connected subset of the plane
whose boundary is a union of smooth simple closed curves and $f, g: M \to M$ are
homotopic maps with both $\mathrm{Fix}(f)$ and $\mathrm{Fix}(g)$ finite, then


$$\sum_{p \in \mathrm{Fix}(f)} i(f, p) = \sum_{q \in \mathrm{Fix}(g)} i(g, q).$$
Hopf’s Theorem implies the Brouwer Fixed Point Theorem, as follows. Let
$f: D \to D$ be any map. For the constant map $c$ taking $D$ to the origin, the map
$F(z) = z - c(z)$ is the identity map so


$$\sum_{p \in \mathrm{Fix}(c)} i(c, p) = i(c, 0) = 1.$$


Since $D$ is contractible, all self-maps of $D$ are homotopic. If a map $f$ has an
infinite number of fixed points there is nothing to prove. If $\mathrm{Fix}(f)$ is finite then, by
Hopf’s Theorem, when we sum over all fixed points we must always have


$$\sum_{p \in \mathrm{Fix}(f)} i(f, p) = 1,$$


which clearly implies that $\mathrm{Fix}(f)$ is nonempty.
In the next section, we will demonstrate the following. 


Theorem 4 (Boundary Fixed Point Index Theorem). Let $f: D \to D$ be a smooth
map with a finite number of fixed points such that $f(\zeta) = \zeta^k$ for all $\zeta \in S$, for some
$k \ge 2$. If $\pi$ is a fixed point of $f$ that lies in $S$, then either $i(f, \pi) = 0$ or $i(f, \pi) = -1$.


The Interior Fixed Point Property follows easily from this result. If $f: D \to D$
were a smooth map such that $f(\zeta) = \zeta^k$ for all $\zeta \in S$, for some $k \ge 2$, and $f$ had
no fixed points on $\mathrm{int}(D)$, then $\mathrm{Fix}(f)$ would be finite since $f|S$ has finitely many fixed
points. For each $\pi \in \mathrm{Fix}(f)$ the Boundary Fixed Point Index Theorem tells us that
$i(f, \pi)$ is either 0 or $-1$ so certainly we would have


$$\sum_{p \in \mathrm{Fix}(f)} i(f, p) = \sum_{\pi \in \mathrm{Fix}(f)} i(f, \pi) \le 0.$$


But that would contradict the consequence of Hopf’s Theorem we just used to
prove Brouwer’s Theorem: if $f: D \to D$ has a finite number of fixed points, then


$$\sum_{p \in \mathrm{Fix}(f)} i(f, p) = 1.$$


We didn’t state the Boundary Fixed Point Index Theorem in its most general
form, but rather in a form that made it easy to derive the Interior Fixed Point
Property from it. The Index Theorem concerns a local property of the map $f$, its
index at boundary fixed points, and the proof depends only on the smoothness of $f$
near these fixed points, as we will see. Thus $f$ need not be smooth outside of a
neighborhood of its fixed point set. A modification of the proof can extend the
Boundary Fixed Point Index Theorem to the case where $k < 1$, but the conclusion
in that case is that either $i(f, \pi) = 0$ or $i(f, \pi) = +1$. That conclusion does not
contradict Hopf’s Theorem and, as we discussed above, in fact there is a smooth
extension of $f(\zeta) = \zeta^k$ without interior fixed points, when $k < 1$.


5. PROOF OF THE BOUNDARY FIXED POINT INDEX THEOREM. The fixed
point $\pi \in S$ can be written in polar coordinates $(r, \theta)$ as $(1, \theta_0)$. If we introduce
new coordinates


$$x_1 = \theta - \theta_0, \qquad x_2 = 1 - r,$$


then $\pi$ is the origin 0 in these coordinates and, near $\pi$, the boundary $S$ of $D$
corresponds to the $x_1$-axis and the interior of $D$ is in the upper half-plane: $x_2 > 0$.
Furthermore, since $f$ takes $S$ to itself, $f$ carries this portion of the $x_1$-axis to itself.
Near $\pi$, the map $F$ has the form

$$F(z) = \begin{cases} z - f(z), & \text{if } z = (x_1, x_2) \text{ with } x_2 \ge 0 \\ z - f(\zeta), & \text{if } z = (x_1, x_2) \text{ with } x_2 < 0,\ \zeta = (x_1, 0). \end{cases}$$


Writing $F$ in coordinates as $F = (F_1, F_2)$, we observe that (1) $F(0, 0) = (0, 0)$, (2)
$F_2(x_1, 0) = 0$, that is, $F$ takes the $x_1$-axis to itself, and (3) if $x_2 < 0$, then
$F_2(x_1, x_2) < 0$ also.

Conclusion (2) gives us a map $g$ from the $x_1$-axis to itself defined by $g(x_1) =
F_1(x_1, 0)$. The hypothesis of the Boundary Fixed Point Index Theorem, $f(\zeta) = \zeta^k$
for all $\zeta \in S$, implies that if the map $f$ is restricted to $S$, then in polar coordinates
it has the form $f(1, \theta) = (1, k\theta)$, where, as usual, the polar coordinate angle is
measured modulo $2\pi$. Therefore, in the $(x_1, x_2)$ coordinates, $f(x_1, 0) = (kx_1, 0)$
for $x_1$ near 0. Thus we have

$$g(x_1) = F_1(x_1, 0) = x_1 - kx_1 = (1 - k)x_1.$$

Another hypothesis of the Theorem is that the map $f: D \to D$ is smooth, that
is, has continuous partial derivatives. As we mentioned above, we do not really
require the smoothness of $f$ on all of $D$, but we do need it in a neighborhood of
the fixed point, which is the origin in the new coordinates. The smoothness of $f$
does not make the map $F$ smooth in a neighborhood of the origin, since it is not
smooth at the $x_1$-axis. However, if we let $F^+$ be the restriction of $F$ to the points
$(x_1, x_2)$ near the origin in which $x_2 \ge 0$, then $F^+$ is a smooth map. Since we are
assuming that $k \ge 2$, then


$$\frac{d}{dx_1} F_1^+(x_1, 0) = g'(x_1) = 1 - k < 0$$


and, therefore, the smoothness of $F^+$ implies that

$$\frac{\partial F_1^+(x_1, x_2)}{\partial x_1} < 0$$

for $(x_1, x_2)$ in an $\epsilon$-neighborhood of the origin, for $\epsilon > 0$ sufficiently small, and
$x_2 \ge 0$. Let $\Gamma$ be a circle of radius $\epsilon/2$ about the origin. The negative value of
the partial derivative is what we will need to show that the winding number of
$\gamma = F(\Gamma)$ about 0 is either 0 or $-1$.

Let $\Gamma^+$ and $\Gamma^-$ denote the half-circles above and below the $x_1$-axis, respectively.
(See Figure 4.) By property (3), we know that $F$ maps $\Gamma^-$ to the “lower”
half-plane where $x_2 < 0$. To calculate the fixed point index, that is, the winding
number of $\gamma = F(\Gamma)$ around 0, we need to understand how $F$ behaves on $\Gamma^+$.
Since $x_2 \ge 0$ for $(x_1, x_2) \in \Gamma^+$ the map we are concerned with is $F^+$. We are
assuming that $f$ has only a finite number of fixed points, so we can make $\epsilon$ small
enough so that the only point on the circle $\Gamma$ or its interior that $F$ maps to the
origin is the origin itself. Therefore, we can homotop the restriction of $F^+$ to $\Gamma^+$
in $\mathbf{C} - 0$ to the restriction of $F^+$ to the curve $\Gamma_\delta$, for $\delta > 0$, given by


$$\Gamma_\delta(t) = \bigl[\tfrac{\epsilon}{2}(2t - 1),\ \delta(1 - (1 - 2t)^2)\bigr],$$


where $0 \le t \le 1$. See Figure 4, and notice that for $\delta$ small, $\Gamma_\delta$ is quite flat; it
almost just runs along the $x_1$-axis from $(-(\epsilon/2), 0)$ to $(\epsilon/2, 0)$. The homotopy
avoids 0 so, by homotopy invariance, it will not change the winding number.


Figure 4

We write the restriction of $F^+$ to $\Gamma_\delta$ in coordinates as


$$F^+(\Gamma_\delta(t)) = \bigl(F_1^+(\Gamma_\delta(t)),\ F_2^+(\Gamma_\delta(t))\bigr) = (\phi_\delta(t), \psi_\delta(t)).$$


We claim that for $\delta$ sufficiently small,


$$\frac{d}{dt}\phi_\delta(t) < 0$$


for all $t$. The intuition is that for $\delta$ small, the map $\phi_\delta$, the restriction of $F_1^+$ to the
“flat” arc $\Gamma_\delta$, is very close to $g$, the restriction of $F_1^+$ to the $x_1$-axis, and since we
calculated that $g'(x_1) = 1 - k < 0$, the derivative of $\phi_\delta$ should also be negative
since $F^+$ is smooth: its partial derivatives cannot exhibit abrupt changes. A
rigorous proof that the derivative of $\phi_\delta$ is negative can be obtained by a careful
application of the chain rule. We furnish this proof as an Appendix to the paper.

Once we accept the claim that $\phi_\delta$ has a strictly negative derivative, what this
tells us about the curve $F^+(\Gamma_\delta(t)) = (\phi_\delta(t), \psi_\delta(t))$ is that the $x_1$-coordinate is a
strictly monotone function of $t$, as we indicate in Figure 5. So, in particular, the
curve intersects the $x_2$-axis at just one value, that we denote by $x_2^*$. As in Figure
5, we can homotop the image of $F(\Gamma_\delta(t))$ to the union of line segments


$$[F(\Gamma_\delta(1)), (0, x_2^*)] \cup [(0, x_2^*), F(\Gamma_\delta(0))]$$


by vertical moves, and thus the homotopy avoids the origin. If, as in Figure 5a,
$x_2^* < 0$, then $F$ takes $\Gamma^- \cup \Gamma_\delta$ entirely into the lower half-plane and the winding
number is clearly zero. If $x_2^* > 0$ as in Figure 5b, then the winding number of $F$
on $\Gamma^- \cup \Gamma_\delta$, and thus on $\Gamma$, is $-1$, which completes the proof of the theorem.


Figure 5. Panels (a) and (b): the curve $F(\Gamma_\delta(t))$ in the $(x_1, x_2)$-plane, running from $F(\Gamma_\delta(0))$ to $F(\Gamma_\delta(1))$.


6. BEYOND THE DISC. The Boundary Fixed Point Index Theorem is only 
concerned with the behavior of the map near its boundary fixed points, yet it 
implies the Interior Fixed Point Property, a statement about the behavior of a map 
throughout the interior of D. The fact about the disc that we depended on in 
Section 4 to prove the Property was the contractibility of the disc, which makes all 
its self-maps homotopic. The disc is the only surface (with or without boundary) 
that is contractible. (See [4].) Furthermore, the disc and the projective plane are 
the only surfaces with the fixed point property; on any other surface there is a map 
that has no fixed points. Thus, in general, we would not necessarily expect other 
surfaces to share properties of the disc. 

If $M$ is a surface with boundary, its boundary consists of circles and we can
define a self-map $f_C$ of each boundary component $C$ using the map of $S$ defined
by $f(\zeta) = \zeta^k$, by letting $h: C \to S$ be a homeomorphism and setting $f_C = h^{-1}fh$.
An Interior Fixed Point Property on $M$ would be the property that a smooth
self-map of $M$ that has some combination of boundary maps of this type must have
fixed points in the interior of $M$. But we don’t know of any surface other than the
disc that has such a property. For instance, suppose $M$ is the annulus, which we
can view as complex numbers $z$ such that $1 \le |z| \le 2$ with boundary components
$S$, the unit circle, and $2S$, the circle of radius 2. It is convenient to think of a point
of the annulus as $z = t\zeta$ where $\zeta \in S$ and $1 \le t \le 2$. Self-maps of the boundary
components that are restrictions of a self-map of the annulus are homotopic and
therefore a classical result requires them to be of the same degree which, in our
terms, means that we must use the same $k$ in $f(\zeta) = \zeta^k$ for each of $f_S$ and $f_{2S}$. A
smooth map that extends these maps of the boundary and has no fixed points on
the interior of the annulus can be defined by sending $t\zeta$ to $(1 + (t - 1)^2)\zeta^k$, so the
annulus has no Interior Fixed Point Property.
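
A short check of the last claim may be helpful. It assumes the exponent on $(t - 1)$ in the formula is 2, so that the map sends $t\zeta$ to $(1 + (t-1)^2)\zeta^k$ with $|\zeta| = 1$ and $1 \le t \le 2$:

```latex
\begin{align*}
t\zeta = \bigl(1 + (t-1)^2\bigr)\zeta^k
  &\;\Longrightarrow\; t = 1 + (t-1)^2
     && \text{(take absolute values, using } |\zeta| = 1\text{)} \\
  &\;\Longrightarrow\; (t-1)(t-2) = 0 .
\end{align*}
```

So a fixed point can occur only when $t = 1$ or $t = 2$, that is, on one of the two boundary circles. Since $1 \le 1 + (t-1)^2 \le 2$ for $1 \le t \le 2$, the map does take the annulus to itself, and it restricts to $\zeta \mapsto \zeta^k$ on $S$ and to $2\zeta \mapsto 2\zeta^k$ on $2S$.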

On the other hand, when the Boundary Fixed Point Index Theorem is general- 
ized to manifolds of dimension greater than 2, there are further global conse- 
quences. An interested reader can learn more about this from [1] and [5]. 


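APPENDIX. We will prove that

$$\frac{d}{dt}\phi_\delta(t) < 0.$$

Recalling that $\phi_\delta(t) = F_1^+(\Gamma_\delta(t))$, where


$$\Gamma_\delta(t) = \bigl[\tfrac{\epsilon}{2}(2t - 1),\ \delta(1 - (1 - 2t)^2)\bigr],$$

we apply the chain rule:

$$\frac{d}{dt}\phi_\delta(t)
= \left.\frac{\partial F_1^+}{\partial x_1}\right|_{\Gamma_\delta(t)} \frac{d}{dt}\Bigl[\tfrac{\epsilon}{2}(2t - 1)\Bigr]
+ \left.\frac{\partial F_1^+}{\partial x_2}\right|_{\Gamma_\delta(t)} \frac{d}{dt}\Bigl[\delta(1 - (1 - 2t)^2)\Bigr]
= \left.\frac{\partial F_1^+}{\partial x_1}\right|_{\Gamma_\delta(t)} \epsilon
+ \delta \left.\frac{\partial F_1^+}{\partial x_2}\right|_{\Gamma_\delta(t)} (4 - 8t).$$

We can choose $\eta > 0$ small enough so that the closed bounded set

$$\{\Gamma_\delta(t) : 0 \le \delta \le \eta,\ 0 \le t \le 1\}$$

lies within the neighborhood of radius $\epsilon$ about the origin. Then, since we proved in
Section 5 that the continuous function $\partial F_1^+/\partial x_1$ is negative on that neighborhood,
in the upper half-plane, it follows that there exists $A < 0$ so that

$$\left.\frac{\partial F_1^+}{\partial x_1}\right|_{\Gamma_\delta(t)} \epsilon < A$$

for all $0 \le \delta \le \eta$, $0 \le t \le 1$. On the other hand, since $\partial F_1^+/\partial x_2$ is also continuous,
there exists $B > 0$ such that

$$\left|\left.\frac{\partial F_1^+}{\partial x_2}\right|_{\Gamma_\delta(t)}\right| < B$$

for all $0 \le \delta \le \eta$, $0 \le t \le 1$. Therefore, by choosing $\delta$ small enough so that

$$\left|\delta \left.\frac{\partial F_1^+}{\partial x_2}\right|_{\Gamma_\delta(t)} (4 - 8t)\right| \le 4\delta B < \frac{|A|}{2}$$

we have shown that

$$\frac{d}{dt}\phi_\delta(t) < \frac{A}{2} < 0$$

for that choice of $\delta$ and all $t$, as we claimed.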
Le Cam’s Inequality and Poisson
Approximations 


J. Michael Steele 


1. INTRODUCTION. For the sum $S_n$ of $n$ independent, non-identically distributed
Bernoulli random variables $X_i$ with $P(X_i = 1) = p_i$, Le Cam [20] established
the remarkable inequality


$$\sum_{k=0}^{\infty} \left| P(S_n = k) - e^{-\lambda}\lambda^k/k! \right| < 2\sum_{i=1}^{n} p_i^2, \tag{1.1}$$

where $\lambda = p_1 + p_2 + \cdots + p_n$.

Naturally, this inequality contains the classical Poisson limit law (just set
$p_i = \lambda/n$ and note that the right side simplifies to $2\lambda^2/n$), but it also achieves a
great deal more. In particular, Le Cam’s inequality identifies the sum of the
squares of the $p_i$ as a quantity governing the quality of the Poisson approximation.

Le Cam’s inequality also seems to be one of those facts that repeatedly calls to 
be proved—and improved. Almost before the ink was dry on Le Cam’s 1960 paper, 
an elementary proof was given by Hodges and Le Cam [18]. This proof was 
followed by numerous generalizations and refinements including contributions by 
Kerstan [19], Franken [15], Vervaat [30], Galambos [17], Freedman [16], Serfling
[24], and Chen [11, 12]. In fact, for raw simplicity it is hard to find a better proof of 
Le Cam’s inequality than that given in the survey of Serfling [25]. 

One purpose of this note is to provide a proof of Le Cam’s inequality using 
some basic facts from matrix analysis. This proof is simple, but simplicity is not its 
raison d’etre. It also serves as a concrete introduction to the semi-group method 
for approximation of probability distributions. This method was used in Le Cam 
[20], and it has been used again most recently by Deheuvels and Pfeifer [13] to 
provide impressively precise results. 

The semi-group method is elegant and powerful, but it faces tough competition, 
especially from the coupling method and the Chen-Stein method. The literature 
of these methods is reviewed, and it is shown how they also lead to proofs of 
Le Cam’s inequality. 


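2. MATRIX PROOF OF LE CAM’S INEQUALITY. If one is charged with the task
of producing matrices that might help in understanding the distribution of the sum
of $n$ independent non-identically distributed Bernoulli random variables, a little
time and thought is likely to lead to $n$ matrices $P_i$ like the $N \times N$ matrix


$$P_i = \begin{pmatrix}
1 - p_i & p_i & 0 & \cdots & 0 & 0 \\
0 & 1 - p_i & p_i & \cdots & 0 & 0 \\
0 & 0 & 1 - p_i & \cdots & 0 & 0 \\
\vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\
0 & 0 & 0 & \cdots & 1 - p_i & p_i \\
0 & 0 & 0 & \cdots & 0 & 1 - p_i
\end{pmatrix}. \tag{2.1}$$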
This is almost a Markov transition matrix, except of course the last row of $P_i$
does not sum to 1. A benefit of this choice of the $P_i$ is that they can be written as


$$P_i = (1 - p_i)I + p_i R,$$


where $I$ is the $N \times N$ identity matrix and $R$ is the $N \times N$ matrix with 1’s on the
first superdiagonal and 0’s elsewhere. Since each of the $P_i$ is just a linear
combination of $I$ and $R$, any pair of the $P_i$ commute, and because the matrices $P_i$
are so much like Markov transition matrices, their analysis is still reminiscent of
the elementary theory of Markov chains.

In fact, by the usual considerations that attend the multiplication of Markov
matrices, you can quickly convince yourself that for $n < N$ the top row of
the $n$-fold matrix product $P_1 P_2 P_3 \cdots P_n$ is given by $(P(S_n = 0), P(S_n = 1),
P(S_n = 2), \ldots, P(S_n = n), 0, 0, \ldots, 0)$, i.e. the first $n + 1$ elements of the top row
of $P_1 P_2 \cdots P_n$ correspond precisely to the Bernoulli sum probabilities that we wish
to estimate. Also at this point, it may be good to be reminded that $N$ is arbitrary
except for the constraint $n < N$, so the padded 0’s can go on as far as we like.
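
The claim about the top row is easy to test on a small case. The sketch below is illustrative only: the probabilities, the size $N = 6$, and the helper names are assumptions made for the example, and plain nested lists stand in for matrices. It multiplies three matrices of the form (2.1) and compares the top row of the product with the distribution of the Bernoulli sum computed directly by convolution.

```python
from typing import List

Matrix = List[List[float]]


def transition_like_matrix(p: float, size: int) -> Matrix:
    """Matrix of the form (2.1): 1 - p on the diagonal, p on the first superdiagonal."""
    return [[(1 - p) if j == i else (p if j == i + 1 else 0.0)
             for j in range(size)]
            for i in range(size)]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Product of two square matrices stored as nested lists."""
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def bernoulli_sum_pmf(ps: List[float]) -> List[float]:
    """Distribution of a sum of independent Bernoulli(p_i), by repeated convolution."""
    pmf = [1.0]
    for p in ps:
        nxt = [0.0] * (len(pmf) + 1)
        for k, mass in enumerate(pmf):
            nxt[k] += mass * (1 - p)
            nxt[k + 1] += mass * p
        pmf = nxt
    return pmf


ps = [0.1, 0.25, 0.4]   # hypothetical success probabilities, n = 3
N = 6                   # any N > n will do
product = transition_like_matrix(ps[0], N)
for p in ps[1:]:
    product = matmul(product, transition_like_matrix(p, N))

top_row = product[0]
pmf = bernoulli_sum_pmf(ps)
```

The first $n + 1 = 4$ entries of `top_row` agree with `pmf`, and the remaining entries are the padded zeros.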

So far, we have found a matrix that helps us understand the Bernoulli sum
probabilities $P(S_n = k)$, and now we would like to find a matrix that is intimately
connected with the Poisson distribution. Given some past experience with calculating
matrix functions using the Jordan normal form, one can easily find candidates,
but knowledge of Jordan forms is not required. One just needs to compute the
exponential of $P_i$, or, better yet, compute the exponential of a simpler matrix
closely connected with $P_i$.

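When we write $P_i = I + Q_i$, we see $Q_i$ has the pleasing form,


$$Q_i = \begin{pmatrix}
-p_i & p_i & 0 & \cdots & 0 & 0 \\
0 & -p_i & p_i & \cdots & 0 & 0 \\
0 & 0 & -p_i & \cdots & 0 & 0 \\
\vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\
0 & 0 & 0 & \cdots & -p_i & p_i \\
0 & 0 & 0 & \cdots & 0 & -p_i
\end{pmatrix} = -p_i I + p_i R, \tag{2.2}$$


and the Poisson distribution emerges clearly when we compute $\exp Q_i$:


$$\exp Q_i = \sum_{k=0}^{\infty} Q_i^k/k! = \begin{pmatrix}
e^{-p_i} & p_i e^{-p_i} & \cdots & \cdots & e^{-p_i} p_i^{N-1}/(N-1)! \\
0 & e^{-p_i} & p_i e^{-p_i} & \cdots & \vdots \\
\vdots & & \ddots & \ddots & \vdots \\
0 & 0 & 0 & e^{-p_i} & p_i e^{-p_i} \\
0 & 0 & 0 & 0 & e^{-p_i}
\end{pmatrix}
= \sum_{r=0}^{N-1} p_i^r e^{-p_i} R^r / r!. \tag{2.3}$$


Note that the $Q_i$ commute, so $\exp(Q_i)\exp(Q_j) = \exp(Q_i + Q_j)$, and, in detail,

$$\prod_{i=1}^{n} \exp(Q_i) = \exp\Bigl(\sum_{i=1}^{n} Q_i\Bigr) = \exp(-\lambda I + \lambda R), \tag{2.4}$$

where $\lambda = p_1 + p_2 + \cdots + p_n$.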
The essence of the method is now fully revealed, and we see the proof of
Le Cam’s inequality boils down to comparing the top rows of $\prod_{i=1}^{n} \exp Q_i$ and
$\prod_{i=1}^{n} P_i$. This can be achieved most systematically by introducing matrix norms.

If $A = (a_{ij})$ is any matrix, we set


$$\|A\| = \max_i \sum_j |a_{ij}|. \tag{2.5}$$


This recipe provides a bona fide matrix norm, and, in particular, one can easily
check the relations $\|AB\| \le \|A\|\,\|B\|$, $\|A + B\| \le \|A\| + \|B\|$, and $\|cA\| = c\|A\|$
for $c \ge 0$.

It is also easy to use the explicit formulas for $P_i$, $Q_i$ and $\exp Q_i$ to compute their
norms: $\|P_i\| = 1$, $\|Q_i\| = 2p_i$, and $\|\exp Q_i\| \le 1$. When we compare $\prod P_i$ and
$\prod \exp Q_i$ using the norm defined by (2.5) we see that the top row attains the
maximum, so we have the basic relation,


$$\sum_{k=0}^{N-1} \left| P(S_n = k) - e^{-\lambda}\lambda^k/k! \right| = \Bigl\| \prod_{i=1}^{n} P_i - \prod_{i=1}^{n} \exp Q_i \Bigr\|. \tag{2.6}$$


Next, it is easy to check that

$$P_1 \cdots P_n - \exp Q_1 \cdots \exp Q_n = (P_1 - \exp Q_1)(P_2 \cdots P_n)
- (\exp Q_1)(\exp Q_2 \cdots \exp Q_n - P_2 \cdots P_n). \tag{2.7}$$


This identity virtually completes the proof. We just take norms, use the facts that
$\|P_2 \cdots P_n\| \le 1$ and $\|\exp Q_1\| \le 1$, then repeat the process on the remaining
$(n - 1)$-fold product to obtain


$$\|P_1 \cdots P_n - \exp Q_1 \cdots \exp Q_n\| \le \|P_1 - \exp Q_1\|
+ \|\exp Q_2 \cdots \exp Q_n - P_2 \cdots P_n\|
\le \sum_{i=1}^{n} \|P_i - \exp Q_i\|. \tag{2.8}$$


How should we bound $\|P_i - \exp Q_i\|$? Since $\exp Q_i$ is defined by the expansion
for $e^x$, we naturally look to Taylor’s formula, but we should be careful enough to
consider a finite expansion with a remainder term. For any smooth function $f$ we
have


$$f(1) = f(0) + f'(0) + \int_0^1 (1 - u) f''(u)\,du, \tag{2.9}$$


so, if we let $f(t) = e^{tQ}$, the derivative calculations $f(0) = I$, $f'(0) = Q$ and
$f''(u) = Q^2 e^{uQ}$ yield


$$e^Q = I + Q + \int_0^1 (1 - u) Q^2 e^{uQ}\,du. \tag{2.10}$$


Even for functions of matrices, integrals are just the limits of sums, so taking
norms inside an integral only makes it larger, i.e. for any $g(u, v)$ we have
$\|\int g(u, Q)\,du\| \le \int \|g(u, Q)\|\,du$. Also, just as we computed $e^{Q_i}$ explicitly in order to
bound its norm, we can compute $e^{uQ_i}$ explicitly to find $\|e^{uQ_i}\| \le 1$. Applying these
observations to the Taylor representation (2.10), we find


$$\|P_i - e^{Q_i}\| \le \Bigl\| \int_0^1 (1 - u) Q_i^2 e^{uQ_i}\,du \Bigr\| \le \tfrac{1}{2}\|Q_i^2\|. \tag{2.11}$$
This is just the tool needed to bound the right side of (2.8). Stringing together the
identity (2.6) with inequalities (2.8) and (2.11), we find


$$\sum_{k=0}^{N-1} \left| P(S_n = k) - e^{-\lambda}\lambda^k/k! \right| \le \tfrac{1}{2}\sum_{i=1}^{n} \|Q_i^2\| \le 2\sum_{i=1}^{n} p_i^2, \tag{2.12}$$


where the last inequality depended on $\|Q_i^2\| \le \|Q_i\|^2$ and our earlier explicit
calculation that $\|Q_i\| = 2p_i$. Since $n < N$ is our only restriction on $N$, we can let
$N \to \infty$ to obtain Le Cam’s inequality.
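
A direct numerical comparison gives a feel for how much room the bound leaves. The sketch below is only an illustration: the probability vectors are made up, the helper names are hypothetical, and the infinite sum is truncated after 60 terms, so the computed left side is a slight underestimate of the true one.

```python
import math
from typing import List, Tuple


def bernoulli_sum_pmf(ps: List[float]) -> List[float]:
    """Distribution of a sum of independent Bernoulli(p_i), by repeated convolution."""
    pmf = [1.0]
    for p in ps:
        nxt = [0.0] * (len(pmf) + 1)
        for k, mass in enumerate(pmf):
            nxt[k] += mass * (1 - p)
            nxt[k + 1] += mass * p
        pmf = nxt
    return pmf


def poisson_pmf(lam: float, k: int) -> float:
    """Poisson probability e^{-lam} lam^k / k!."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def le_cam_sides(ps: List[float], terms: int = 60) -> Tuple[float, float]:
    """Truncated left side of Le Cam's inequality and its right side 2 * sum p_i^2."""
    lam = sum(ps)
    pmf = bernoulli_sum_pmf(ps)
    lhs = 0.0
    for k in range(terms):
        bern = pmf[k] if k < len(pmf) else 0.0
        lhs += abs(bern - poisson_pmf(lam, k))
    rhs = 2 * sum(p * p for p in ps)
    return lhs, rhs


lhs_mixed, rhs_mixed = le_cam_sides([0.1, 0.25, 0.4])
lhs_equal, rhs_equal = le_cam_sides([0.05] * 20)
```

In both cases the computed left side is positive and does not exceed the right side.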


3. THE SEMI-GROUP METHOD HAS VIGOROUS COMPETITORS. Deheuvels
and Pfeifer [13] provide a version of Le Cam’s inequality that, in an asymptotic
sense, has a solid claim on being the last word:


$$\sum_{k=0}^{\infty} \left| P(S_n = k) - e^{-\lambda}\lambda^k/k! \right| \sim (2/\pi e)^{1/2} \Bigl(\sum_{i=1}^{n} p_i^2\Bigr)\Bigl(\sum_{i=1}^{n} p_i\Bigr)^{-1} \tag{3.1}$$

provided $\sum_{i=1}^{n} p_i \to \infty$ and $\max(p_1, p_2, \ldots, p_n) \to 0$ as $n \to \infty$. The essential ideas
behind the proof of (3.1) have been seen in Section 2 in a basic form: one obtains
an interpretation of the Bernoulli sum probabilities, introduces a semi-group (like
$\exp(tQ)$), finds ways to bound an approximation (like $e^Q - I - Q$), and deals with
the difference of two $n$-fold products. Variations on this pattern are visible in
Le Cam [20], Shur [26], and one even can see similar steps in Feller’s exposition of
Trotter’s proof of Lindeberg’s central limit theorem.

Through the explicit matrix exponentiation calculations used here, the semi- 
group method can be seen to be friendly as well as useful. Continued exploration 
of the method is likely to lead to deep and interesting results, but the semi-group 
method should not be oversold. There are competitors with considerable power. 

The characteristic function method also has a role in Poisson approximations. In 
particular, the characteristic function method has been used by Rusenko [23] to 
obtain rates of approximation results for repeated samples taken without replace- 
ment, by Presman [22] to obtain refinements of Le Cam’s inequality, and by 
Yakshyavicius [32] to provide an inequality like Le Cam’s that is pertinent to 
classes of discrete distributions other than Bernoulli sums. Despite this litany, 
applications of the characteristic function method are infrequent in Poisson 
approximation, and it probably does not rank among the big three: the semi-group 
method, coupling, and the Chen-Stein method. Enough has been said about the 
first of these, and it is important to provide some sense of the promise inherent in 
the other two. 


4. THE COUPLING METHOD. Because of its simplicity, the coupling method
deserves to be reviewed first. For any two random variables $X$ and $Y$, we begin by
defining their variation distance by


$$d(X, Y) = \sup_A |P(X \in A) - P(Y \in A)|. \tag{4.1}$$

For random variables that take values in $\mathbf{Z}^+$, the metric $d(\cdot, \cdot)$ has an easily
proved alternative expression (cf. Serfling [25], p. 569) that reveals its relevance to
Le Cam’s inequality:

$$d(X, Y) = \tfrac{1}{2}\sum_{k=0}^{\infty} |P(X = k) - P(Y = k)|. \tag{4.2}$$
The coupling method is based on the simple observation that for random variables 
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X and Y defined on the same probability space, one has 
d(X,Y)<P(X #Y). (4.3) 


A second observation that helps the coupling method work well with sums is that 
for S, = U7_,X; and S* = u7_.Y, one has 


d(Sy» Sz < Dax A) (4.4) 


From (4.2), (4.3), and (4.4) we see that a good plan for proving Le Cam’s inequality 
consists of building n bivariate couples $Z_i = (X_i, Y_i)$ such that the $Z_i$ are independent, 
$X_i$ is Bernoulli with parameter $p_i$, $Y_i$ is Poisson with parameter $\lambda_i = p_i$, and 
$P(X_i \ne Y_i)$ is as small as possible. This plan has been successfully pursued in 
Hodges and Le Cam [18], Freedman [16], Serfling [24], Brown [10], Ahmad [1], and 
Wang [31]. In fact, the couplings of Serfling, Brown, and Wang all satisfy 
$$d(X_i, Y_i) = P(X_i \ne Y_i) = p_i(1 - e^{-p_i}), \tag{4.5}$$
from which Le Cam’s inequality (1.1) follows easily. As it happens, there is no 
difficulty in constructing variables that satisfy (4.5): just think how to simulate $X_i$ 
and $Y_i$ simultaneously using a single uniformly distributed random variable. 
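
One way to carry out the simulation suggested above can be sketched in Python. This is only an illustration under stated assumptions: the function name `coupled_pair` and the particular way the unit interval is cut up are choices made here, not the constructions of Serfling, Brown, or Wang. A single uniform value u determines both a Bernoulli(p) variable X and a Poisson(p) variable Y, and the two disagree on a set of u of total length $p(1 - e^{-p})$.

```python
import math

def coupled_pair(p, u):
    """Return (x, y) built from one uniform value u in [0, 1).

    x is Bernoulli(p) and y is Poisson(p); the partition of [0, 1) below is
    an illustrative choice that makes P(x != y) = p * (1 - exp(-p)).
    """
    e = math.exp(-p)
    x = 0 if u < 1 - p else 1
    if u < 1 - p:                 # length 1 - p: x = 0 and y = 0 agree
        y = 0
    elif u < 1 - p + p * e:       # length p * e^{-p}: x = 1 and y = 1 agree
        y = 1
    elif u < e + p * e:           # length e^{-p} - (1 - p): remaining mass of {y = 0}
        y = 0
    else:                         # remaining length: y >= 2 by inverting the Poisson cdf
        k = 1
        term = p * e              # P(Y = 1)
        cdf = e + p * e           # P(Y <= 1)
        while u >= cdf and term > 0.0:
            k += 1
            term *= p / k         # P(Y = k)
            cdf += term
        y = k
    return x, y
```

Evaluating `coupled_pair(p, u)` on a fine grid of u values and counting the disagreements gives a quick numerical check of (4.5).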


5. THE CHEN-STEIN METHOD. The Chen-Stein method may be the most 
powerful method for obtaining Poisson approximations, and it is often as easy to 
use as the coupling or semi-group methods, even though it may be more subtle 
conceptually. If one does not stop for motivation, one can say that the Chen-Stein 
method is based on the fact that for each $\lambda > 0$ and $A \subset \mathbb{Z}^+$ there is a function 
$x = x_{\lambda,A}: \mathbb{Z}^+ \to \mathbb{R}$ such that for any non-negative integer-valued random variable T 
one has the identity: 
$$E\{\lambda x(T+1) - T x(T)\} = P(T \in A) - \sum_{k \in A} e^{-\lambda}\lambda^k/k!. \tag{5.1}$$


Actually, the left-hand side of (5.1) is a natural quantity to consider in the context 
of Poisson approximation, since by summation by parts one can check that 
$E\lambda f(T+1) = E T f(T)$ for any f, provided T is Poisson with parameter $\lambda$. The 
identity (5.1) was first developed by Chen [11], and some of its mystery can be 
removed by studying an analogous identity used by Stein [27] in the context of 
normal approximations. While it is a good exercise to solve (5.1) for x, all one 
really needs to know about x is that it is bounded and changes slowly. In 
particular, Barbour and Eagleson [5] sharpened earlier bounds of Chen [12] and 
showed that for all A and $\lambda > 0$: 
$$\|x\| \le \min(1, 4\lambda^{-1/2}) \tag{5.2}$$
and 
$$\Delta x = \sup_{m \ge 0} |x(m+1) - x(m)| \le \lambda^{-1}(1 - e^{-\lambda}). \tag{5.3}$$
From these bounds it is easy to prove, and even sharpen, Le Cam’s basic 
inequality (1.1). If we write $W = S_n$, $W_j = S_n - X_j$, $\lambda = p_1 + p_2 + \cdots + p_n$, and 
$q_j = 1 - p_j$, we can follow Chen [12] and obtain a second identity that together 
with (5.1) gives one virtually complete information about the Poisson approximation. 
We evaluate the left side of (5.1) as follows: 

$$\begin{aligned}
E\{\lambda x(W+1) - W x(W)\} &= \sum_{j=1}^{n} E\{p_j x(W+1) - X_j x(W)\} \\
&= \sum_{j=1}^{n} p_j E\{x(W+1) - x(W_j+1)\} \\
&= \sum_{j=1}^{n} p_j \{p_j E x(W_j+2) + q_j E x(W_j+1) - E x(W_j+1)\} \\
&= \sum_{j=1}^{n} p_j^2 E\{x(W_j+2) - x(W_j+1)\}.
\end{aligned} \tag{5.4}$$

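From the Chen-Stein identity (5.1) and the Barbour-Eagleson bound on $\Delta x$, we 
see that (5.4) gives 
$$\sup_{A}\Big| P(S_n \in A) - e^{-\lambda}\sum_{k \in A} \lambda^k/k! \Big| \le \lambda^{-1}(1 - e^{-\lambda}) \sum_{j=1}^{n} p_j^2. \tag{5.5}$$

Since $\lambda^{-1}(1 - e^{-\lambda}) < 1$, the identity (4.2) shows that inequality (5.5) is sharper 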
than (1.1). Obviously, the Chen-Stein method is very powerful, though it is only 
now beginning to be well understood. A richer understanding of the method can 
be obtained by studying Arratia, Goldstein, and Gordon [2], Barbour [3], Barbour 
and Eagleson [5, 6, 7], Barbour and Hall [8], Barbour [4], and, of course, Stein [28]. 
A definitive study of Stein’s method and its application to Poisson approximation 
has recently been given in the volume by Barbour, Holst, and Janson [9]. 
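
For small n the bound (5.5) can be compared with the exact variation distance by direct computation. The following Python sketch is not taken from any of the cited works; the function names are illustrative assumptions. It builds the law of $S_n$ by convolution, evaluates the distance through (4.2), and returns it alongside the right-hand side of (5.5) and the cruder quantity $\sum p_j^2$.

```python
import math

def bernoulli_sum_pmf(ps):
    """Exact law of S_n = X_1 + ... + X_n for independent Bernoulli(p_j)."""
    pmf = [1.0]
    for p in ps:
        nxt = [0.0] * (len(pmf) + 1)
        for k, mass in enumerate(pmf):
            nxt[k] += mass * (1 - p)      # X_j = 0
            nxt[k + 1] += mass * p        # X_j = 1
        pmf = nxt
    return pmf

def variation_distance_to_poisson(ps):
    """d(S_n, Poisson(lambda)) computed from (4.2), with lambda = sum of the p_j."""
    lam = sum(ps)
    s_pmf = bernoulli_sum_pmf(ps)
    poisson = [math.exp(-lam)]
    for k in range(1, len(s_pmf)):
        poisson.append(poisson[-1] * lam / k)
    l1 = sum(abs(a - b) for a, b in zip(s_pmf, poisson))
    tail = max(0.0, 1.0 - sum(poisson))   # Poisson mass beyond n, where S_n has none
    return 0.5 * (l1 + tail)

def poisson_bounds(ps):
    """Return (exact distance, right side of (5.5), sum of p_j^2)."""
    lam = sum(ps)
    sum_sq = sum(p * p for p in ps)
    chen_stein = (1 - math.exp(-lam)) / lam * sum_sq
    return variation_distance_to_poisson(ps), chen_stein, sum_sq
```

For a single Bernoulli variable the first two returned values coincide up to rounding, which matches (4.5).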


6. CONCLUSION. Le Cam’s inequality provides information on the quality of the 
Poisson approximation, but it also serves as a talisman that is able to charm 
concrete insights from general techniques. This survey relied on that second 
service to illustrate the semi-group method, coupling, and the Chen-Stein method. 
In the course of these illustrations, it has also been possible to survey most of the 
work on Poisson approximation since the review of Serfling [25], except for the 
cascade of work coming from the more refined developments of the Chen-Stein 
method that are dealt with in detail in the monograph of Barbour, Holst, and 
Janson [9]. 

Edited by: John Duncan 


A Generalization of a Theorem of Euler 


Dorina Mitrea and Marius Mitrea 


One of the most important results of pure incidence in geometry is the so-called 
“two-triangle” theorem discovered by the French architect Girard Desargues 
(1591-1661) asserting that two triangles are perspective from a point if and only if 
they are perspective from a line. 

Recall that two triangles, with their vertices named in a particular order, are 
said to be perspective from a point, if their three pairs of corresponding vertices are 
joined by concurrent lines. The intersection point is called the perspective center of 
these triangles (see e.g. [CG]). Alternatively, two triangles are said to be perspective 
from a line if their three pairs of corresponding sides meet in collinear points. The 
line determined by these points is called the perspective axis. 

In practice, however, we often encounter pairs of perspective triangles 
with supplementary properties, and we shall pay special attention to those pairs 
that are also orthologic. 

In precise terms, we say that the triangles $A_1B_1C_1$ and $A_2B_2C_2$ are orthologic, 
and we write $\Delta A_1B_1C_1 \sim \Delta A_2B_2C_2$, if the perpendiculars from the vertices 
$A_1, B_1, C_1$ on the sides $B_2C_2$, $A_2C_2$, $A_2B_2$ pass through one point, called the 
orthology center of $\Delta A_1B_1C_1$ with respect to $\Delta A_2B_2C_2$. 

The symbol $\Delta_1 \sim \Delta_2$ was intended to suggest, and we invite the reader to 
actually prove (by means of the ideas developed in the second part of the paper or 
otherwise), that orthology is indeed an equivalence relation on the set of all 
triangles in the Euclidean plane. 

Actually, any two arbitrary triangles can be translated and rotated in the plane 
up to a certain position in which they simultaneously are perspective from a point 
and orthologic (prove it!), but here are some perhaps more familiar examples of 
pairs of triangles with these two properties: 


(1) An arbitrary triangle together with its median triangle (i.e. the triangle whose 
vertices are the midpoints of its sides). In this case, the two orthology 
centers are O and H, the circumcenter and the orthocenter of the larger 
triangle, respectively. The perspective point is the centroid of the bigger 
triangle, while the perspective axis is improper. 

(2) An arbitrary triangle together with its orthic triangle (i.e. the triangle whose 
vertices are the feet of the altitudes in the initial one). In this situation, the 
two orthology centers are H and O, while the perspective center is H. 
Their perspective axis is called the orthic axis of the larger triangle. 

(3) An arbitrary triangle and the triangle determined by the contact points of its 
sides with its incircle, or together with the triangle determined by the contact 
points of its sides with its three excircles, or even with the triangle whose 
vertices are its three excenters. 

(4) An arbitrary triangle together with its interior, or exterior Napoleon triangle 
(i.e. the triangle determined by the centers of the equilateral triangles 
erected internally, and externally respectively, on the sides of the initial 
triangle). 


It is a celebrated result of Leonhard Euler (1707-1783) that in any triangle the 
orthocenter, the centroid and the circumcenter lie on a line, called the Euler line of 
that triangle. If we want to emphasize the perspective and orthologic relationships, 
we can restate this result by saying that the orthology centers and the perspective 
center of an arbitrary triangle with respect to its median triangle lie on a line. 

In the light of this observation, the following theorem can be thought of as a 
generalization of Euler’s result. 


Theorem. If two triangles are orthologic and perspective from a point, then the two 
orthology centers and the perspective center lie on a line which is perpendicular to 
their perspective axis (if this exists). 


We have already noticed that Euler’s configuration occurs precisely for the 
example (1) above but if we use the second part of our theorem for the example (2) 
above we obtain even more, namely that the Euler line of a triangle is perpendicular 
to its orthic axis. There are many other interesting applications of this theorem and 
we invite the interested reader to supply new examples. 

The rest of the paper is devoted to proving the main theorem. A basic 
ingredient in our proof will be the following lemma. 


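Lemma. $\Delta A_1B_1C_1 \sim \Delta A_2B_2C_2$ if and only if 
$$\overrightarrow{MA_1}\cdot\overrightarrow{B_2C_2} + \overrightarrow{MB_1}\cdot\overrightarrow{C_2A_2} + \overrightarrow{MC_1}\cdot\overrightarrow{A_2B_2} = 0, \tag{1}$$
for any point M in the plane. 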


It is easy to see that the left-hand side of (1) is actually independent of M, so 
that in practice we shall verify (1) for only one point. In what follows, we shall use 
this remark several times. 

The proof of this lemma is very simple. On one hand, if the two triangles are 
orthologic, then (1) is clearly satisfied by choosing M to be the orthology center of 
$\Delta A_1B_1C_1$ with respect to $\Delta A_2B_2C_2$. On the other hand, if (1) holds for any point 
in the plane, then for the particular choice of M as the intersection point of the 
perpendiculars dropped from $A_1$ and $B_1$ onto $B_2C_2$ and $C_2A_2$, respectively, we 
get that $\overrightarrow{MC_1}\cdot\overrightarrow{A_2B_2} = 0$. Thus $MC_1 \perp A_2B_2$, so that $\Delta A_1B_1C_1 \sim \Delta A_2B_2C_2$. 
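
The lemma and the remark about independence of M are easy to test numerically. The Python sketch below is only an illustration; the helper names are assumptions made here. It evaluates the left-hand side of (1) for two triangles given by coordinates, so that a pair can be tested for orthology at a single point M.

```python
def vec(p, q):
    """Vector from point p to point q."""
    return (q[0] - p[0], q[1] - p[1])

def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

def orthology_sum(M, T1, T2):
    """Left-hand side of (1) for T1 = (A1, B1, C1) and T2 = (A2, B2, C2)."""
    A1, B1, C1 = T1
    A2, B2, C2 = T2
    return (dot(vec(M, A1), vec(B2, C2))
            + dot(vec(M, B1), vec(C2, A2))
            + dot(vec(M, C1), vec(A2, B2)))

def median_triangle(T):
    """Triangle whose vertices are the midpoints of the sides opposite A, B, C."""
    A, B, C = T
    mid = lambda p, q: ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)
    return (mid(B, C), mid(C, A), mid(A, B))

triangle = ((0, 0), (4, 0), (2, 6))
# A triangle and its median triangle are orthologic (example (1) above).
value_for_median_pair = orthology_sum((1, 1), triangle, median_triangle(triangle))
```

For a pair that is not orthologic the sum is a nonzero constant, the same for every choice of M.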

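Let us digress and indicate how this lemma can be used to prove that e.g. $\sim$ is 
symmetric (this is the only property of $\sim$ we shall need in the sequel). Starting with 
$\Delta A_1B_1C_1 \sim \Delta A_2B_2C_2$ and taking M to be $A_1$ in (1) we obtain 
$$\overrightarrow{A_1B_1}\cdot\overrightarrow{C_2A_2} + \overrightarrow{A_1C_1}\cdot\overrightarrow{A_2B_2} = 0.$$
But this corresponds to the equality 
$$\overrightarrow{MA_2}\cdot\overrightarrow{B_1C_1} + \overrightarrow{MB_2}\cdot\overrightarrow{C_1A_1} + \overrightarrow{MC_2}\cdot\overrightarrow{A_1B_1} = 0$$
written for $M = A_2$, therefore $\Delta A_2B_2C_2 \sim \Delta A_1B_1C_1$. 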

Consider next two triangles, ABC and EFD, satisfying the hypothesis of the 
theorem. First we will look at the case when their perspective axis exists. Thus we 
are assuming that their three pairs of corresponding sides, (AC, DE), (AB, EF), 
(BC, DF), meet in some points X, Y, Z lying on the perspective axis, which we 
denote by $\Delta$ (see Figure 1). 

Figure 1 

Let O denote their perspective center and let P and S stand for their orthology 
centers. It suffices to show that $OP \perp \Delta$ since similar reasoning with the roles of S 
and P interchanged yields $OS \perp \Delta$, i.e. O, S and P lie on the same line. 

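To this effect, let $\mathbf{a}, \mathbf{b}, \mathbf{c}$ stand for $\overrightarrow{OA}$, $\overrightarrow{OB}$, $\overrightarrow{OC}$, so that $\overrightarrow{OD} = \gamma\mathbf{c}$, 
$\overrightarrow{OE} = \alpha\mathbf{a}$, $\overrightarrow{OF} = \beta\mathbf{b}$, for some real scalars $\gamma, \alpha, \beta$. Clearly, any two of the 
vectors $\mathbf{a}, \mathbf{b}, \mathbf{c}$ are linearly independent. Furthermore, the assumption that the 
perspective axis exists ensures that among the real numbers $\alpha, \beta, \gamma$ there exists 
one which is different from the other two. With no loss in generality, let $\alpha$ be that 
number. Using our Lemma for the triangles ABC and EFD we get 
$$\overrightarrow{MA}\cdot\overrightarrow{FD} + \overrightarrow{MB}\cdot\overrightarrow{DE} + \overrightarrow{MC}\cdot\overrightarrow{EF} = 0, \tag{2}$$
for any point M in the plane. In particular, taking M = O, (2) becomes 
$$\mathbf{a}\cdot(\gamma\mathbf{c} - \beta\mathbf{b}) + \mathbf{b}\cdot(\alpha\mathbf{a} - \gamma\mathbf{c}) + \mathbf{c}\cdot(\beta\mathbf{b} - \alpha\mathbf{a}) = 0. \tag{3}$$
Introducing $\theta = \mathbf{a}\cdot\mathbf{b}$, $\psi = \mathbf{b}\cdot\mathbf{c}$ and $\varphi = \mathbf{a}\cdot\mathbf{c}$, the identity (3) is readily seen to 
be equivalent to 
$$\alpha(\theta - \varphi) + \beta(\psi - \theta) + \gamma(\varphi - \psi) = 0. \tag{4}$$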


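Going further, $OP \perp \Delta$ is equivalent to $\Delta DOF \sim \Delta XAY$ or, once again in virtue 
of our Lemma and the remark immediately following it, to 
$$\overrightarrow{MD}\cdot\overrightarrow{AY} + \overrightarrow{MO}\cdot\overrightarrow{YX} + \overrightarrow{MF}\cdot\overrightarrow{XA} = 0$$
for some point M in the plane. Choosing M = O, we are left with proving that 
$$\overrightarrow{OD}\cdot\overrightarrow{AY} + \overrightarrow{OF}\cdot\overrightarrow{XA} = 0. \tag{5}$$
Let $\overrightarrow{OY} = \overrightarrow{OA} + \lambda(\overrightarrow{OA} - \overrightarrow{OB}) = \overrightarrow{OE} + \mu(\overrightarrow{OE} - \overrightarrow{OF})$. As $\mathbf{a}$ and $\mathbf{b}$ 
are linearly independent, it follows that $\lambda = \beta(1 - \alpha)/(\alpha - \beta)$, i.e. 
$$\overrightarrow{AY} = \overrightarrow{OY} - \overrightarrow{OA} = \lambda(\overrightarrow{OA} - \overrightarrow{OB}) = \frac{\beta(1 - \alpha)}{\alpha - \beta}(\mathbf{a} - \mathbf{b}).$$
In a similar fashion we get $\overrightarrow{XA} = \frac{\gamma(1 - \alpha)}{\alpha - \gamma}(\mathbf{c} - \mathbf{a})$. Consequently, (5) 
becomes 
$$\begin{aligned}
&\gamma\mathbf{c}\cdot\frac{\beta(1 - \alpha)}{\alpha - \beta}(\mathbf{a} - \mathbf{b}) + \beta\mathbf{b}\cdot\frac{\gamma(1 - \alpha)}{\alpha - \gamma}(\mathbf{c} - \mathbf{a}) = 0 \\
&\Leftrightarrow \frac{\varphi - \psi}{\alpha - \beta} + \frac{\psi - \theta}{\alpha - \gamma} = 0 \\
&\Leftrightarrow (\alpha - \gamma)(\varphi - \psi) + (\alpha - \beta)(\psi - \theta) = 0 \\
&\Leftrightarrow \alpha(\varphi - \theta) + \beta(\theta - \psi) + \gamma(\psi - \varphi) = 0,
\end{aligned}$$
i.e. (5) is equivalent to (4), and the conclusion follows in this case. 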

Finally, in the case in which we do not have a proper perspective axis (i.e. this is 
thrown to infinity), we infer that $\alpha = \beta = \gamma$. Hence $\Delta ABC$ and $\Delta EFD$ are 
homothetic with constant $\alpha$. Now, the orthology centers of $\Delta EFD$ and $\Delta ABC$ 
become the orthocenter of $\Delta EFD$ and the orthocenter of $\Delta ABC$, respectively. 

Since these triangles are homothetic, it follows that O, S and P lie on the same 
line. Moreover, $\overrightarrow{OS} = \alpha\,\overrightarrow{OP}$. This completes the proof of the Theorem. 


The Existence of a Triangle with 
Prescribed Angle Bisector Lengths 


Petru Mironescu and Laurentiu Panaitopol 


Given three arbitrary positive numbers m,n, p, does there exist a triangle with 
angle bisectors of length m,n, p? The answer is YES! Moreover, the triangle is 
unique up to an isometry. This contrasts with related results for the triangle. The 
median lengths m,n, p satisfy m <n + p (and the two other cyclic inequalities) 
and the altitude lengths satisfy 1/m < 1/n + 1/p; thus the lengths m,n, p cannot 
be arbitrary in these cases. 

The referee has kindly submitted to us several bibliographic notes which show 
the long history of this problem. Brocard proposed in 1875 in the Nouvelle 
Correspondance Mathématique the problem of constructing such a triangle. The 
problem was reduced to that of solving an equation of degree 16 by F. J. Van Den 
Berg (Nieuw Archief voor Wiskunde, 1889). In 1896, P. Barbarin showed in 
Mathesis that the equation can be chosen to be of degree 14 and that it is 
irreducible in general. He also showed that this equation becomes an irreducible 
cubic when two of the angle bisector lengths are equal—which shows the impossi- 
bility of a Euclidean construction for the triangle. More detailed references can be 
found in [1], [2]. 

We now prove the announced result. First, we derive some needed formulas. If 
a, b, c are the side lengths, m, n, p the angle bisector lengths and s the 
semiperimeter of a triangle, then we have 
$$m = \frac{2}{b+c}\sqrt{bcs(s - a)} \tag{1}$$
with similar formulas for n and p. We prove (1) by an area argument (see Figure 1). 

Figure 1 

If S(MNP) denotes the area of triangle MNP, then 
$$2S(ABC) = 2S(ABD) + 2S(ACD)$$
so that 
$$bc\sin(A) = bm\sin(A/2) + cm\sin(A/2)$$
and hence 
$$m = \frac{bc\,\sin(A)}{(b+c)\sin(A/2)} = \frac{2bc}{b+c}\cos(A/2) = \frac{2bc}{b+c}\sqrt{s(s - a)/bc}.$$
One can easily check that 
$$\Big[b + c \pm \frac{a(b - c)}{b + c}\Big]^2 = 4m^2 + [a \pm (b - c)]^2 \tag{2}$$
and (2) gives 
$$b + c = \sqrt{m^2 + (s - b)^2} + \sqrt{m^2 + (s - c)^2}. \tag{3}$$
We get two related equalities for c + a and a + b. With the substitutions 
$$x = s - a, \quad y = s - b, \quad z = s - c \tag{4}$$
the relation (3) can be rewritten as 
$$x = \big[\sqrt{m^2 + y^2} - y\big]/2 + \big[\sqrt{m^2 + z^2} - z\big]/2 = f(y,m) + f(z,m) \tag{5}$$
where $f(u,v) = \big[\sqrt{u^2 + v^2} - u\big]/2$. 


Given arbitrary positive real numbers m, n, p, let $F: [0,\infty)^3 \to (0,\infty)^3$ be 
defined by 
$$F(x,y,z) = (f(y,m) + f(z,m),\ f(z,n) + f(x,n),\ f(x,p) + f(y,p)).$$
Taking (5) into account, we get 
$$F(x,y,z) = (x,y,z) \tag{6}$$
whenever x, y, z are as in (4) in a triangle with angle bisector lengths m, n, p. 
Conversely, if (6) holds, then in the triangle with side lengths y + z, z + x, x + y 
the equality (3) holds (together with the two analogous ones) so that m must be 
given by (1), in virtue of the monotonicity of the right side of (3) in the variable m. 
Hence the given problem is equivalent to the existence and uniqueness of a fixed 
point for F. 
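
The fixed point can also be approximated numerically. The Python sketch below simply iterates F from a starting point in the box $[0,m] \times [0,n] \times [0,p]$; this iteration scheme, the starting point, and the function names are assumptions made here for illustration, since the argument of this note concerns only existence and uniqueness and makes no claim about how to compute the fixed point. The helper `bisector_lengths` evaluates formula (1) and its two analogues so that a computed triangle can be checked.

```python
import math

def f(u, v):
    """f(u, v) = [sqrt(u^2 + v^2) - u] / 2, as in (5)."""
    return (math.sqrt(u * u + v * v) - u) / 2

def F(point, m, n, p):
    x, y, z = point
    return (f(y, m) + f(z, m), f(z, n) + f(x, n), f(x, p) + f(y, p))

def triangle_from_bisectors(m, n, p, tol=1e-14, max_iter=10000):
    """Iterate F (an illustrative scheme) and return side lengths (a, b, c)."""
    point = (m / 2, n / 2, p / 2)
    for _ in range(max_iter):
        nxt = F(point, m, n, p)
        done = max(abs(s - t) for s, t in zip(nxt, point)) < tol
        point = nxt
        if done:
            break
    x, y, z = point
    return (y + z, z + x, x + y)      # a = y + z, b = z + x, c = x + y

def bisector_lengths(a, b, c):
    """Angle bisector lengths from formula (1) and its two analogues."""
    s = (a + b + c) / 2
    return (2 * math.sqrt(b * c * s * (s - a)) / (b + c),
            2 * math.sqrt(c * a * s * (s - b)) / (c + a),
            2 * math.sqrt(a * b * s * (s - c)) / (a + b))
```

For instance, feeding `bisector_lengths(3, 4, 5)` into `triangle_from_bisectors` recovers side lengths close to 3, 4, 5.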


EXISTENCE. Since $f(u,v) \in [0, v/2]$ for nonnegative u, v, it follows that 
$F(K) \subset K$ where $K = [0,m] \times [0,n] \times [0,p]$. Note that (0,0,0) is not a fixed 
point of F. Since K is a convex compact set in $\mathbb{R}^3$ and F is continuous, the 
existence follows by the Brouwer Fixed Point Theorem (see, for example, [3]). 


UNIQUENESS. For $v \ne 0$, $u \ne t$, and $D = \sqrt{u^2 + v^2} + \sqrt{t^2 + v^2}$, we have 
$$2|f(u,v) - f(t,v)| = |u - t|\,[1 - (u + t)/D] < |u - t|. \tag{7}$$
For $(x,y,z) \ne (x',y',z')$, (7) gives 
$$\|F(x,y,z) - F(x',y',z')\| < (1/2)\sqrt{\textstyle\sum (|y - y'| + |z - z'|)^2} \le \|(x,y,z) - (x',y',z')\| \tag{8}$$
where $\sum$ stands for the cyclic sum and $\|\ \|$ denotes the Euclidean norm in $\mathbb{R}^3$. 
Uniqueness follows immediately from (8). 


THE EVOLUTION OF... 
Edited by Abe Shenitzer 
Mathematics, York University, North York, Ontario M3J 1P3, Canada 


An English major may or may not be a novelist or a poet, but would undoubtedly 
be expected to be able to evaluate a novel or a poem. The term “English major” 
implies some historical, philosophical, and evaluative training and competence. It 
is sad but true that the term “mathematician” does not imply corresponding 
training and competence. 

Integration of the narrowly mathematical and historical, philosophical and 
critical aspects of our discipline is bound to make it more meaningful not only to 
those who identify themselves as mathematicians but also to those who have no 
more than a tangential interest in the subject. 

To promote such integration, and thus encourage an approach to mathematics 
that emphasizes its meaning and significance, the Monthly will publish every two 
months an article of 2-5 pages under the generic title ‘““The evolution of...” The 
core of such an article will be an account of important mainstream mathematics. 
The essay that follows exemplifies the kind of material, and the approach, we have 
in mind. 

While we prefer original articles, we will also publish translations or adapta- 
tions of appropriate articles in the public domain. 


Abe Shenitzer 


The Evolution of Integration 


A. Shenitzer and J. Steprans 


THE GREEK PERIOD. The Greek problem underlying integration is the quadra- 
ture problem: Given a plane figure, construct a square of equal area. 

It is easy to solve the quadrature problem for a polygon, a figure with rectilinear 
boundary. The first quadrature of a figure with curvilinear boundary was achieved 
by Hippocrates in the fifth century B.C. Hippocrates showed that the area of the 
lunule in Figure 1 (that is, the figure bounded by one-half of a circle of radius 1 
and one-quarter of a circle of radius $\sqrt{2}$) is equal to the area of the unit square B. 

Hippocrates managed to square two other lunules.* 

In the third century B.C. Archimedes effected the quadrature of a parabolic 
segment. He showed that its area is $\frac{4}{3}\Delta$, where $\Delta$ is the triangle of maximal area 
inscribed in the parabolic segment. 

Archimedes effected a number of other quadratures (and cubatures). Some of 
his quadratures involved inventive constructions but most relied on the technique 
of wedging an area between ever closer upper and lower approximating sums. 


*Two more quadrable lunules were found by T. Clausen in the 19th century. In the 20th century, 
two Russian algebraists proved (independently) that these five lunules are the only quadrable ones. 

Figure 1 

Figure 2 


Analogs of such sums are a key element of the definition of the Darboux integral (a 
variant of the Riemann integral introduced by Darboux in the 19th century) as well as 
of quadrature programs for computers. We illustrate both of Archimedes’ ap- 
proaches next. 

Consider Figure 3. Here the hypotenuse AB of the right triangle OAB is 
tangent to the spiral at A. It then turns out that the side OB is equal to the 
circumference of the circle with radius OA. (This is a special case of Archimedes’ 
rectification of circular arcs by using tangents to spirals.) Since he knew that the 
area of a circle is half the product of its circumference by its radius, we can say 
that Archimedes used (a tangent to) a spiral to rectify a circle and square its area. 
Their brilliance notwithstanding, such constructions have been reduced to historical 
footnotes because they failed to yield general methods. 

Figure 3 

Figure 4 shows a turn of Archimedes’ spiral $r = a\theta$ and the associated circle of 
radius $2\pi a$, and thus of area $K = 4\pi^3 a^2$. To compute the area S of the turn of the 
spiral in Figure 4 Archimedes approximates it from below and above by unions of 
circular sectors indicated in Figure 5. 


Figure 4 Figure 5 


The areas of these approximating figures are, respectively, 
$$S'_n = \frac{4\pi^3 a^2}{n^3}\big[1^2 + 2^2 + \cdots + (n - 1)^2\big] = \frac{2\pi^3 a^2 (n - 1)(2n - 1)}{3n^2}$$
and 
$$S''_n = \frac{4\pi^3 a^2}{n^3}\big[1^2 + 2^2 + \cdots + n^2\big] = \frac{2\pi^3 a^2 (n + 1)(2n + 1)}{3n^2}.$$
It is not difficult to see that 
$$S'_n < \frac{4}{3}\pi^3 a^2 < S''_n$$
for all n. This double inequality can be rewritten as 
$$S'_n < \frac{K}{3} < S''_n$$
for all n. Obviously, 
$$S'_n < S < S''_n$$
for all n. To prove that S = (K/3) Archimedes shows that $S''_n - S'_n = 4\pi^3 a^2/n$ 
and is thus small for large n. He can now show that the assumption S ≠ K/3 leads 
to a contradiction and can conclude that S = (K/3). 
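
The squeeze is easy to watch numerically. The following Python sketch (the function name is an illustrative assumption) computes the lower and upper sector sums for the first turn of the spiral $r = a\theta$ together with K, so that one can see both sums close in on K/3 as n grows.

```python
import math

def spiral_sector_sums(a, n):
    """Return (lower sum, upper sum, K) for the first turn of r = a * theta.

    Each sector has angle 2*pi/n; the k-th radius is a * 2*pi*k/n, so one
    sector has area (K / n^3) * k^2 with K = 4 * pi^3 * a^2.
    """
    K = 4 * math.pi ** 3 * a ** 2
    unit = K / n ** 3
    lower = unit * sum(k * k for k in range(1, n))       # inscribed sectors
    upper = unit * sum(k * k for k in range(1, n + 1))   # circumscribed sectors
    return lower, upper, K

for n in (10, 100, 1000):
    lower, upper, K = spiral_sector_sums(1.0, n)
    print(n, lower, K / 3, upper)
```

The gap between the two sums is K/n, which is the quantity Archimedes needs to be small.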

While Archimedes makes no explicit use of limits, he relies on the “method of 
exhaustion,” and, in modern terms, the final part of the argument in a proof 
involving the method of exhaustion (in the above example it is disproving S # K/3) 
amounts to proving the uniqueness of the limit of a Cauchy sequence. 


CONTINUATION IN THE 17TH CENTURY. Using nonrigorous infinitesimal 
techniques (rather than rigorous algebraic methods of the kind used by Archimedes) 
Cavalieri (1598-1647) managed to compute (what we now write as) $\int_0^a x^k\,dx$ for 
k = 1, 2,..., 9. His chief difficulty was the evaluation of $1^k + \cdots + n^k$. In about 
1650 Fermat evaluated $\int_0^a x^{p/q}\,dx$ by means of a brilliant yet simple computation. 
Further progress was due to Torricelli, Wallis, and Pascal. In particular, Pascal 
interpreted Cavalieri’s “sum of lines” (the equivalent of area) as a sum of 
infinitesimal rectangles. 

If we combine Fermat’s result with Cavalieri’s understanding of the linearity of 
the definite integral (our terminology!) then we see that by the middle of the 17th 
century one could evaluate $\int_a^b P(x)\,dx$, P(x) a “polynomial” with rational exponents. 

In 1647 Gregory St. Vincent made a discovery that linked Napier’s logarithm 
function and the area under the hyperbola xy = 1. This connection is now 
expressed as $\log(x) = \int_1^x (dt/t)$. 

Newton and Leibniz invented the calculus and made it into a tool with countless 
applications but neither gave what we would call a rigorous definition of a definite 
integral (or saw the need for such a definition). Such concerns became dominant in 
the 19th century. 


FROM CAUCHY TO LEBESGUE. The first rigorous definition of a definite 
integral was given by Cauchy in the 1820s. Cauchy dealt with continuous functions. 
In view of the importance of Fourier series whose coefficients are given by 
integrals it was necessary to define the integral for more general functions. This 
was first done by Riemann. The limitations of the Riemann integral were reme- 
died at the beginning of the 20th century by Lebesgue. An explanation follows. 

With each theory of integration there is associated a theory of measure. 
Specifically, if f is a function on a set E and $f = f^+ - f^-$ (recall that $f^+(x) = 
\max\{f(x), 0\}$ and $f^-(x) = \max\{-f(x), 0\}$) then $\int_E f$ is defined as the difference 
$\int_E f^+ - \int_E f^-$ of the measures $\int_E f^+$ and $\int_E f^-$ of the ordinate sets of the nonnegative 
functions $f^+$ and $f^-$ respectively. 

The measure underlying the Riemann integral is Jordan measure and the 
measure underlying the Lebesgue integral is Lebesgue measure. How do they 
differ? In what way is one “better” than the other? 

Consider the simple case of the ordinate set M of a bounded, nonnegative 
function f on an interval, $0 \le f(x) \le C$ for x in [a, b]. The Jordan measure of M 
is the common value, if any, of the outer and inner Jordan measures of M. The 
outer Jordan measure of M is the glb of the areas of the coverings of M consisting 
of finite unions of rectangles. The inner measure of M is the difference between 
the area C(b — a) of the rectangle S with base [a, b] and height C and the outer 
measure of the complement of M in S. Lebesgue replaced the word “finite” in the 
Jordan definition of the measure of a subset of S by “countable.” This increased 
greatly the number of measurable subsets of S and led to a theory of integration 
far more comprehensive and mathematically flexible than Riemann’s. 


THE HK-INTEGRAL. Surprisingly, Henstock (in 1955) and Kurzweil (in 1957) 
came up with a new version of the Riemann integral—call it the HK-integral (see 
[7])—that is “as good as” the Lebesgue integral! Its definition and main character- 
istics follow (see [7]): 


Definition: A tagged division of [a, b] is given by a finite ordered set $a = x_0 < x_1 < 
\cdots < x_n = b$ of points, together with a collection of tags $z_i$ such that $x_{i-1} \le z_i \le 
x_i$ for i = 1,...,n. We denote a tagged division by $D(x_i, z_i)$ and the corresponding 
Riemann sum by 
$$S(D(x_i, z_i)) = \sum_{i=1}^{n} f(z_i)(x_i - x_{i-1}).$$
A gauge on [a, b] is a function $\delta$ defined on [a, b] such that $\delta(x) > 0$ for all 
$x \in [a, b]$. An important example of a gauge is a constant function. If $\delta$ is any 
gauge on [a, b], we say that a tagged division $D(x_i, z_i)$ is $\delta$-fine in case that 
$[x_{i-1}, x_i] \subset [z_i - \delta(z_i), z_i + \delta(z_i)]$; that is, in case $z_i - \delta(z_i) \le x_{i-1} \le z_i \le x_i \le 
z_i + \delta(z_i)$ for all i = 1, 2,..., n. Finally, we say that the number A is an HK-integral 
of f if, for every $\varepsilon > 0$, there exists a gauge $\delta_\varepsilon$ such that if $D(x_i, z_i)$ is any 
tagged division of [a, b] that is $\delta_\varepsilon$-fine, then we have 
$$|S(D(x_i, z_i)) - A| < \varepsilon.$$

It turns out that “the HK-integral of a function is uniquely defined when it 
exists and that a function is Riemann integrable if and only if the gauge $\delta_\varepsilon$ can be 
chosen to be constant.” More importantly, “every Lebesgue integrable function is 
HK-integrable with the same value.” 
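
The definitions above translate directly into code. In the Python sketch below, the representation of a tagged division as a list of triples $(x_{i-1}, x_i, z_i)$, the bisection search for a $\delta$-fine division, and the particular gauge are all assumptions chosen for illustration; the bisection search is a heuristic and is not guaranteed to succeed for every gauge. The example function $1/\sqrt{x}$ (set to 0 at x = 0) is unbounded near 0, and the non-constant gauge forces the piece containing 0 to be tagged at 0.

```python
import math

def riemann_sum(f, division):
    """S(D) for a tagged division given as triples (x_{i-1}, x_i, z_i)."""
    return sum(f(z) * (right - left) for left, right, z in division)

def is_delta_fine(division, gauge):
    """Check z - gauge(z) <= x_{i-1} <= z <= x_i <= z + gauge(z) on every piece."""
    return all(z - gauge(z) <= left <= z <= right <= z + gauge(z)
               for left, right, z in division)

def bisected_division(a, b, gauge, max_pieces=1_000_000):
    """Search for a gauge-fine tagged division of [a, b] by repeated bisection.

    Each piece is tried with its left end, right end, and midpoint as tag;
    a piece is halved when none of these tags fits (a heuristic only).
    """
    pending, pieces = [(a, b)], []
    while pending:
        left, right = pending.pop()
        for z in (left, right, (left + right) / 2):
            if z - gauge(z) <= left and right <= z + gauge(z):
                pieces.append((left, right, z))
                break
        else:
            if len(pieces) + len(pending) > max_pieces:
                raise RuntimeError("no fine division found within the piece limit")
            mid = (left + right) / 2
            pending.extend([(mid, right), (left, mid)])
    return sorted(pieces)

def f(x):
    return 1 / math.sqrt(x) if x > 0 else 0.0

def gauge(x):
    # Illustrative gauge: small near 0 in proportion to x, but positive at 0.
    return 0.01 * x if x > 0 else 1e-4

division = bisected_division(0.0, 1.0, gauge)
total = riemann_sum(f, division)   # close to 2, the value of the integral of 1/sqrt(x) over [0, 1]
```

Shrinking both constants in `gauge` brings the Riemann sum closer to 2, in the spirit of the definition, whereas no constant gauge can control the sums of this unbounded function.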


THERE IS NO PERFECT INTEGRAL. While in the eyes of some mathematicians 
the Lebesgue integral was the final answer to the difficulties associated with 
integration, there were others who were not willing to give up the search for the 
perfect integral, one which would make all functions integrable. Because Lebesgue’s 
construction had shown that the key to a comprehensive theory of integration was 
the construction of an appropriate measure, the search now focussed on finding a 
total measure on R, that is, one which assigns a measure to each subset of the real 
numbers. 

Vitali [6] showed that a total measure on the reals cannot be countably additive 
and translation invariant. This being so, it is natural to ask which of these 
properties should be retained. This decision is, of course, somewhat arbitrary. 
While retaining translation invariance leads to some fascinating group theory and 
the Banach-Hausdorff-Tarski Paradox, we will consider what happens if countable 
additivity is retained instead. 

In 1930 S. Ulam [1] showed that there is no such measure on $\omega_1$, $\omega_2$, or on any 
cardinal¹ which is the successor of some other cardinal. Ulam’s proof was a 
spectacular advance in that it did not rely on any of the geometric assumptions, 
such as translation invariance, on which earlier proofs of the existence of non-mea- 
surable sets had relied. 

By Ulam’s theorem, the existence of a countably additive measure on R that 
measures all of its subsets implies that $2^{\aleph_0}$ is not the successor of any other 
cardinal, that is, it is a limit cardinal. By arguing a bit more carefully one can show 
that there must exist some limit cardinal $\lambda \le 2^{\aleph_0}$ which is not the union of fewer 
than $\lambda$ sets of size less than $\lambda$. The existence of such a cardinal has a profound 
influence on set theory. 

In order to understand this influence, it is necessary to recall (a consequence of) 
Gödel’s second incompleteness theorem which says that set theory cannot prove 
its own consistency. One way to prove the consistency of a theory is to find a model 
of that theory, that is, a mathematical structure satisfying all of the axioms of that 
theory. We ask: What are the implications for set theory of the existence of a 
model of set theory? Recall the procedure for the construction of the hierarchy of 
sets. One begins with the empty set (call it $V_0$) and then defines $V_k$ to be the 
power set of $V_{k-1}$ for each integer $k \ge 1$. This is not the end, though, because one 
can then define $V_\omega$ to be the union of the sets $V_k$ and then define $V_{\omega+1}$ to be the 
power set of $V_\omega$. If one continues this as far as possible and takes the union one 
gets a model of set theory, or, at least, what would be a model of set theory if it 
were a set and not a proper class. 

¹Cantor introduced the notation $\omega$ to represent the next ordinal after the integers and it is still 
favored by set theorists today. The next cardinal after $\omega$ is denoted $\omega_1$ and so on. 

How soon, if ever, does this construction process lead to a model of set theory? 
It turns out that many of the axioms of set theory are satisfied at early stages of the 
construction. For example the axiom of infinity is satisfied as soon as a single 
infinite set is included and this is already true of $V_{\omega+1}$. The power set axiom is 
satisfied at any limit stage because any set which occurs, occurs at a stage before 
the limit and so all of its subsets are added at the very next stage. The power set 
itself is therefore added in no more than two stages and, in any case, before the 
limit. For similar reasons, the pairing axiom is also satisfied at all limit stages. 
Well-foundedness and comprehension are also easy to deal with. 

The problematic axiom is the axiom of replacement, which says that the range 
of any function defined by a formula is a set. It has already been mentioned that 
$V_{\omega+\omega}$ will satisfy all of the axioms of set theory except for replacement. Replacement 
fails because the mapping which takes 2n to $\omega + n$ and 2n + 1 to n is 
definable by a formula and its domain is $\omega$ which belongs to $V_{\omega+1} \subset V_{\omega+\omega}$. 
However, the range of this function is $\omega + \omega$ which does not belong to $V_{\omega+\omega}$. The 
same argument can be used to show that $V_\alpha$ is a model of set theory if and only if 
the following holds: 


• if $\lambda < \alpha$ then $2^\lambda < \alpha$ 
• if $\lambda < \alpha$ then any function $F: \lambda \to \alpha$ (defined using only parameters from $V_\alpha$) 
has range bounded in $\alpha$. 


Any cardinal satisfying these requirements is known as a large or inaccessible 
cardinal. Since the existence of a large cardinal implies that a model of set theory 
exists, it follows from Gödel’s Theorem that it is impossible to prove the existence 
of inaccessible cardinals. 

Ulam’s argument shows that if there is a countably additive measure which 
measures every set of reals then there is a cardinal a which satisfies the second 
requirement of being an inaccessible cardinal. Such cardinals are known as weakly 
inaccessible. Another of Gödel’s major contributions is the notion of the Constructible 
Universe, one of whose consequences is that any model of set theory 
contains a submodel which satisfies the generalized continuum hypothesis. This 
allows us to conclude that if there is a weakly inaccessible cardinal then, in the 
Constructible Universe, the weakly inaccessible cardinal is in fact an inaccessible 
cardinal; this is so because the cardinal arithmetic of this smaller model of set 
theory easily implies the first requirement for being a large cardinal. 

In other words, if there is a countably additive measure which measures every 
set of reals then set theory is consistent. This and Gödel’s theorem show that the 
existence of a perfect integral is not provable. On the other hand, it is conceivable 
that some day there may be a proof that it is not possible to have a perfect 
integral. The impact of this on set theory would be devastating. It would follow 
that many of the large cardinals which experts now consider quite innocuous, and 
which have played an important role in many important independence results, do 
not exist. While this would not show that set theory itself is inconsistent it would 
severely shake our faith in the assumption that it is consistent. 


* * * 


We've told our story but would nevertheless like to tack on the following 
relevant “postscript”: 

In what sense does the integral solve the Greek quadrature problem and what is 
its conceptual significance? A telegraphic answer to these two questions follows. 

The integral provides a direct “analytic” solution of the Greek quadrature 
problem for regions of the form shown in Figure 6.

[Figure 6: the region under the graph of $f(x)$ over the interval $[a, b]$.]

Indeed, the area of the region in the figure is 

\[ A = \int_a^b f(x)\,dx. \]

If we rewrite this as 

\[ A = \int_a^b f(x)\,dx = (b - a)\left[\frac{1}{b - a}\int_a^b f(x)\,dx\right], \]

then it is clear that our “integral region” has been replaced by a rectangle of equal 
area with base $b - a$ and height $\frac{1}{b - a}\int_a^b f(x)\,dx$. The quantity 
$\frac{1}{b - a}\int_a^b f(x)\,dx$ is the average of the functional values of $f$ on $[a, b]$. This averaging 
ability of the integral is the key to its importance in countless applications. 
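
The averaging property can be checked numerically. The following is a minimal sketch, not part of the original article: the function name `average_value`, the midpoint rule and the sample integrand $x^2$ on $[0, 3]$ are all illustrative assumptions.

```python
def average_value(f, a, b, steps=100_000):
    """Approximate (1/(b-a)) * integral of f over [a, b] with the midpoint rule."""
    width = (b - a) / steps
    total = sum(f(a + (i + 0.5) * width) for i in range(steps))
    return total * width / (b - a)


# Height of the rectangle on [0, 3] whose area equals the area under x**2.
height = average_value(lambda x: x * x, 0.0, 3.0)
print(height)
```

Multiplying the printed height by the base $b - a = 3$ recovers the area under the curve, which is the rectangle described above.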
Pascal’s Matrices 


I recently read your article “Pascal’s Matrices” in the April, 1993 issue of the 
American Mathematical Monthly. It has some attractive observations about Pascal’s 
triangle, and I applaud your efforts to raise the consciousness of the Monthly 
readership about matrix exponentials. However, I must express sadness that you 
passed up the occasion to point even higher. Specifically, the theme of your article 
would have been strengthened, it seems to me, if you had pointed out that any
one-parameter group of matrices must be the exponential of some unique matrix, which 
then may be computed by the method you indicate, of differentiating at the origin. 


Having said this, you then could have mentioned that this converse fact is basic to the 
theory of Lie groups. Of the many references to which readers interested in following 
up these hints might have gone, one of the more convenient for Monthly readers 
might be my article of November 1983. 
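
A small illustration of the one-parameter group in question may help. The sketch below is not from the article or the letter: the helper names are hypothetical, and it assumes the standard fact that the lower-triangular Pascal matrix is $\exp(L)$, where $L$ has $1, 2, 3, \ldots$ on its subdiagonal. It checks that $P(t) = \exp(tL)$ has entries $\binom{i}{j} t^{i-j}$ and that $P(s)P(t) = P(s + t)$.

```python
from fractions import Fraction
from math import comb, factorial


def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def generator(n):
    """Strictly lower-triangular matrix with 1, 2, ..., n-1 on the subdiagonal."""
    return [[Fraction(i) if i == j + 1 else Fraction(0) for j in range(n)] for i in range(n)]


def exp_nilpotent(gen, t=1):
    """exp(t * gen) for a nilpotent matrix, computed as a finite power series."""
    n = len(gen)
    result = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    power = [row[:] for row in result]
    for k in range(1, n):
        power = mat_mul(power, gen)
        for i in range(n):
            for j in range(n):
                result[i][j] += Fraction(t) ** k * power[i][j] / factorial(k)
    return result


def pascal(n, t=1):
    """Matrix with entries C(i, j) * t^(i - j) below and on the diagonal."""
    return [[comb(i, j) * Fraction(t) ** (i - j) if i >= j else Fraction(0)
             for j in range(n)] for i in range(n)]


print(exp_nilpotent(generator(5)) == pascal(5))
print(mat_mul(pascal(4, 2), pascal(4, 3)) == pascal(4, 5))
```

The generator is the coefficient of $t$ in $P(t)$, which is what differentiating at the origin extracts.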


Roger Howe 

Department of Mathematics 
Yale University 
PROBLEMS AND SOLUTIONS 


Edited by: 
Richard T. Bumby, Fred Kochman and Douglas B. West 


Proposed problems should be sent to the MONTHLY PROBLEMS address given on 
the inside front cover. Please include solutions, relevant references, etc. Three copies 
are requested. 


Solutions of published problems should arrive before June 30, 1994 at the MONTHLY 
PROBLEMS address given on the inside front cover. Solutions should be typed with 
double spacing, including the problem number and the solver’s name and mailing 
address. Two copies suffice. A self-addressed postcard or label should be included if 
an acknowledgment is desired. 


An asterisk ( * ) after the number of a problem, or part of a problem, indicates that 
no solution is currently available. Partial solutions will be useful in such cases. 
Otherwise, the published solution is likely to be based on a solution which is complete 
and correct. Of course, an elegant partial solution or a method leading to a more 
general result is always useful and welcome. In addition, references to other 
appearances of MONTHLY problems or to solutions of these problems in the 
literature are also solicited. 


10354. Proposed by Hassan Ali Shah Ali, Tehran, Iran. 

Determine the least natural number $N$ such that, for all $n > N$, there exist 
natural numbers $a$, $b$ with $n = \lfloor a\sqrt{2} + b\sqrt{3} \rfloor$. 


10355. Proposed by Joaquín Gómez Rey, I. B. “Luis Buñuel”, Alcorcón (Madrid), 
Spain. 


Two players of equal strength play a tournament consisting of $2n$ matches. Let 
$T$ be the random variable that counts the number of times the score is tied during 
the tournament (including the initial 0-0). What is $E(T) + E(T^2)$? 


10356. Proposed by Shalosh B. Ekhad, Princeton, NJ. 


Let $X_n$ be defined by $X_0 = 0$, $X_1 = 1$, $X_2 = 0$, $X_3 = 1$, and for $n \ge 1$, 

\[ X_{n+3} = \frac{(n^2 + n + 1)(n + 1)}{n} X_{n+2} + (n^2 + n + 1) X_{n+1} - \frac{n + 1}{n} X_n. \]

Prove that $X_n$ is a square of an integer for every $n \ge 0$. 
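
Readers who want to experiment before attempting a proof can generate the sequence directly. The sketch below is only an illustration and proves nothing: the function name is hypothetical, and it assumes the recurrence $X_{n+3} = \frac{(n^2+n+1)(n+1)}{n} X_{n+2} + (n^2+n+1) X_{n+1} - \frac{n+1}{n} X_n$ with $X_0, X_1, X_2, X_3 = 0, 1, 0, 1$, multiplied through by $n$ to stay in integer arithmetic.

```python
from math import isqrt


def ekhad_sequence(count):
    """First `count` terms of X_n, using the recurrence cleared of denominators."""
    x = [0, 1, 0, 1]
    n = 1
    while len(x) < count:
        numerator = ((n * n + n + 1) * (n + 1) * x[n + 2]
                     + n * (n * n + n + 1) * x[n + 1]
                     - (n + 1) * x[n])
        # The division by n is expected to be exact; fail loudly if it is not.
        assert numerator % n == 0, "recurrence left the integers"
        x.append(numerator // n)
        n += 1
    return x[:count]


terms = ekhad_sequence(12)
print(terms[:8])
print(all(isqrt(t) ** 2 == t for t in terms))
```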


10357. Proposed by Ira Gessel, Brandeis University, Waltham, MA. 

Define integers $a_{m,n}$ by 

\[ \frac{1}{1 - u - v + 2uv} = \sum_{m,n=0}^{\infty} a_{m,n} u^m v^n. \]

Show that $(-1)^i a_{2i,2i+2}$ is the Catalan number $\binom{2i}{i}/(i + 1)$. 
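
The coefficients are easy to tabulate for experimentation. This sketch is not part of the problem statement: it assumes the generating function $1/(1 - u - v + 2uv)$, from which multiplying out gives $a_{m,n} = a_{m-1,n} + a_{m,n-1} - 2a_{m-1,n-1}$ with $a_{0,0} = 1$, and it assumes the indices in the claim are $(2i, 2i+2)$. The function names are illustrative.

```python
from math import comb


def coefficients(size):
    """a[m][n] for 0 <= m, n < size, from (1 - u - v + 2uv) * sum a[m][n] u^m v^n = 1."""
    a = [[0] * size for _ in range(size)]
    for m in range(size):
        for n in range(size):
            if m == 0 and n == 0:
                a[m][n] = 1
                continue
            left = a[m - 1][n] if m > 0 else 0
            below = a[m][n - 1] if n > 0 else 0
            diagonal = a[m - 1][n - 1] if m > 0 and n > 0 else 0
            a[m][n] = left + below - 2 * diagonal
    return a


def catalan(i):
    return comb(2 * i, i) // (i + 1)


a = coefficients(20)
print([(-1) ** i * a[2 * i][2 * i + 2] for i in range(9)])
print([catalan(i) for i in range(9)])
```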
10358*. Proposed by Jiang Huanxin, student, FuDan University, ShangHai, China 


In triangle $\triangle ABC$, find all points $P$ such that the triangle $\triangle DEF$ (with 
$D = AP \cap BC$, $E = BP \cap CA$, $F = CP \cap AB$) is equilateral. 


10359. Proposed by Raphael M. Robinson, University of California, Berkeley, CA. 


Two pairs of sides of the unit square 0 < x < 1,0 < y < 1 are identified in such 
a way that the surface obtained has a locally Euclidean metric. How many such 
surfaces are there which are inequivalent as metric spaces? 


10360. Proposed by Richard P. Stanley, Massachusetts Institute of Technology, 
Cambridge, MA. 


Let $L$ be the integer lattice in $\mathbb{R}^d$, i.e. $L$ is the set of points $(x_1, x_2, \ldots, x_d)$ with 
all $x_i \in \mathbb{Z}$. Consider $L$ as a graph by declaring two lattice points to be adjacent if 
the distance between them is 1. Define a sequence $S_0, S_1, \ldots$ of subsets of $L$ 
inductively as follows: 

\[ S_0 = \{(0, 0, \ldots, 0)\}, \]

\[ S_n = \Bigl\{ P \in L - \bigcup_{0 \le k < n} S_k : P \text{ is adjacent to exactly one element of } \bigcup_{0 \le k < n} S_k \Bigr\}. \]

Let $S$ be the subgraph of $L$ whose vertices are $\bigcup S_n$. Thus, $P \in S$ is adjacent to 
$P' \in S$ if the distance between $P$ and $P'$ is 1. 


(a) Find a simple condition for a point of $L$ to belong to $S$. 

(b) For $P \in S$, find a simple rule to determine $i$ such that $P \in S_i$. 

(c) How many elements are in $S_n$? 

(d) How many $P \in S_n$ are adjacent to no points of $S_{n+1}$? 

(e) Show that $S$ is a tree. 

(f) Investigate the (vertex) density of $S$ in $L$, and compare it to the largest 
density of a subset of $L$ for which the induced subgraph is a tree. 
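
The construction is simple to simulate, which may help in forming conjectures for parts (a) through (e). The sketch below is an illustration only, with hypothetical function names: it builds $S_0, S_1, \ldots$ in dimension $d$ by brute force, so it is practical only for a few generations.

```python
def neighbors(point):
    """Lattice points at distance 1 from `point`."""
    for axis in range(len(point)):
        for step in (-1, 1):
            yield point[:axis] + (point[axis] + step,) + point[axis + 1:]


def generations(dimension, count):
    """Return [S_0, S_1, ..., S_{count-1}] as sets of integer tuples."""
    origin = (0,) * dimension
    layers = [{origin}]
    seen = {origin}
    while len(layers) < count:
        candidates = {q for p in seen for q in neighbors(p)} - seen
        # Keep the candidates adjacent to exactly one point chosen so far.
        layer = {q for q in candidates if sum(r in seen for r in neighbors(q)) == 1}
        layers.append(layer)
        seen |= layer
    return layers


layers = generations(2, 8)
vertices = set().union(*layers)
edges = sum(1 for p in vertices for q in neighbors(p) if q in vertices) // 2
print([len(layer) for layer in layers])
print(edges == len(vertices) - 1)
```

The last line compares the number of edges of the induced subgraph with the number of vertices minus one, which is what one expects of a finite tree.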


NOTES 


Notes: (10358) Partial results, giving the properties of such points, would also be of 
interest. (10360) The case of d = 2 already has many features of the general 
problem. It would be interesting to discover the extent to which this is typical of all 
dimensions. Note that the only graph structure used in the problem is that induced 
from the original definition of adjacency in L. 
SOLUTIONS 


A Generalized Gamma Function with Independent Branches 


6649 [1991, 168]. Proposed by D. E. Knuth, Stanford University, Stanford, CA. 


Let $P$ be a monic polynomial of degree $m$ with complex coefficients and let $\omega$ 
be a primitive $m$th root of unity. Let $A$ be the $m$ by $m$ matrix in which the 
element in the $j$th row and $k$th column is 

\[ A_{jk} = \int_0^\infty (\omega^k t)^j e^{-P(\omega^k t)}\,dt \qquad (0 \le j, k \le m - 1). \]

Prove that $A$ is nonsingular. 


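Solution by the proposer. The integrals $A_{jk}$ obviously exist because $P(\omega^k t) = t^m 
+ O(t^{m-1})$ as $t \to \infty$. Suppose that we have complex numbers $\alpha_0, \ldots, \alpha_{m-1}$ such 
that $\sum_{k=0}^{m-1} \alpha_k A_{jk} = 0$ for $0 \le j < m$. We wish to prove that $\alpha_0 = \cdots = \alpha_{m-1} = 0$. 
Write $P'(x) = a_0 x^{m-1} + \cdots + a_{m-1}$, where $a_0 = m$; then we have, for all $j \ge 0$, 

\[ a_0 A_{j+m,k} + a_1 A_{j+m-1,k} + \cdots + a_{m-1} A_{j+1,k} = \int_0^\infty (\omega^k t)^{j+1} P'(\omega^k t) e^{-P(\omega^k t)}\,dt \]

\[ = -\omega^{-k} (\omega^k t)^{j+1} e^{-P(\omega^k t)} \Big|_0^\infty + (j + 1) \int_0^\infty (\omega^k t)^j e^{-P(\omega^k t)}\,dt. \]

Hence, from solving the above for $A_{j+m,k}$, it follows inductively that $\sum_{k=0}^{m-1} \alpha_k A_{jk} 
= 0$ for all $j \ge 0$. Thus 

\[ \sum_{k=0}^{m-1} \alpha_k \int_0^\infty Q(\omega^k t) e^{-P(\omega^k t)}\,dt = 0 \qquad (*) \]

for any polynomial $Q$. 

If $P(x) = x^m$, we have $A_{jk} = \omega^{kj} \int_0^\infty t^j e^{-t^m}\,dt = \omega^{kj} B_j$, where $B_j \ne 0$. Hence 
$\sum_{k=0}^{m-1} \alpha_k \omega^{kj} = 0$ for $0 \le j < m$. The Vandermonde matrix $(\omega^{kj})$ is nonsingular, so 
$\alpha_0 = \cdots = \alpha_{m-1} = 0$. 

Otherwise, suppose $P(x) = x^m + g(x)$ where $\deg(g) < m$, and let $Q_n(x) = 
\sum_{k=0}^{n-1} g(x)^k/k!$, so $|e^{g(x)} - Q_n(x)| \le |g(x)|^n e^{|g(x)|}/n!$. Then, on using $(*)$ with 
$Q(t) = t^j Q_n(t)$, 

\[ \sum_{k=0}^{m-1} \alpha_k \int_0^\infty (\omega^k t)^j e^{-(\omega^k t)^m}\,dt = \sum_{k=0}^{m-1} \alpha_k \int_0^\infty (\omega^k t)^j e^{g(\omega^k t) - P(\omega^k t)}\,dt \]

\[ = \sum_{k=0}^{m-1} \alpha_k \int_0^\infty (\omega^k t)^j \bigl(e^{g(\omega^k t)} - Q_n(\omega^k t)\bigr) e^{-P(\omega^k t)}\,dt. \]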
Since $g(x) = O(x^{m-1})$ and $|g(\omega^k t)| - P(\omega^k t) = -t^m + O(t^{m-1})$ it follows, for 
constants $K$ and $K_1$ independent of $n$, that 

\[ \left| \int_0^\infty (\omega^k t)^j \bigl(e^{g(\omega^k t)} - Q_n(\omega^k t)\bigr) e^{-P(\omega^k t)}\,dt \right| \le \frac{1}{n!} \int_0^\infty t^j |g(\omega^k t)|^n \bigl| e^{|g(\omega^k t)| - P(\omega^k t)} \bigr|\,dt \le \cdots \le \frac{K_1^n (pn)!}{n!}, \]

where $p = (m - 1/2)/m$ and $j < m$. From Stirling’s formula, 

\[ \lim_{n \to \infty} K_1^n (pn)!/n! = 0 \quad \text{if } 0 < p < 1, \]

and it follows that $\sum_{k=0}^{m-1} \alpha_k \int_0^\infty (\omega^k t)^j e^{-(\omega^k t)^m}\,dt = 0$. But this is the case $P(x) = x^m$ 
already dealt with and so $\alpha_k = 0$ for $0 \le k < m$. 


No other solutions were received. 
Tossing Coins Until All Show Heads 


E 3436 [1991, 366]. Proposed by Lennart Råde, University of Göteborg, Sweden. 


Suppose we have n identical coins for each of which heads occurs with 
probability p. Suppose we first toss all the coins, then toss those which show tails 
after the first toss, then toss those which show tails after the second toss, and so on 
until all the coins show heads. Let X be the number of coins involved in the last 
toss. 


(a) Find $P(X = i)$ for $i = 1, 2, \ldots, n$ and $E(X)$. 
(b) Let $p_n = P(X = 1)$. Analyze the behavior of $p_n$ as $n \to \infty$. 


Solution of (a) by Peter Griffin, California State University, Sacramento, CA. 
The event $X_n = i$ occurs just when, for some $k$, $i$ coins first show heads on the $k$th 
toss while the other $n - i$ coins produced a head at some time before the last toss. 
Letting $q = 1 - p$, this yields 

\[ P(X_n = i) = \binom{n}{i} \sum_{k=1}^{\infty} (p q^{k-1})^i (1 - q^{k-1})^{n-i}. \]

(When $i = n$ and $k = 1$, we interpret $0^0$ as 1, while for $i < n$ the sum can be 
started at $k = 2$.) By expanding the binomial, one can write this as a finite sum as 
follows: 

\[ P(X_n = i) = \binom{n}{i} p^i \sum_{j=0}^{n-i} \binom{n-i}{j} \frac{(-1)^j}{1 - q^{i+j}}. \]

To calculate $E(X_n)$, recognize that the interior sum of 

\[ \sum_{i=1}^{n} i P(X_n = i) = \sum_{k=1}^{\infty} \sum_{i=1}^{n} i \binom{n}{i} (p q^{k-1})^i (1 - q^{k-1})^{n-i} \]

can be evaluated by regarding it as $(p q^{k-1} + 1 - q^{k-1})^n$ times the mean of a 
binomial variable with $n$ trials and success probability $p q^{k-1}/(p q^{k-1} + 1 - q^{k-1}) 
= p q^{k-1}/(1 - q^k)$, yielding 

\[ E X_n = \sum_{k=1}^{\infty} n p q^{k-1} (1 - q^k)^{n-1} = P(X_n = 1)/q. \]
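
These formulas can be sanity-checked numerically. The following sketch is an illustration, not part of the published solution: the function names, the choice $n = 5$, $p = 0.3$ and the truncation of the infinite series at 2000 terms are assumptions. It uses the finite alternating sum $P(X_n = i) = \binom{n}{i} p^i \sum_{j} \binom{n-i}{j} (-1)^j/(1 - q^{i+j})$ and the series $\sum_{k \ge 1} n p q^{k-1} (1 - q^k)^{n-1}$ for the mean.

```python
from math import comb


def last_toss_distribution(n, p):
    """P(X_n = i) for i = 1..n via the finite alternating sum."""
    q = 1.0 - p
    return [
        comb(n, i) * p ** i
        * sum(comb(n - i, j) * (-1) ** j / (1.0 - q ** (i + j)) for j in range(n - i + 1))
        for i in range(1, n + 1)
    ]


def expected_last_toss(n, p, terms=2000):
    """E(X_n) from the series sum_{k>=1} n p q^(k-1) (1 - q^k)^(n-1), truncated."""
    q = 1.0 - p
    return sum(n * p * q ** (k - 1) * (1.0 - q ** k) ** (n - 1) for k in range(1, terms + 1))


n, p = 5, 0.3
dist = last_toss_distribution(n, p)
mean_from_dist = sum(i * prob for i, prob in enumerate(dist, start=1))
print(sum(dist))
print(mean_from_dist, expected_last_toss(n, p), dist[0] / (1.0 - p))
```

The three numbers on the last line should agree up to rounding: the mean computed from the distribution, the series for $E X_n$, and $P(X_n = 1)/q$.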

Solution of (b) by O. P. Lossers, Eindhoven University of Technology, Eindhoven,
The Netherlands, and the editors. We prove the slightly surprising fact that 
$p_n$ does not converge as $n \to \infty$, that instead it oscillates around the pseudo-limit 
$-p/\log q$ with small, but nevertheless non-vanishing, amplitudes. Let $k_n = 
\lfloor -\log n/\log q \rfloor$ so that $1 \le n q^{k_n} < 1/q$. Let $\lambda_n = q^{-\{-\log n/\log q\}}$, where $\{x\}$ stands 
for $x - \lfloor x \rfloor$, the fractional part of $x$. Then $q^{k_n}/\lambda_n = q^{-\log n/\log q} = 1/n$, which 
may be written as $n q^{k_n} = \lambda_n$. We may now rewrite $p_n$ as 

\[ p_n = pn \sum_{k=0}^{\infty} q^k (1 - q^k)^{n-1} \]

\[ = pn \sum_{l=-k_n}^{\infty} q^{l+k_n} (1 - q^{l+k_n})^{n-1} \]

\[ = p \sum_{l=-k_n}^{\infty} \lambda_n q^l \left[ 1 - \frac{\lambda_n q^l}{n} \right]^{n-1} \]

\[ \approx p \sum_{l=-\infty}^{\infty} \lambda_n q^l e^{-\lambda_n q^l}, \qquad (1) \]

where the error in the last approximation approaches 0 as $n \to \infty$. It is clear that 
(1) depends on $n$ only through $\lambda_n$, which in turn is periodic in the argument $\log n$ 
(with period $\log q$). Although (1) is reasonably well-approximated by the integral 

\[ np \int_0^\infty q^x (1 - q^x)^{n-1}\,dx = -p \left. \frac{(1 - q^x)^n}{\log q} \right|_0^\infty = \frac{-p}{\log q}, \]

which depends only on $p$ and $q$, it is not (quite) independent of $n$. To see this, 
write $\lambda_n = q^{-x}$ for $0 \le x < 1$ to form a continuous analog of (1): $f(x) = 
p \sum_{l=-\infty}^{\infty} q^{l-x} e^{-q^{l-x}}$. In the case of a fair coin ($p = q = 1/2$), this function takes all 
values between about .721340 and .721355. Formula (1) samples this function at a 
dense sequence of points; thus $p_n$ will oscillate indefinitely around $1/(2 \log 2) = 
.7213475\ldots$ without converging. 
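
The size of the oscillation can be seen by evaluating the continuous analog directly. This is a minimal sketch under stated assumptions: it takes $f(x) = p \sum_{l} q^{l-x} e^{-q^{l-x}}$, truncates the sum to $|l| \le 80$ (the omitted terms are far below double precision for a fair coin), and samples $x$ on a grid of 1000 points. The function name and these parameters are illustrative.

```python
import math


def continuous_analog(x, p=0.5, span=80):
    """f(x) = p * sum over l of q^(l-x) * exp(-q^(l-x)), truncated to |l| <= span."""
    q = 1.0 - p
    total = 0.0
    for l in range(-span, span + 1):
        term = q ** (l - x)
        total += term * math.exp(-term)
    return p * total


values = [continuous_analog(i / 1000) for i in range(1000)]
print(min(values), max(values))
print(1 / (2 * math.log(2)))
```

For the fair coin the minimum and maximum differ only in the fifth decimal place, which is why the failure to converge is easy to miss in small computations.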


Editorial comment. Howard Taylor has indicated a method to connect part (b) 
with more familiar methods of Mathematical Statistics. Each of the $n$ individual 
coins is replaced by an exponentially distributed random variable $\eta_i$ with parameter
$\lambda$, such that $P(\eta_i > x) = e^{-\lambda x}$, for $x > 0$. The number of tosses of a coin is 
$\lceil \eta_i \rceil$, and $X_n$ is the number of $\eta_i$ in the interval $(k - 1, k]$, where $k = 
\lceil \max\{\eta_1, \ldots, \eta_n\} \rceil$. The analysis leading to (1) is then performed for an exponential 
distribution. 
Bennett Eisenberg, Gilbert Stengle and Gilbert Strang, “The asymptotic probability
of a tie for first place”, Annals Appl. Probab. vol. 3, no. 3 (Aug. 1993) deals 
with a slight variant of this problem. Freed of the coin-tossing model, the probability 
$p_j$ of score $j$ is no longer constrained to follow the geometric distribution. The 
probability distribution is assumed to be the same for all “‘players’’, and the effect 
of the number of players on the number obtaining the highest score is studied. The 
results obtained for the geometric distribution are similar to those given above. 
Slightly more general results were obtained by J. J. A. M. Brands, F. W. Steutel 
and R. J. G. Wilms, “On the number of maxima of a discrete sample,” Statistics 
and Probability Letters (to appear). 


Part (a) also solved by R. A. Agnew, M. N. Deshpande (India), O. P. Lossers (The Netherlands), 
R. M. Robinson, J. Sarkar, P. Warner (student), Western Maryland College Problems group, and the 
proposer. P. Griffin and the Western Maryland College Problems group also discovered, by
computational evidence, the oscillatory behavior of $p_n$. 


Divisibility of the Determinant of a Cayley Graph 


6657 [1991, 372]. Proposed by J. J. Rotman and P. M. Weichsel, University of 
Illinois at Urbana-Champaign. 


Suppose we make a finite group $G$ into a graph $\Gamma$ by defining two elements $a$, $b$ 
of $G$ to be adjacent if $(ab^{-1})^2 \ne e$. If the elements of $G$ are $v_1, v_2, \ldots, v_n$, let $A$ 
be the $n$ by $n$ matrix in which the element in the $i$th row and $j$th column is 1 if $v_i$ 
and $v_j$ are adjacent and is 0 if $v_i$ and $v_j$ are not adjacent. ($A$ is an adjacency matrix 
for $\Gamma$.) 

Show that $\det A$ is an even integer. 


Solution I by David Beckwith, Sag Harbor, NY. A determinant is unaltered if 
any one column is replaced by the sum of all the columns; hence if M is a square 
matrix with integer entries such that all row-sums are even, then det M is an even 
integer. 

For the given finite group $G$, define $X = \{x \in G : x^2 \ne e\}$. Then $X$ has an even 
number, $\mu$, of elements (possibly zero) because $x \in X$ implies both $x \ne x^{-1}$ and 
$x^{-1} \in X$. Consider now the $i$th row of the adjacency matrix $A$. Its $j$th entry is 1 if 
$v_j = x^{-1} v_i$ for some $x \in X$ and 0 otherwise. Thus the row-sum equals $\mu$, independent
of $i$, and the result follows. 
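
The row-sum argument can be watched at work on small groups. The sketch below is an illustration only, with hypothetical helper names: it builds the adjacency matrix for the cyclic group $\mathbb{Z}_5$ and the symmetric group $S_3$, then prints the set of row sums and the determinant, computed exactly over the rationals.

```python
from fractions import Fraction
from itertools import permutations


def determinant(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    a = [[Fraction(x) for x in row] for row in matrix]
    n = len(a)
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return 0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            result = -result
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return int(result)


def adjacency(elements, multiply, inverse, identity):
    """A[i][j] = 1 exactly when (v_i * v_j^{-1})^2 is not the identity."""
    def square_is_trivial(g):
        return multiply(g, g) == identity
    return [[0 if square_is_trivial(multiply(a, inverse(b))) else 1 for b in elements]
            for a in elements]


# Cyclic group Z_5 (written additively) and the symmetric group S_3.
z5 = adjacency(range(5), lambda a, b: (a + b) % 5, lambda a: -a % 5, 0)
s3 = adjacency(
    list(permutations(range(3))),
    lambda p, q: tuple(p[q[i]] for i in range(3)),
    lambda p: tuple(p.index(i) for i in range(3)),
    (0, 1, 2),
)
for name, matrix in (("Z_5", z5), ("S_3", s3)):
    print(name, {sum(row) for row in matrix}, determinant(matrix))
```

In both cases every row has the same even sum, and the determinant comes out even, as the argument predicts.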


Solution II by M. N. Ellingham, Gordon F. Royle and C. C. Timar, Vanderbilt 
University, Nashville, TN. More generally, the valency of a regular graph is a 
divisor of the determinant of its adjacency matrix. Let $\phi(M; x)$ denote the 
characteristic polynomial of an integer matrix $M$. Then $\phi(M; x)$ is a monic integer 
polynomial with constant term $\pm\det(M)$. Thus, any rational zero of $\phi(M; x)$ must be 
an integer dividing $\det(M)$. However, the adjacency matrix of a regular graph of 
valency $k$ has the column vector $(1, 1, \ldots, 1)$ as an eigenvector with eigenvalue $k$. 

Now, the argument of Solution I shows that $\Gamma$ is a regular graph of valency $\mu$, 
so $\mu \mid \det A$. Since $\mu$ is even, the result follows. 


Editorial comment. The two selected solutions were typical of most solutions. In 
some cases, all algebra with the adjacency matrix was done modulo 2, or modulo 
$\mu$. Tad White used an argument which paired non-zero terms in the usual 
expansion of $\det A$ and Paul J. Zweir used a similar pairing on an expression for 
the value of $\det A$ in terms of certain subgraphs of $\Gamma$ (see D. M. Cvetković, M. 
Doob and H. Sachs, Spectra of Graphs, Academic Press, 1979, sect. 1.4, or F. 
Harary, “The determinant of the adjacency matrix of a graph”, SIAM Review 4 
(1962)). 

Thomas Honold noted that the matrix $A$ is $\sum_{g \in X} L(g)$, where $L$ is the left 
regular representation of G. This matrix is similar to one in block diagonal form 
with blocks obtained in the same way from the irreducible representations of G. 
Each irreducible representation occurs with multiplicity equal to its degree, so 
det A can be expressed as a product of powers of corresponding determinants for 
the irreducible representations. The factor arising from the trivial representation is 
$\mu$, and the factor arising from each other representation is an integer. This is a 
more elaborate version of the use of eigenvectors in Solution II. However, this 
approach also identifies other factors of det A. If G contains a central element z 
of order 2, then any representation restricting to the non-trivial representation of 
{e, z} contributes a factor of zero to det A, so that det A = 0 in this case. Several 
solvers also noticed this fact, with a direct proof based on noticing that the rows of 
A corresponding to e and z are the same. On the other hand, it appeared to have 
escaped notice that the group $A_4$ also leads to a zero determinant. An infinite 
family of examples can be generated by considering, for each positive integer $k$, 
the group of all linear functions $x \mapsto ax + b$ over a field of $q = 2^k$ elements. This 
group of order $q(q - 1)$ has $q - 1$ one-dimensional representations and 
one $(q - 1)$-dimensional representation. The latter representation is easily seen to 
contribute a zero factor to det A. Other examples of a similar nature have been 
found, but a complete characterization of groups G with det A = 0 is unknown, 
and would be interesting. 


Solved also by D. Alvis, D. Astles (U.K.), D. Callan, R. J. Chapman (U.K.), P. Check and D. Birsan 
(students, Canada), A. J. Douglas (England), W. H. Gustafson, L. Hogben, Th. Honold (Germany), S. 
Kanetkar, D. W. Koster, C. Lanski, S. C. Locke, O. P. Lossers (The Netherlands), T. White, G. Zappa 
(Italy), P. J. Zweir, Central Michigan University Problem group, National Security Agency Problems 
Group, and the proposers. 


A Cauchy-Schwarz Inequality for Determinants 


E 3464 [1991, 852]. Proposed by S. J. Bernau and Gavin G. Gregory, University of 
Texas at El Paso. 


If $m$ and $n$ are given positive integers and if $A$ and $B$ are $m$ by $n$ matrices with 
real entries, prove that $(\det AB^T)^2 \le (\det AA^T)(\det BB^T)$. 


Composite solution I by many readers. If $m > n$, then $AB^T$ and $AA^T$ are $m$ by $m$ 
matrices with rank at most $n$, so $\det(AB^T) = (\det AA^T)(\det BB^T) = 0$, and the 
desired inequality is obvious, so we may assume $m \le n$. For any $m \times n$ matrix $C$ 
and any ascending sequence $w$ of integers $1 \le w_1 < \cdots < w_m \le n$, let $C_w$ denote 
the determinant of the $m \times m$ submatrix of $C$ composed of columns $w_1$ through 
$w_m$. The Binet-Cauchy formula (see F. R. Gantmacher, The Theory of Matrices, 
Chelsea, 1960, vol. I, sect. 1.2) asserts that for any pair $A$, $B$ of $m$ by $n$ matrices, 

\[ \det(AB^T) = \sum_w A_w B_w. \]

Here we have summed over all $w$. Then the assertion of the problem becomes 

\[ \Bigl(\sum_w A_w B_w\Bigr)^2 \le \Bigl(\sum_w A_w^2\Bigr) \Bigl(\sum_w B_w^2\Bigr), \]

which is the Cauchy-Schwarz inequality applied to the $\binom{n}{m}$-vectors $(\ldots, A_w, \ldots)$ 
and $(\ldots, B_w, \ldots)$. Thus equality holds iff $A_w = \lambda B_w$ for all $w$, for some fixed 
number $\lambda$. 

The above argument is also valid for complex matrices if transpose is replaced 
with conjugate transpose. 
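
A small numerical instance makes the Binet-Cauchy step concrete. The sketch below is illustrative only: the two $2 \times 3$ integer matrices are arbitrary examples, the helper names are hypothetical, and the determinant uses cofactor expansion, which is adequate only for small matrices.

```python
from itertools import combinations


def determinant(matrix):
    """Determinant by cofactor expansion along the first row (small matrices only)."""
    if not matrix:
        return 1
    return sum(
        (-1) ** j * matrix[0][j] * determinant([row[:j] + row[j + 1:] for row in matrix[1:]])
        for j in range(len(matrix))
    )


def times_transpose(a, b):
    """The product a * b^T for matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row_a, row_b)) for row_b in b] for row_a in a]


def maximal_minors(matrix):
    """Determinants of all m x m column submatrices, in a fixed order of column sets."""
    m, n = len(matrix), len(matrix[0])
    return [determinant([[row[c] for c in cols] for row in matrix])
            for cols in combinations(range(n), m)]


A = [[1, 2, 3], [4, 5, 6]]
B = [[1, 0, 2], [0, 1, 1]]
minors_a, minors_b = maximal_minors(A), maximal_minors(B)
lhs = determinant(times_transpose(A, B))
print(lhs, sum(x * y for x, y in zip(minors_a, minors_b)))
print(lhs ** 2, determinant(times_transpose(A, A)) * determinant(times_transpose(B, B)))
```

The first line prints the two sides of the Binet-Cauchy formula; the second prints the two sides of the inequality in the problem.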


Solution II by Dennis I. Merino, Southeastern Louisiana University, Hammond, 
LA. As in solution I, we may assume $m \le n$. Denoting the $i$th singular value of a 
matrix $X$ by $\sigma_i(X)$, we have 

\[ (\det AB^T)^2 = \prod_i \sigma_i(AB^T)^2 \]

\[ \le \prod_i \sigma_i(A)^2 \sigma_i(B^T)^2 \]

\[ = \prod_i \sigma_i(A)^2 \sigma_i(B)^2 \]

\[ = \det AA^T \cdot \det BB^T. \qquad (1) \]


The inequality (1) is a special case of Horn’s inequality, Theorem 3.3.4 of Roger A. 
Horn and Charles R. Johnson, Topics in Matrix Analysis, Cambridge, 1991. 


Solution III by Olaf Krafft, RWTH, Aachen, Germany. Since $AA^T$ and $BB^T$ are 
nonnegative definite, one has $\det AA^T \ge 0$ and $\det BB^T \ge 0$. Therefore, the 
inequality is trivially satisfied if $\det AB^T = 0$. Assume that $\det AB^T \ne 0$. Then 
$AB^T$ is regular, hence rank $A$ = rank $B$ = $m$; and, in particular, $BB^T$ is regular. 
The matrix $I - B^T(BB^T)^{-1}B$ is symmetric and idempotent, thus nonnegative 
definite. Now, it is a corollary of the Courant-Fischer theorem (see R. Bellman, 
Introduction to Matrix Analysis, McGraw-Hill, 1960 (or second edition, 1970), sect. 
7.7) that the ordered eigenvalues of $C + D$ and $D$ satisfy $\lambda_i(C + D) \ge \lambda_i(D)$ if $D$ 
is symmetric and $C$ nonnegative definite. If $D$ is also nonnegative definite, so that 
the $\lambda_i(D) \ge 0$, then $\det(C + D) \ge \det D$. Applying this to 

\[ C = A(I - B^T(BB^T)^{-1}B)A^T, \qquad D = AB^T(BB^T)^{-1}BA^T, \]

one gets $\det AA^T \ge \det AB^T(BB^T)^{-1}BA^T = (\det AB^T)^2 (\det BB^T)^{-1}$. 


Solution IV by Thomas L. Markham, University of South Carolina, Columbia, 
SC. It is obvious that 

\[ C = \begin{bmatrix} A \\ B \end{bmatrix} \begin{bmatrix} A^T & B^T \end{bmatrix} \]

is positive semidefinite of order $2m$. If each $m$ by $m$ block of $C$ is replaced by its 
determinant, the resulting compressed matrix is 

\[ \begin{bmatrix} \det(AA^T) & \det(AB^T) \\ \det(BA^T) & \det(BB^T) \end{bmatrix}. \]


Using general results about compressed matrices, such as those found in R. C. 
Thompson, “A determinantal inequality for positive definite matrices’, Canad. 
Math. Bull. 4 (1961), 57-62, one finds that this matrix is also positive semidefinite, 
which proves the result. 
Editorial comment. Although the analogous result over $\mathbb{C}$ using the complex 
conjugate of the transpose was mentioned only briefly in Solution I, several solvers 
phrased their solutions in those terms, and all approaches to the problem can be 
modified in this way. 

A number of readers gave lengthier solutions along the lines of Solution II, 
including a proof of the applicable part of Horn’s inequality. 

Albert Nijenhuis noted that the Binet-Cauchy formula is an explicit coordinate 
representation for an inner product defined on the $m$-th exterior power $\Lambda^m V$ 
where $V$ is either $\mathbb{R}^n$ or $\mathbb{C}^n$. In particular, if $a_1, \ldots, a_m, b_1, \ldots, b_m \in \mathbb{R}^n$, introduce 
an $m$ by $n$ matrix $A$ whose $i$-th row is $a_i$ to describe the element $a_1 \wedge \cdots \wedge a_m$, 
and a matrix $B$ constructed in the same way from the $b_i$, and define
$\langle a_1 \wedge \cdots \wedge a_m, b_1 \wedge \cdots \wedge b_m \rangle$ to be $\det(AB^T)$. This is shown to be well-defined in texts on 
Multilinear Algebra (e.g., M. Marcus, Finite Dimensional Multilinear Algebra, Part 
I, Marcel Dekker, 1973). The use of the Cauchy-Schwarz inequality in Solution I 
leads to $(\det AB^T)^2 = (\det AA^T)(\det BB^T)$ if and only if the exterior product of 
the rows of $A$ and that of $B$ are linearly dependent in $\Lambda^m V$. When $A$ and $B$ 
both have rank $m$, this happens if and only if the two matrices have the same row 
space. 

For further compression inequalities as employed in Solution IV, and general- 
izations of the problem, Thomas L. Markham supplied the reference: C. R. 
Johnson and T. L. Markham, ‘‘Compression and Hadamard power inequalities’, 
Linear and Multilinear Algebra 18 (1985), 23-34; and Roy Mathias supplied 
references to J. de Pillis, “Transformations on Partitioned Matrices”, Duke Math. 
J. 36 (1969), 511-515 and to R. A. Horn and R. Mathias, Cauchy-Schwarz 
Inequalities Associated with Positive Semidefinite Matrices, Linear Algebra Appl. 
142 (1990), 63-82. 


Solved also by J. M. Arregi and J. Sangoniz (Spain), D. W. Bailey, C. I. Caldwell (student), D. 
Callan, R. J. Chapman (U.K.), Y. Diao, M. Dresevi¢é (Yugoslavia), J. C. Goding, G. Janee, J.-P. Grivaux 
(France), J. W. Grossman and S. S.-S. Wang, K. P. Hart, L. Hogben, D. Jespersen, P. G. Kirmser, N. 
Komanda, R. Mathias, G. Miller (student, Canada), J. M. Monier (France), T. S. Nanjundiah (India), Y. 
Nievergelt, A. Nijenhuis, I. Olkin, A. Pedersen (Denmark), W. So, F. B. Strauss, A. Tissier (France), G. 
Trenkler (Germany), M. Vowe (Switzerland), University of Wyoming Problem Circle, and the pro- 
posers. Two incorrect solutions were received. 


Schröder Numbers Modulo 3 


E 3470 [1991, 955]. Proposed by B.M.M. de Weger, University of Twente, Enschede, 
The Netherlands. 


Let $S_n$ be the $n$th Schröder number, defined as the number of polygonal paths 
in the Cartesian plane that start at $(0, 0)$, end at $(n, n)$, contain no points above the 
line $y = x$, and are composed of steps taken from the set $\{(0, 1), (1, 0), (1, 1)\}$. E 
3343 [1989, 734; 1991, 367] asserted that $S_n \equiv 0 \pmod 3$ when $n$ is even. Determine 
$S_n$ modulo 3 when $n$ is odd. 


Solution I by Michael Vowe, Therwil, Switzerland. We will show that $S_{2m+1} \equiv 
(-1)^{m+1} C_m \pmod 3$ for $m \ge 0$, where $C_n = \binom{2n}{n}/(n + 1)$ is the $n$th Catalan 
number. In problem E3343, the recurrence for the Schröder numbers: $S_{n+1} = 3S_n 
+ \sum_{j=1}^{n-1} S_j S_{n-j}$ for $n \ge 1$ was obtained, with $S_0 = 1$ and $S_1 = 2$. 

Letting $R_m = S_{2m-1} \pmod 3$, we can reduce the recurrence modulo 3 to obtain 
$R_{m+1} \equiv \sum_{j=1}^{m} R_j R_{m+1-j} \pmod 3$ for $m \ge 1$, with $R_1 = 2$. This uses $S_{2n} \equiv 0 \pmod 3$. 
If we treat this as a recurrence in integers, it will yield numbers congruent to these 
modulo 3. Let $Q_1 = 2$ and $Q_{m+1} = \sum_{j=1}^{m} Q_j Q_{m+1-j}$ for $m \ge 1$. Let $f(x) = 
\sum_{m \ge 1} Q_m x^m$. The recurrence can be expressed as $f(x) - 2x = f^2(x)$. The solution 
to this is $f(x) = \frac{1}{2}(1 - \sqrt{1 - 8x})$, since $f(0) = Q_0 = 0$. Using the extended binomial
theorem, we compute the coefficient of $x^m$ to be $Q_m = (2^m/m)\binom{2m-2}{m-1}$. 
Since $2 \equiv -1 \pmod 3$, we obtain $R_{m+1} \equiv (-1)^{m+1} C_m \pmod 3$, as desired. 
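
The congruence is easy to test by machine for small indices. This sketch is illustrative and not part of the solution: it assumes the recurrence $S_{n+1} = 3S_n + \sum_{j=1}^{n-1} S_j S_{n-j}$ with $S_0 = 1$, $S_1 = 2$, and checks $S_{2m+1} \equiv (-1)^{m+1} C_m \pmod 3$ together with $S_{2n} \equiv 0 \pmod 3$ for positive even indices. The function names are hypothetical.

```python
from math import comb


def schroeder_numbers(count):
    """S_0, ..., S_{count-1} via S_{n+1} = 3 S_n + sum_{j=1}^{n-1} S_j S_{n-j}."""
    s = [1, 2]
    while len(s) < count:
        n = len(s) - 1
        s.append(3 * s[n] + sum(s[j] * s[n - j] for j in range(1, n)))
    return s[:count]


def catalan(n):
    return comb(2 * n, n) // (n + 1)


s = schroeder_numbers(42)
print(s[:7])
print(all((s[2 * m + 1] - (-1) ** (m + 1) * catalan(m)) % 3 == 0 for m in range(20)))
print(all(s[n] % 3 == 0 for n in range(2, 42, 2)))
```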


Solution II by O.P. Lossers, Eindhoven University of Technology, Eindhoven, The 
Netherlands. Let $m = m_0 + m_1 \cdot 3 + \cdots + m_l \cdot 3^l$ be the ternary expansion of $m$. 
Then $S_{2m-1} \equiv \binom{2}{m_0} \binom{1}{m_1} \cdots \binom{1}{m_l} \pmod 3$ for $m \ge 1$. In particular, modulo 3 we 
have 

\[ S_{2m-1} \equiv \begin{cases} 0 & \text{if } \lfloor m/3 \rfloor \text{ has a 2 in its ternary expansion,} \\ 2 & \text{if } \lfloor m/3 \rfloor \text{ has no 2 and } m \equiv 1 \pmod 3, \\ 1 & \text{if } \lfloor m/3 \rfloor \text{ has no 2 and } m \not\equiv 1 \pmod 3. \end{cases} \]

We begin with the expression $2f(x) = 1 - \sqrt{1 - 8x}$ for $f(x) = \sum Q_m x^m$ found 
above, which we can rewrite modulo 3 as $f(x) \equiv -1 + (1 + x)^{1/2}$. To obtain the 
desired result, it suffices to show 

\[ (1 + x)^{1/2} \equiv \sum_{m \ge 0} \binom{2}{m_0} \binom{1}{m_1} \binom{1}{m_2} \cdots x^m \pmod 3. \]

Consider the power series $F_n(x) = \sum_{k \ge 0} \binom{(3^n + 1)/2}{k} x^k$; we have $[F_n(x)]^2 = (1 + 
x)^{3^n + 1}$. Since $(1 + x)^{3^n} \equiv 1 + x^{3^n} \pmod 3$, we have $[F_n(x)]^2 \equiv 1 + x + x^{3^n} + x^{3^n + 1}$ 
(mod 3). By making $n$ sufficiently large, we can make any desired initial segment of 
coefficients in $F_n(x)$ agree with those of $(1 + x)^{1/2}$ modulo 3. 

This yields the desired result, because as $n$ grows, each coefficient $\binom{(3^n + 1)/2}{k}$ 
reaches and remains at the desired value modulo 3. This uses the theorem of 
Lucas (circa 1878, see L. E. Dickson, History of the Theory of Numbers, Chelsea, 
1971, vol. I, ch. IX, p. 271), which states that if $a = \sum a_i p^i$ and $b = \sum b_i p^i$, then 
$\binom{a}{b} \equiv \prod_i \binom{a_i}{b_i} \pmod p$, where $p$ is a prime. Since $(3^n + 1)/2 = 2 + 1 \cdot 3 + \cdots + 1 
\cdot 3^{n-1}$, the coefficient of $x^m$ in $F_n(x)$ reaches and remains at the value described 
above when $n > \lfloor \log_3 m \rfloor + 1$. Hence $Q_m$ has the value claimed, modulo 3. 


Editorial comment. Most solvers gave explicit descriptions of the set of values of 
$m$ for which $S_{2m-1}$ is congruent to 0, 1, or 2 modulo 3 in terms of the ternary 
expansion of $m$. There were various expressions, with the relatively short arguments
being those using the formula of Lucas. Fred Galvin and George Kangas 
extended the results of E 3343 in a different manner by showing for non-negative 
$m$ that $S_{2m} \equiv 3(-1)^{m+1} \binom{2m}{m} \pmod 9$. In this work, the more complicated generating
function of the Schröder numbers was used, with the congruences modulo 9 
entering later in the argument. 


Solved also by D. M. Bloom, D. Callan, R. J. Chapman (U.K.), R. High, N. Komanda, A. Nijenhuis, 
J. H. Steelman, D. M. Wells, Anchorage Math Solutions Group, National Security Agency Problems 
Group, and the proposer. 
Promising Sequences 


10185 [1992, 60]. Proposed by William C. Waterhouse, Pennsylvania State Univer- 
sity, University Park, PA. 


If P is a polynomial with coefficients in Z, it is immediate that P(b + c) — P(b) 
is an integer multiple of c for all integers b and c. Are there any non-polynomial 
functions f: Z — Z with this property? 


Solution by Gerry Myerson, Macquarie University, Sydney, NSW, Australia. Yes. 
Let 


\[ f(n) = n + n(n^2 - 1^2) + n(n^2 - 1^2)(n^2 - 2^2) + n(n^2 - 1^2)(n^2 - 2^2)(n^2 - 3^2) + \cdots \]

Observe that for each integer $n$, the sum above consists of only finitely many 
non-zero terms. We show that $f$ has the desired properties. For each whole 
number $a$, let $f_a(n)$ be the sum of the first $a$ terms in the series for $f$. Then $f_a(n)$ 
is a polynomial in $n$ with integer coefficients, and $f_a(m) = f(m)$ for all integers $m$ 
with $|m| \le a$. Now, given integers $b$ and $c$, let $a$ be the larger of $|b + c|$ and $|b|$. 
Then $f(b + c) - f(b) = f_a(b + c) - f_a(b)$ is an integer multiple of $c$, since $f_a$ is a 
polynomial with integral coefficients. For positive integers $b$, the summands in 
$f(b)$ are all non-negative and the last non-zero summand is $(2b - 1)!$. Thus, 
$f(b) \ge (2b - 1)!$, whence $f$ grows faster than any polynomial. 
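
The divisibility and growth claims can be spot-checked by direct computation. The sketch below is an illustration, not part of the published solution: the function name is hypothetical, and it assumes the series $f(n) = n + n(n^2 - 1^2) + n(n^2 - 1^2)(n^2 - 2^2) + \cdots$, summed until a term vanishes.

```python
from math import factorial


def myerson(n):
    """f(n) = n + n(n^2 - 1^2) + n(n^2 - 1^2)(n^2 - 2^2) + ..., a finite sum for each n."""
    total, term, k = 0, n, 0
    while term != 0:
        total += term
        k += 1
        term *= n * n - k * k  # this factor is zero once k reaches |n|
    return total


print([myerson(n) for n in range(5)])
print(all((myerson(b + c) - myerson(b)) % c == 0
          for b in range(-8, 9) for c in range(-8, 9) if c != 0))
print(all(myerson(b) >= factorial(2 * b - 1) for b in range(1, 9)))
```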


Editorial comment. Most of the solutions received were similar to the solution 
given above. Several solvers pointed out that a more general choice for f along the 
same lines is 


\[ f(n) = \sum_{j=0}^{\infty} a_j (n + j)(n + j - 1) \cdots (n - j), \]

where the $a_j$ are integers and infinitely many are non-zero. A few solvers also 
mentioned polynomials such as $f(n) = (n^4 - n^2)/2$. Note that, while $f$ is a 
polynomial, it is not in $\mathbb{Z}[x]$, and yet $f(b + c) - f(b)$ is an integer multiple of $c$ 
for all integers $b$ and $c$. 

A small set of solvers ignored explicit formulas entirely. Starting from any 
function $n \mapsto a_n$ from $\mathbb{N}$ onto $\mathbb{Z}$, call a finite sequence of integers $b_0, b_1, \ldots, b_k$ 
promising (terminology of Robert High) if it satisfies $(a_i - a_j) \mid (b_i - b_j)$ for $0 \le i < 
j \le k$. In this approach, one shows that it is always possible to define $b_{k+1}$ in many 
ways to extend a promising sequence. This demonstrates the existence of uncountably
many functions with the required property while there are only countably 
many polynomials. 


Solved also by D. Boixader Ibanez (Spain), G. Calinescu (student, Romania), R. J. Chapman (U.K.), 
P. R. Chernoff, F. J. Flanigan, I. Gessel, R. High, K. S. Kedlaya (student), J. H. van Lint (The 
Netherlands), O. P. Lossers (The Netherlands), G. A. Martin, A. Nijenhuis, J. L. Pietenpol, I. Praton, 
K. Schilling, N. C. Singer, R. Stong, O. Wyler, Theory First, University of South Alabama Problem 
Group, and the proposer. 
A Family of Invertible Matrices 


10214 [1992, 362]. Proposed by Stephen Penrice, Emory University, Atlanta, GA. 


For all integers $n > 1$, let $f(n)$ denote the largest real number such that, for 
any set of non-negative real numbers satisfying $a_1 + \cdots + a_n < f(n)$, the $n$ by $n$ 
matrix with $a_1, \ldots, a_n$ along the main diagonal and $-1$ in all other positions is 
invertible. Show that $f(n)$ is well-defined, and obtain an explicit formula for it. 


Solution by Dirk P. Laurie, Potchefstroom University for Christian Higher Education,
Vanderbijlpark, South Africa. The matrix concerned can be written as 
$M = D - ee^T$, where $D = \mathrm{diag}(1 + a_i)$ and $e$ is a column of ones. By the
Sherman-Morrison formula (see Alston S. Householder, The Theory of Matrices in 
Numerical Analysis, Dover, 1964, sect. 5.1), 

\[ M^{-1} = D^{-1} + (1 - e^T D^{-1} e)^{-1} D^{-1} e e^T D^{-1}; \]

hence $M$ is invertible whenever $e^T D^{-1} e \ne 1$. Using the fact that the arithmetic 
mean of positive numbers is not less than their harmonic mean, we obtain 

\[ e^T D^{-1} e = \sum_{i=1}^{n} \frac{1}{1 + a_i} \ge \frac{n^2}{\sum_{i=1}^{n} (1 + a_i)} \ge \frac{n^2}{n + f(n)} > 1 \quad \text{if } f(n) < n^2 - n. \]

The function $f(n) = n^2 - n$ is best possible, since if all $a_i = n - 1$, then $M$ is not 
invertible because $Me = 0$. 
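
The inverse formula and the singular boundary case can both be checked in exact arithmetic. The sketch below is illustrative only: the function names and the sample diagonals are assumptions, and it takes the Sherman-Morrison inverse of $D - ee^T$ in the form $D^{-1} + D^{-1}ee^TD^{-1}/(1 - e^TD^{-1}e)$.

```python
from fractions import Fraction


def build_matrix(a):
    """Matrix with a_i on the diagonal and -1 elsewhere."""
    n = len(a)
    return [[Fraction(a[i]) if i == j else Fraction(-1) for j in range(n)] for i in range(n)]


def sherman_morrison_inverse(a):
    """Inverse of D - e e^T with D = diag(1 + a_i); None when e^T D^{-1} e = 1."""
    d_inv = [1 / Fraction(1 + x) for x in a]  # assumes every a_i > -1
    s = sum(d_inv)
    if s == 1:
        return None
    n = len(a)
    return [[(d_inv[i] if i == j else 0) + d_inv[i] * d_inv[j] / (1 - s) for j in range(n)]
            for i in range(n)]


def mat_mul(x, y):
    return [[sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
            for i in range(len(x))]


a = [0, 1, 2]
identity = [[Fraction(int(i == j)) for j in range(3)] for i in range(3)]
print(mat_mul(build_matrix(a), sherman_morrison_inverse(a)) == identity)

# Boundary case n = 3, all a_i = n - 1: every row sums to zero, so M e = 0.
print(sherman_morrison_inverse([2, 2, 2]), [sum(row) for row in build_matrix([2, 2, 2])])
```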


Editorial comment. Many solvers noted that the same result holds if the 
condition $a_i \ge 0$ is weakened to $a_i > -1$. 


Solved also by J. C. Binz (Switzerland), B. W. Brock, F. Brulois and T. Shore and D. B. Tyler, D. 
Callan, R. J. Chapman (U.K.), M. Dindos (Slovakia), Z. Franco, K. S. Kedlaya (student), N. Komanda, 
O. Krafft (Germany), O. P. Lossers (The Netherlands), M. Mocsy (Hungary), A. Nijenhuis, A. Pedersen 
(Denmark), E. Schmeichel, R. Stong, M. Tsatsomeros (Canada), and the proposer. 


Collaborating editors: David F. Appleyard, Paul T. Bateman, Duane M. Broline, 
Barry W. Brunson, Frank S. Cater, Gulbank D. Chakerian, Underwood Dudley, 
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. 
Griggs, Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, 
Daniel J. Kleitman, Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer, 
Stephen L. Portnoy, J. O. Shallit, John Henry Steelman, Kenneth B. Stolarsky, 
David E. Tepper, Douglas B. Tyler, Daniel Ullman, and William E. Watkins. 


Answer to Picture Puzzle 
(p. 27) 


Solomon Lefschetz. 
REVIEWS 


Edited by Darrell Haile 
Indiana University, Bloomington, IN 47405 


Mathematical Cranks by Underwood Dudley. The Mathematical Association 
of America, Washington, 1992, v + 372, $25.00. 


Reviewed by Ian Stewart


A few years ago I wrote a short article in New Scientist, in which I made passing 
reference to “the Fibonacci sequence 1, 1, 2, 3, 5, 8, 13, ....” (The article was
actually about the fractal structure of diffusion-limited aggregates, but that’s by the 
by.) Shortly afterwards the editors received a somewhat vituperative letter com- 
plaining about my appalling ignorance. I explained to the editors why I believed 
myself to be blameless, and they spiked the letter. Over the next few weeks they 
received repeated telephone calls from the same person, arguing the same point; 
he also tried to telephone me to discuss the matter, but my secretary managed to 
divert him with lies about my absence. 

What was this terrible error that I had made, an error so important that the 
attempt to have it corrected occupied so much time and effort? 

Simple. The Fibonacci series goes zero, 1, 1, 2, 3, 5, 8, 13, ....

I don’t particularly want to argue the toss here, but I think most mathematicians 
would concur that whether you include an initial zero is largely a matter of 
convention. As it happened, I had excellent journalistic reasons for excluding it. 
The pattern of numbers then fitted the experimentalists’ picture precisely. If I had 
included the zero, I would also have had to spend several sentences explaining why 
it didn’t show up in the picture. All of this would be a distraction from the main 
point—about pattern-formation—and the word limit of 500 was already tight. 

I don’t know that the person involved was a crank, but they exhibited one of
the crucial symptoms, an obsession with trivialities. Very specific trivialities: I 
imagine he specialised in Fibonacci numerology. He was probably a perfectly 
reasonable person on most matters, but push the Fibonacci button and he would 
perform. 

Cranks are a pest: to their families and friends, to us professionals, and to 
themselves. So why would anybody write an entire book about them? Because, says 
Underwood Dudley, ‘cranks are interesting.’ And he’s right.

Mathematical Cranks fits into a tiny but respected genre, which began with 
Augustus de Morgan’s A Budget of Paradoxes. Martin Gardner’s Fads and 
Fallacies is another, though not addressed to mathematics; and there’s also 
Dudley’s own A Budget of Trisections. Dudley collects cranks—it would be more 
prosaic, but less representative of the spirit of the enterprise, to say he collects 
crank writings—and he has solved a major problem for many of us. Before, when 
we received a crank letter, we would have to debate whether to answer it (and risk 
an endless and pointless ensuing correspondence), be exceedingly rude to its author
in the hope that he (it virtually always is a he) will go away, or just bin it. (I must
here confess to once binning a letter from a very nice doctor whose patient was the
crank. The doctor had no way of knowing whether the patient’s mathematical
research was valuable or not, and wanted professional advice. My problem was 
that the patient was at the time confined to a secure unit for the mentally ill that 
specialises in homicidal maniacs. Would you tell such a person—however nice and 
well-meaning his doctor is—that his life’s work on Fermat’s Last Theorem is 
complete trash?) Be that as it may, Dudley’s hobby of crank-collection has solved 
all such dilemmas. Now I just shove the lot in an envelope and send it to him. I 
recommend that you all do the same. (He’s at DePauw University, Greencastle, 
IN. No, look, he really wants this stuff. ...) 

The activities exposed in Mathematical Cranks take a variety of forms, ranging 
from utter nonsense to ideas that might, in other people’s hands, form the basis
of some genuine research projects. The Top Ten of mathematical crankhood is 


 1 Squaring the Circle
 2 Trisecting the Angle
 3 Fermat’s Last Theorem
 4 Proving the Parallel Postulate
 5 The Golden Number
 6 Perfect Numbers
 7 The Four Colour Theorem
 8 Number Bases
 9 Cantor’s Diagonal Argument
10 Duplicating the Cube.


It may seem strange that the ‘three classical problems of antiquity’ (actually 
nothing of the sort) come as numbers 1, 2, and then a huge gap until 10. But 
cranks’ intuition agrees with that of most mathematicians: duplicating the cube is 
boring in comparison to the other two. Number base cranks advocate a change 
from base ten to something more sensible, usually 12 but sometimes 8 or 16. They 
are cranks not because this is a bad idea—on the contrary, base 12 would make 
excellent sense—but because as a matter of practicality it is even less likely to 
come about than a change from the QWERTY keyboard. (My own view here is 
that it would be simpler to reorder the alphabet to start at Q. Let me tell you all 
about it, it shouldn’t take more than a week...) The four-colour theorem has 
dropped down the charts since Appel-Haken: nowadays cranks confine themselves 
to seeking simple proofs—or at least proofs simpler than many hours of supercom- 
puter time. Cantor’s proof of the uncountability of the real numbers is perhaps the 
most sophisticated member of the Top Ten; Fermat’s last theorem is the most 
significant open problem (added in proof: well, it was); and the parallel postulate 
has had the greatest influence on mathematics. 

There are three patterns to this list. The first is that cranks have an unbounded 
belief in their own ability to solve problems for which there is a theorem that no 
solution exists. The second is that if a result is counter-intuitive, a crank will prefer 
to deny it rather than to refine his intuition. The third is that the kind of problem 
that cranks know about is the sort of thing that a schoolteacher might throw out to 
the odd bright pupil. While the rest of the class is struggling with how to bisect 
angles, there’s one little lad with shining eyes who’s set his sights on dividing them 
into n equal parts. Kindly teacher points out that the case n = 3 is known to be 
impossible; or perhaps just vaguely recalls being told that nobody has ever been 
able to solve it. At that instant a crank is born: a message is imprinted somewhere 
in the depths of the child’s brain. That message is not that the problem has been
understood long ago: it is that it remains unsolved and that vast fame and fortune
await the intrepid solver. It is in the nature of the disease that it will then lie
dormant for anything up to 50 years, to reveal its deadly symptoms only upon 
retirement, when the sufferer has finally found the time to work on the question 
that has been festering away these past many decades. 

Teachers (and I here count myself as one): do not worry about this. We do far 
more good to the majority of our students by making it clear that mathematics 
includes both unsolved problems and unsolvable ones. We also do far more 
damage to our students in other unavoidable ways, and we usually never know it. 
That is in the nature of our chosen calling, just as doctors kill most of their 
patients in the long run. In any case, it seems likely that crankhood is in fact a 
genuine medical condition. There is a known disease of the mind—regrettably I 
can’t recall its name—whose symptoms include an obsession with very specific 
trivia. Cranks surely suffer from some version of this. 

Some samples. 

There is the fascinating case of HJ (Dudley uses initials only on the grounds 
that it is the ideas of cranks, not the people themselves, that he wishes to discuss) 
who invented Celestial Calculus. Ordinary calculus adds a little bit (δx) to x and
sees by how much (δy) the function y = f(x) changes. HJ multiplies x by
something a little bit bigger than 1 and sees how much y multiplies by. This is a
neat idea. It replaces the usual differential operator D = d/dx by a different one
L, which can be expressed as

$$L(y) = \frac{d(\ln y)}{d(\ln x)}.$$

It’s rather cute:

$$L(x^n) = n, \qquad L(e^x) = x, \qquad L(uv^{\pm 1}) = L(u) \pm L(v).$$


The formula for L(u + v) is not so cute, however, and L(L(u)) is a pig. A keen 
calculus class would have fun with this operator. What’s so cranky here? Nothing 
—until you read HJ’s estimation of its importance: “Clearly the most unifying and 
comprehensive identity in mathematics.” 
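
For a class that wants to play with the operator, a rough numerical sketch might look like the following. It assumes x > 0 and a positive function f, and it follows HJ's recipe literally: multiply x by a factor slightly above and slightly below 1, then compare logarithms. The function name celestial_l and the step size are illustrative choices, not anything taken from HJ or from Dudley.

```python
import math


def celestial_l(f, x, rel_step=1e-6):
    """Approximate L(f)(x) = d(ln f)/d(ln x) for x > 0 and f > 0.

    x is multiplied by exp(+rel_step) and exp(-rel_step), so the quotient
    below is a central difference taken in the variable ln x.
    """
    up = f(x * math.exp(rel_step))
    down = f(x * math.exp(-rel_step))
    return (math.log(up) - math.log(down)) / (2.0 * rel_step)


if __name__ == "__main__":
    print(celestial_l(lambda x: x ** 3, 2.0))   # compare with L(x^n) = n
    print(celestial_l(math.exp, 2.0))           # compare with L(e^x) = x
```

Since L(y) = x y'/y, the sum rule works out to L(u + v) = (u L(u) + v L(v))/(u + v), which is one way to see why it is less cute than the product rule.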

WD simply couldn’t bear the Banach-Tarski Theorem, that a sphere can be cut 
into a finite number of pieces and reassembled to give two spheres the same size as 
the original. This would be paradoxical if it referred to physical spheres and to 
pieces that could actually be made, but it doesn’t and it’s not. WD realised that the 
proof of the Banach-Tarski Theorem depended on set theory, so he decided that 
set theory had to go. His chosen weak link was Cantor’s diagonal proof that the 
real numbers are uncountable, and his idea was to construct a countable list of all 
real numbers. Buried in the verbiage is the list: .1, .2, .3, .4, .5, .6, .7, .8, .9, .10, .11,
.12, .13, ... and so on. Students often make the same error, but they usually
understand when it is pointed out that the list includes only the terminating 
decimals. Cranks don’t. 
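
The objection can be made concrete with a short sketch. Assuming the k-th entry of WD's list is a decimal point followed by the digits of k (the reading suggested by the entries quoted above), each entry is k divided by a power of ten, so its reduced denominator has no prime factors other than 2 and 5. The helper names below are hypothetical.

```python
from fractions import Fraction
from itertools import count, islice


def wd_list():
    """Yield WD's list .1, .2, ..., .9, .10, .11, ... as exact fractions."""
    for k in count(1):
        yield Fraction(k, 10 ** len(str(k)))


def is_terminating(q):
    """True if the reduced denominator of q has only 2 and 5 as prime factors."""
    d = q.denominator
    for p in (2, 5):
        while d % p == 0:
            d //= p
    return d == 1


if __name__ == "__main__":
    sample = list(islice(wd_list(), 10000))
    print(all(is_terminating(q) for q in sample))
    print(Fraction(1, 3) in set(sample))
```

A finite sample proves nothing by itself, of course; the point is the denominator argument in is_terminating, which applies to every entry and excludes 1/3 from the whole list.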

JB mailed a two-page paper to every math department in the United States, 
with the original and succinct title: 
His discovery is the impressive-looking formula


$$\sum^{(N-1)^E} \left( \sqrt[E]{N^E} - \sqrt[E]{N^E - 1} \right) = 1.$$


The notation’s slightly adrift, and it’s a useful exercise to correct it. He meant that, 
for example, when N = 3, E = 2 the sum should be 


$$(\sqrt{9} - \sqrt{8}) + (\sqrt{8} - \sqrt{7}) + (\sqrt{7} - \sqrt{6}) + (\sqrt{6} - \sqrt{5}) + (\sqrt{5} - \sqrt{4}).$$


Think about it. 
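
For readers who would rather compute than think, here is a small sketch of the sum. It assumes the corrected reading is a sum over k from (N-1)^E + 1 up to N^E of the difference between the E-th roots of k and k - 1, which for N = 3, E = 2 is exactly the example above; the function name jb_sum is an illustrative label.

```python
def jb_sum(n, e):
    """Sum k**(1/e) - (k - 1)**(1/e) for k from (n - 1)**e + 1 up to n**e."""
    lo, hi = (n - 1) ** e, n ** e
    return sum(k ** (1.0 / e) - (k - 1) ** (1.0 / e) for k in range(lo + 1, hi + 1))


if __name__ == "__main__":
    print(jb_sum(3, 2))   # the five-term example with square roots
    print(jb_sum(5, 3))   # a longer case with cube roots
```

Every intermediate root appears once with each sign, so all that survives is the E-th root of N^E minus the E-th root of (N-1)^E, that is, N - (N - 1).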

GB, some kind of engineer in his mid-fifties, spent a lot of time trying to find 
good ways to calculate tables of primes and the like. As a mathematician who knew 
him explains: ‘‘He genuinely expected some sort of ovation from the mathematical 
community, which he envisioned squirming in its chairs for want of good methods 
of calculating square roots, tabling Pythagorean triples, and listing the primes... 
What he has been doing lately is this: he considers the one-sided infinite matrix 
whose (i, j) entry is $i^2 + j$, and looks at the numbers that lie on lines, parabolas,
and hyperbolas drawn in this array, with the idea of discovering patterns. He has 
discovered some God-awful patterns so far.” For instance, the diagonal running 
upwards to the right from entry (10, 1) reads 101, 83, 67, 53, 41, 31, 23, 17, 13, 11. All 
primes! One can see why GB was so interested. Unfortunately, because no 
polynomial can take only prime values, his search is ultimately doomed to failure. 
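
GB's diagonal is easy to reproduce. The sketch below assumes 1-based indices and the entry i^2 + j, so that along the diagonal through (10, 1) one has i + j = 11 and the entries are the values of the quadratic i^2 - i + 11. The helper names are illustrative.

```python
def is_prime(m):
    """Trial-division primality test, adequate for small integers."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True


def entry(i, j):
    """The (i, j) entry of GB's array."""
    return i * i + j


# The diagonal running upwards to the right from (10, 1): i falls as j rises.
diagonal = [entry(i, 11 - i) for i in range(10, 0, -1)]

if __name__ == "__main__":
    print(diagonal)
    print(all(is_prime(v) for v in diagonal))
    # One step past the edge of the array the same quadratic gives 11**2 - 11 + 11.
    print(is_prime(11 * 11 - 11 + 11))
```

The last line illustrates the obstruction mentioned above: the quadratic behind the diagonal takes the value 121 at i = 11, which is 11 squared.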

Cranks have infinite confidence and zero modesty. LJ was so incensed about the 
primitive state of the theory of differential equations that he wrote The Stupid, 
Ridiculous Oversight to put the matter right. It is quite the exception among crank 
writing: it tackles such things as Yukawa’s solution of the wave equation for the 
muon, and Schrödinger’s equation for the energy levels of hydrogen. It contains an
exact solution of van der Pol’s equation, which LJ recasts as 


y” + Ay’ + By*y' + Cy = —Die™. 

His solution takes the form 
[cos wt) © (n/n!)(at)”. 
n=0 

We need to be told what n is; and LJ explains in a series of equations that went 

Q= 1 

1= -(A+B)w"'/2 - Dw~?/2 
and so on until 

4 = | A(31— 3)m — B(3 — 271+ 612+ 2111) 
—~C(2~1)]o~? + 62 —1,.... 


Dudley remarks that “the ‘...’ doesn’t help much, but never mind,” and goes 
on to test the solution—whatever ‘...’ means—by taking the extreme case 
A = B = C = D = 0. Now the equation becomes y'' = 0. The solution is


y=at+b 
for constants a and b. LJ’s solution, on the other hand, reduces to
1 a) 4 
y = (cos wt)({1 + 5 (4) + qr (@2) +... ]. 


“LJ must have committed an oversight, perhaps stupid and ridiculous.” 

As well as providing a representative survey of crank writings, and a generally 
compassionate but occasionally exasperated view of their authors, Mathematical 
Cranks also addresses various practical issues: how cranks are created, how to 
respond to them, and their (totally ungrounded) belief that it is possible to make 
money out of mathematics. In particular, we are urged never to encourage a crank 
with vague statements that he will read as praise but you know are the 
opposite—for example “If your work were to contain significant results, I would 
be happy to publish it in my journal.” The crank doesn’t see the ‘if’—or if he does, 
he knows his results are significant. So why won’t you publish them? Equally, it is 
highly dangerous to say “I found your results very interesting but I am not
competent to judge them” when what you mean is “Your work is obvious trash but
I’m not prepared to waste my time saying why”. You may well find yourself quoted
in the next version of the work—as being incompetent, but dazzled by its 
brilliance. 

Mathematical Cranks is unique, and it exists. For both of these properties we 
should be grateful. 


Mathematics Institute 


University of Warwick 
Coventry CV4 7AL, England 


Complex Analysis: The Geometric Viewpoint. By Steven G. Krantz. The Carus
Mathematical Monographs, Number 23, The Mathematical Association of
America, ix + 212, 1990.


Reviewed by John Polking 


One of the more fruitful and beautiful approaches to complex function theory in 
one and in several variables involves the interaction between holomorphic func- 
tions and mappings and various natural Riemannian metrics. The applications of 
this geometric approach include value distribution theory. The simplest example of 
this theory is the Liouville Theorem which says that any bounded entire function 
must be a constant. A more difficult example is the Little Picard Theorem, which 
says that any entire holomorphic function whose range omits two points must be a 
constant. Another application of the geometric approach is the study of properties 
of the group of biholomorphisms of a domain. These can range from determina- 
tion of the group in simple cases such as the disk or the plane, to finding 
conditions under which the group is compact or noncompact. 

Historically the first element of the geometric approach was provided by the 
introduction of what is called the Poincaré metric on the unit disk by Poincaré in 
1881. This is the Riemannian metric 


$$ds^2 = (1 - |z|^2)^{-2}\,(dx^2 + dy^2).$$

Using this metric the length of a curve parametrized by $\gamma(t)$ for $a \le t \le b$ is given
by

$$L(\gamma) = \int_a^b \frac{|\gamma'(t)|}{1 - |\gamma(t)|^2}\,dt.$$

The Poincaré metric provides the disk with a geometric structure which is a model
for noneuclidean geometry, a fact that is still a source of inspiration to geometers.

The application to complex analysis begins with Georg Pick’s version of the
standard Schwarz Lemma of complex analysis in 1916. This result, reinterpreted in
modern geometric parlance, says that a holomorphic function f from the unit disk
to itself cannot increase distance in the Poincaré metric. In terms of the length of
curves it says that $L(f \circ \gamma) \le L(\gamma)$. From this point on the geometric approach to
complex function theory became a fruitful area of research.

A signal result in the development of the geometric approach was provided by
Lars Ahlfors in 1938. He noticed that the Schwarz-Pick result remained true for
maps from the disk equipped with the Poincaré metric to a domain U with an
arbitrary metric of the form

$$ds^2 = \sigma(z)^2\,(dx^2 + dy^2), \qquad (1)$$

as long as this metric has negative curvature, bounded above by −4 (the curvature
of the Poincaré metric is identically equal to −4). “Curvature” refers to the
ordinary curvature of a Riemannian metric. In the case of a metric like (1) a
computation shows that it is equal to

$$K_\sigma(z) = \frac{-\Delta \log \sigma(z)}{\sigma(z)^2}, \qquad (2)$$

where

$$\Delta = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}.$$
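
The claim that the Poincaré metric has curvature identically −4 can be checked numerically from formula (2). The following is only a sketch: it takes σ(z) = 1/(1 − |z|²) for the Poincaré metric, approximates the Laplacian by a five-point finite difference, and uses function names and a step size chosen here for illustration.

```python
import math


def poincare_sigma(x, y):
    """Conformal factor of the Poincare metric at z = x + iy, for |z| < 1."""
    return 1.0 / (1.0 - (x * x + y * y))


def curvature(sigma, x, y, h=1e-3):
    """Approximate -(Laplacian of log sigma) / sigma**2 at the point (x, y).

    The Laplacian is replaced by the standard five-point finite difference,
    so the result carries an error of order h**2.
    """
    def log_sigma(u, v):
        return math.log(sigma(u, v))

    laplacian = (log_sigma(x + h, y) + log_sigma(x - h, y)
                 + log_sigma(x, y + h) + log_sigma(x, y - h)
                 - 4.0 * log_sigma(x, y)) / (h * h)
    return -laplacian / sigma(x, y) ** 2


if __name__ == "__main__":
    for point in [(0.0, 0.0), (0.3, 0.4), (-0.6, 0.2)]:
        print(point, curvature(poincare_sigma, *point))
```

Applied to the constant factor σ = 1, the euclidean metric, the same function returns zero, as it should for a flat metric.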


Over the past fifty years the geometric approach has seen remarkable advances in 
both one and several complex variables. It has proven to be a principal unifying 
theme in complex analysis. 

As beautiful as this subject is, and as important as it has become in current 
research, there has not been an expository account available until the book by 
Steven Krantz appeared. Krantz’s book is aimed at readers who have had one 
semester of complex analysis. Although geometry is used throughout, no prior 
knowledge of differential geometry is required to read it. Very little geometry is 
actually used, and that can be explained quite easily. The entire book can be 
considered to be an elementary approach to certain topics in complex function 
theory. In 200 small pages the geometric theory as outlined above (and more) is 
presented in a most coherent and clear manner. The book is a delight to read. It 
provides a wonderful advertisement for modern ideas in complex analysis and 
geometry. 

The book begins (in Chapter 0) with a survey of the beginning of complex 
analysis. It is not a complete survey of that subject, since the author is interested in 
only part of it. Particular attention is paid to results involving restrictions on the 
range of a function such as the Liouville Theorem, the Casorati-Weierstrass 
Theorem, and the Picard Theorems, and on results involving conformal mapping
such as the Riemann Mapping Theorem and the classification of the biholomor- 
phisms of the disk. The proofs here are not always complete, but they always 
convey the key ideas. In fact, forgoing complete proofs at times enables the
author to provide a very nice overview of this part of complex analysis. 

The basic geometric ideas that are needed in the book are introduced in 
Chapter 1. The only Riemannian metrics considered are those which are confor- 
mal to the euclidean metric, i.e. those of the form indicated in (1). Thus only the 
single coefficient function o is of concern, greatly simplifying the geometrical 
considerations. This chapter ends with the Schwarz-Pick Lemma referred to 
earlier. 

Chapter 2 begins with the introduction of the notion of curvature. The defini- 
tion given is that in (2). Although this analytic formula is all that is needed, the 
formula seems unmotivated. The author realizes this, and includes a paragraph 
discussing the point. However this paragraph only directs the reader to the 
Appendix where there is a very formal discussion of curvature. Perhaps it would 
have been better to include a more informal discussion of curvature which would 
serve to convince the reader of the geometric content of the formula. The model of 
Chapter 0 could have been very appropriately followed here. 

This minor quibble aside, Chapter 2 is the heart of the book. After the 
discussion of curvature, Ahlfors’ version of the Schwarz Lemma is presented, 
followed fairly quickly by proofs of the Little Picard Theorem, a unifying discussion
of the notion of normal families using the spherical metric, and proofs of
Montel’s Theorem and the Great Picard Theorem. The quickness is due to the 
unifying effect of the Ahlfors result. For all of these applications one global result 
is needed—the rather easy fact that on the complex plane with two points 
removed, a metric can be found which has negative curvature bounded above by a 
constant —B < 0. 

In Chapter 3 the author introduces the Carathéodory and Kobayashi/Royden
metrics (in one variable). These are metrics which are defined on arbitrary 
domains in the complex plane (although they may degenerate in particular cases), 
and which are invariant under biholomorphisms. The principal applications are to 
the study of biholomorphism groups, and to normality of families of functions. 

Finally in Chapter 4 the author turns to several complex variables. After some 
preliminaries, he introduces the Carathéodory and Kobayashi metrics in this 
context and uses the Kobayashi metric to prove that the unit ball and the bidisk in 
$\mathbb{C}^2$ are not biholomorphic. Since these domains are homeomorphic, this is in sharp
contrast to the Riemann Mapping Theorem in one variable which says that any
planar domain which is homeomorphic to the disk is biholomorphic to it. This is
one example (out of many that are available) that shows that new and interesting 
phenomena occur in function theory in more than one complex variable. 

To sum up, this is a fine book—one that continues the tradition of the Carus 
Mathematical Monographs for excellent exposition of important mathematics. 

A final point about elementary textbooks in complex analysis. As the preceding 
discussion indicates, the geometric approach to function theory is too advanced to 
be given much coverage in a beginning, one-semester course on complex analysis.
The most one can expect at that level is a proof of the Schwarz Lemma, and the 
application to the classification of the biholomorphisms of the unit disk. However, 
given the importance of the geometric approach, at least this much should be 
included. An incomplete survey of beginning books on complex analysis reveals 
that even that little is missing from many modern books. Some do not even
mention the Schwarz Lemma. Others include it as a simple application of the 
maximum principle, but without any mention of applications. This kind of thing 
makes mathematics look like an exercise in mental gymnastics. Only about a third 
of the books examined contained both the Schwarz Lemma and its application. 
This lack of attention to a very important result is regrettable. A clear exception is 
the book of Ahlfors, which contains a discussion of the Lemma, followed by a 
proof, and then a series of eight wonderful exercises containing the application to 
the classification of biholomorphisms in addition to the connection with the 
Poincaré metric that is the result of the Schwarz-Pick Lemma. 


Department of Mathematics 
Rice University 

P.O. Box 1892 

Houston, TX 77251-1892 


Submitted by Ralph Bean, Stockton State College, Pomona, New Jersey. 



TELEGRAPHIC REVIEWS 


Edited by Arnold Ostebee and Paul Zorn 


with the assistance of the Mathematics Departments of 
Carleton, Macalester, and St. Olaf Colleges 


Telegraphic Reviews are designed to alert readers in a timely manner to new books
and computer software appropriate to mathematics teaching and research. Special 
codes classify reviews by subject area and appropriate use: 


T: Textbook
S: Supplementary Reading
P: Professional Reading
L: Undergraduate Library
C: Computer Software
13-18: Grade Level
1-4: Semester
**: Special Emphasis
??: Questionable

Readers are advised that price information is subject to change. Selected books and
software packages receive a second, more extensive review in the Monthly.


Books and software submitted for review should be sent to Book Reviews Editor,
American Mathematical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield,
MN 55057-1098.


Mathematics Appreciation, T(14-15: 1), S, 
C, L. Exploring Mathematics With Your Com- 
puter. Arthur Engel. New Math. Lib., V. 35. 
MAA, 1993, ix + 301 pp, $38 (P). [ISBN 0- 
88385-639-5] A nice idea, not very well ex- 
ecuted. Leaps into topics without much mo- 
tivation. Still, there are mathematical nuggets 
worth panning. BC 


History, S, L. The Life of Isaac Newton. 
Richard S. Westfall. Cambridge Univ Pr, 1993, 
xxii + 328 pp, $24.95. [ISBN 0-521-43252-9]
Condensed version of author’s award-winning 
biography Never at Rest: A Biography of Isaac 
Newton (1980). This shorter, less technical ver- 
sion aims at a general audience. HD 


Number Theory, P. Algebraic Number Theory.
A. Fröhlich, M.J. Taylor. Stud. in Adv.
Math., V. 27. Cambridge Univ Pr, 1993, xiv
+ 355 pp, $29.95 (P). [ISBN 0-521-43834-9]
An introduction to classical algebraic number
theory, with some interesting twists.
Covers standard topics, also cubic and biquadratic
fields, elliptic curves, binary quadratic forms, 
Brauer relations. Some exercises. SG 


Number Theory, P. From Number Theory to 
Physics. Eds: M. Waldschmidt, et al. Springer- 
Verlag, 1992, xiii + 690 pp, $89. [ISBN 0-387- 
53342-7] Fourteen self-contained, expository 
lectures on many aspects of number theory. 
Most require no specialized background, but 
extend to recent developments in the field. A 
useful collection for any mathematician who 
wants to learn more about these fields. MPR 


Number Theory, P. A Tribute to Emil Grosswald:
Number Theory and Related Analysis.
Eds: Marvin Knopp, Mark Sheingorn. Contemp.
Math., V. 143. AMS, 1993, viii + 612 pp,
$79 (P). [ISBN 0-8218-5155-1] 39 articles 
on topics in number theory, modular functions, 
combinatorics, and analysis. 


Linear Algebra, T(14: 1). Introduction to 
Linear Algebra. Gilbert Strang. Wellesley- 
Cambridge Pr, 1993, viii + 472 pp, $42. [ISBN 
0-9614088-5-5] Basic philosophy same as au- 
thor’s Linear Algebra and Its Applications 
(TR, August-September 1988), but moves more 
slowly. Covers standard topics plus applica- 
tions, numerical linear algebra. Includes some 
MATLAB programs. LC 


Algebra, P. General Algebra and Applica- 
tions. Eds: K. Denecke, H.-J. Vogel. Res. & 
Expos. in Math., V. 20. Heldermann Verlag, 
1993, 237 pp, DM 78 (P). [ISBN 3-88538-220-2]
20 articles based on lectures given at the
“43. Arbeitstagung Allgemeine Algebra” (Pots- 
dam, 1992). 

Algebra, P. Representation Theory of Groups 
and Algebras. Eds: Jeffrey Adams, et al. Con- 
temp. Math., V. 145. AMS, 1993, x + 491 pp, 
$50 (P). [ISBN 0-8218-5168-3] Proceedings 
of the Representation Theory Group at the Uni- 
versity of Maryland during the years 1989- 
1992. Lecture notes from keynote speakers as 
well as research and expository articles. 


Complex Analysis, P. Several Complex Vari- 
ables in China. Eds: Chung-Chun Yang, Sheng 
Gong. Contemp. Math., V. 142. AMS, 1993, 
xii + 173 pp, $36 (P). [ISBN 0-8218-5164-0] 
9 papers, several in survey style, on topics in 
several complex variables (singular integrals, 
function spaces, differential operators, factorization,
biholomorphic mappings, etc.). PZ

Complex Analysis, P. Several Complex Variables:
Proceedings of the Mittag-Leffler Institute,
1987-1988. Ed: John Erik Fornaess.
Math. Notes, V. 38. Princeton Univ Pr, 1993, vii
+ 701 pp, $39.50 (P). [ISBN 0-691-08579-X] 
38 papers by participants in the special year 
devoted to several complex variables.

Differential Equations, S(17-18), P. Oscillations
in Nonlinear Systems. Jack K. Hale.
Dover, 1992, ix + 180 pp, $7.95 (P). [ISBN 
0-486-67362-6] Periodic and almost-periodic 
solutions to non-linear ODE’s with a small pa- 
rameter. Treats integral manifolds, method of 
averaging. SK 

Partial Differential Equations, T(17-18: 2-4),
L. An Introduction to Partial Differential
Equations. Michael Renardy, Robert C.
Rogers. Texts in Appl. Math., V. 13. Springer-
Verlag, 1993, xiii + 428 pp, $42. [ISBN 0-387-97952-2]
Text for a broad-based introductory
graduate course in PDE’s. JO 


Functional Analysis, P. Banach Spaces. 
Eds: Bor-Luh Lin, William B. Johnson. Con- 
temp. Math., V. 144. AMS, 1993, xviii + 
201 pp, $42 (P). [ISBN 0-8218-5157-8] Proceedings
of January 1992 workshop at the Universidad
de Los Andes in Merida, Venezuela.


Analysis, T(17-18: 1), P. Spline Functions 
and Multivariate Interpolations. B.D. Bojanov, 
H.A. Hakopian, A.A. Sahakian. Math. & Its 
Applic., V. 248. Kluwer Academic, 1993, ix 
+ 276 pp, $122.50. [ISBN 0-7923-2229-0] 
Comprehensive introduction to the theory of 
spline functions, stressing new developments. 
Some results appear as exercises, with generous 
hints. Historical notes, extensive bibliography, 
relatively few exercises. HD 


Analysis, P. Representation of Lie Groups and 
Special Functions, Volume 2: Class I Represen- 
tations, Special Functions, and Integral Trans- 
forms. N. Ja. Vilenkin, A.U. Klimyk. Transl: 
V.A. Groza, A.A. Groza. Math. & Its Applic., 
V. 74. Kluwer Academic, 1993, xviii + 607 pp, 
$290. [ISBN 0-7923-1492-1] Second of three 
volumes treats (i) properties of special functions 
and orthogonal polynomials that are related to 
the class I representations of certain groups;
(ii) q-analogs of classical orthogonal polynomials
related to representations of the Chevalley
groups; (iii) special functions connected to
fields of p-adic numbers. LC 

Differential Geometry, T(15-16: 1, 2), S, L*.
Curves and Singularities: A Geometrical In- 
troduction to Singularity Theory, Second Edi- 
tion. J.W. Bruce, P.J. Giblin. Cambridge Univ 
Pr, 1992, xviii + 321 pp, $69.95. [ISBN 0-521-41985-9]
This delightfully written book
overflows with beautiful mathematics, requir- 
ing only linear algebra, multi-variable calculus, 
and a little mathematical sophistication. Buy a 
copy for a friend. JO 


Differential Geometry, P. The Problem of 
Plateau—A Tribute to Jesse Douglas & Tibor 
Radó. Ed: Themistocles M. Rassias. World
Scientific, 1992, x + 335 pp, $94. [ISBN 981-02-0556-2]
First section recounts lives and
works of Plateau, Douglas, and Radó. Second
collects recent work on Plateau’s Problem. JO 


Geometry, T(18: 2). Convex Bodies: The 
Brunn-Minkowski Theory. Rolf Schneider. 
Ency. of Math. & Its Applic., V. 44. Cambridge 
Univ Pr, 1993, xiii + 490 pp, $89.95. [ISBN 
0-521-35220-7] Basic properties and bound- 
ary structure of convex bodies, Minkowski ad- 
dition, curvature measures, basic properties of 
mixed volume and mixed area measures, in- 
equalities satisfied by mixed volumes. RWJ 


Control Theory, P. Lecture Notes in Control 
and Information Sciences-185: Analysis and 
Optimization of Systems: State and Frequency 
Domain Approaches for Infinite-Dimensional 
Systems. Eds: R.F. Curtain, A. Bensoussan, 
J.L. Lions. Springer-Verlag, 1993, xiii + 
648 pp, (P). [ISBN 0-387-56155-2] Proceed- 
ings of the 10th International Conference on 
Analysis and Optimization of Systems (Sophia- 
Antipolis, France, 1992). 

Control Theory, P. Lecture Notes in Con- 
trol and Information Sciences-161: Topics in 
Stochastic Systems: Modelling, Estimation and 
Adaptive Control. Eds: L. Gerencsér, P.E. 
Caines. Springer-Verlag, 1991, 401 pp, $61 (P). 
[ISBN 0-387-54133-0] 16 survey papers on 
recent work and trends. 


Probability, P. Spatial Stochastic Processes: 
A Festschrift in Honor of Ted Harris on his 
Seventieth Birthday. Eds: Kenneth S. Alexan- 
der, Joseph C. Watkins. Progress in Prob., 
V. 19. Birkhäuser, 1991, xii + 256 pp, $84.
[ISBN 0-8176-3477-0] 11 papers on branch- 
ing processes, percolation, interacting particle 
systems, and stochastic flows. 


Stochastic Processes, P. Selected Works of 
A.N. Kolmogorov, Volume III: Information The- 
ory and the Theory of Algorithms. Ed: A.N. 
Shiryayev. Transl: A.B. Sossinsky. Math. & Its 
Applic., V. 27. Kluwer Academic, 1993, xxv 
+ 275 pp, $132. [ISBN 90-277-2798-8] A 
lengthy biography, then twelve major papers on 
information theory, ε-entropy and capacity, and
algorithm analysis. TAV 


Statistics, T(16-17: 2), L. Introduction to
Probability and Statistics, Second Edition, Revised
and Expanded. Narayan C. Giri. Stat.:
Textbooks & Mono., V. 136. Marcel Dekker, 
1993, xv + 537 pp, $59.75. [ISBN 0-8247- 
9037-5] Combines First Edition volumes on 
probability (Part I, TR, November 1975) and 
statistics (Part II, TR, March 1976). New edition
expands coverage of multivariate distributions,
the normal in particular. Added topics
include decision theory and the application of 
ANOVA to experimental design. RWJ 


Statistics, T(16-17: 1). Statistical Methods in
Agriculture and Experimental Biology, Second 
Edition. R. Mead, R.N. Curnow, A.M. Hasted. 
Chapman & Hall, 1993, xi + 415 pp, $44.95 (P); 
$99.95. [ISBN 0-412-35480-2; 0-412-35470- 
5] New edition includes topics uncommon in 
introductory texts: modeling, analysis of mul- 
tiple measurements, etc. Mathematically ele- 
mentary. (First Edition, TR, April 1984.) KB 

Statistics, P. Pitman’s Measure of Closeness: A 
Comparison of Statistical Estimators. Jerome 
P. Keating, Robert L. Mason, Pranab K. Sen. 
SIAM, 1993, xx + 226 pp, $26.50 (P). [ISBN 0- 
89871-308-0] Nicely presents history of Pit- 
man’s measure of closeness (PMC), applica- 
tions to single-parameter estimation problems, 
PMC anomalies, and asymptotics. RWJ 


Statistics, T(14-15: 1). Applied Nonparamet- 
ric Statistical Methods, Second Edition. P. 
Sprent. Chapman & Hall, 1993, x + 342 pp, 
$34.50 (P). [ISBN 0-412-44980-3] More em- 
phasis on computing and links between appar- 
ently disparate procedures. New material on 
categorical data analysis. (First Edition, TR, 
November 1989). KB 

Algorithms, T(16), S, L. The Design and Anal- 
ysis of Parallel Algorithms. Justin R. Smith. 
Oxford Univ Pr, 1993, xviii + 510 pp, $75. 
[ISBN 0-19-507881-0] Stresses SIMD algo- 
rithms, data level parallelism. Examples from 
numerical and scientific computation, symbolic 
algorithms. Discusses models (PRAM, dis- 
tributed memory), design principles, existing 
systems, the language C*. RM 

Algorithms, P. Algorithms: Main Ideas and 
Applications. Vladimir Uspensky, Alexei 
Semenov. Math. & Its Applic., V. 251. 
Kluwer Academic, 1993, xii + 269 pp, $116. 
[ISBN 0-7923-2210-X] Fundamental, theo- 
retical treatment of the general theory of al- 
gorithm, motivated by mathematical and philo- 
sophical foundations, with attention to histori- 
cal development. RM 

Computer Graphics, T(16-17: 1, 2), C, P, 
L. Curves and Surfaces for Computer Aided 
Geometric Design: A Practical Guide, Third 
Edition. Gerald Farin. Comp. Sci. & Sci. 
Comp. Academic Pr, 1993, xvii + 473 pp, 
$49.95. [ISBN 0-12-249052-2] Main ideas 
of Computer Aided Geometric Design; a chap- 
ter by Bézier treats origins of Bézier splines. 
Included disk contains C programs, PostScript 
files. (Second Edition, TR, January 1991.) JO 

Computer Science, T(17: 1), P. String- 
Rewriting Systems. Ronald V. Book, Friedrich 
Otto. Texts & Mono. in Comp. Sci. Springer- 
Verlag, 1993, viii + 189 pp, $39. [ISBN 
0-387-97965-4] String-rewriting systems, in- 
troduced by Thue in 1914 for discussion of 
“word-type” problems, have been applied to 
theory of formal languages, more general 
rewriting problems, automated deduction, and 
algorithmic problems related to algebraic struc- 
tures. Compiles recent results in the field. RM 


Computer Science, P. Logic Programming 
Languages. Eds: K.R. Apt, J.W. de Bakker, 
J.J.M.M. Rutten. MIT Pr, 1993, xiv + 204 pp, 
$32.50. [ISBN 0-262-01134-4] Research re- 
sults from a project funded by ESPRIT, a Euro- 
pean multinational organization, that aims to in- 
tegrate logic and functional programming. RJA 

Applications (Physical Science), P. Air Pollu- 
tion Modeling and Its Application IX. Eds: Han 
van Dop, George Kallos. NATO Challenges of 
Mod. Soc., V. 17. Plenum Pr, 1992, xii + 803 pp, 
$165. [ISBN 0-306-44248-5] Proceedings of 
the 19th NATO/CCMS International Technical 
Meeting on Air Pollution Modeling and Its Ap- 
plication (Crete, Greece, 1991). 


Applications, P. Biological Neural Networks 
in Invertebrate Neuroethology and Robotics. 
Eds: Randall D. Beer, Roy E. Ritzmann, 
Thomas McKenna. Neural Networks: Found. 
to Applic. Academic Pr, 1993, xi + 417 pp, 
$64.95. [ISBN 0-12-084728-0] 17 papers 
from a 1991 ONR workshop for invertebrate 
neuroethologists studying sensorimotor con- 
trol and engineers studying autonomous mobile 
robots. Aims to apply biological knowledge 
and examples to engineering problems. Papers 
treat control of leg movement, orientation, com- 
puter modeling, robotics. RM 


Reviewers 


RJA: Richard J. Allen, St. Olaf; KB: Karla Ballman, 
Macalester; LC: Laura Chihara, St. Olaf; BC: Barry 
Cipra, St. Olaf; HD: Hung Dinh, Macalester; SG: Steven 
Galovich, Carleton; RWJ: Roger W. Johnson, Carleton; 
SK: Steve Kennedy, St. Olaf; RM: Richard Molnar, 
Macalester; JO: Jeff Ondich, Carleton; MPR: Matthew 
P. Richey, St. Olaf; TAV: Theodore A. Vessey, St. Olaf; 
PZ: Paul Zorn, St. Olaf. 


Symbolic Computation in 
Undergraduate Mathematics 
Education 


Zaven Karian, Editor 


If you are considering putting a symbolic comput- 
ing system into your curriculum, this is one publi- 
cation you should have. 

—Mathematics Teacher 


This well-written book should be helpful to anyone 
using symbolic computation as an aid in teaching 
undergraduates—The book provides a number of 
examples for presenting probability and statistics 
in a way that removes the tedium and emphasizes 
the underlying ideas. 

—AAAS, Science Books and Films 


If you have any plans to integrate symbolic com- 
puting into your program, read and study this book 
first. Your students will thank you for it. 
—AMATYC Review 


This volume brings together many of the facets 
associated with the pedagogic uses of symbolic 
computation. 


Part I consists of articles that deal with general 
issues of learning mathematics and the role of 
symbolic computation in that process. The articles 
in Part II describe the use of symbolic computation 
in teaching calculus. Some of the areas covered 
are the use of symbolic computation in a labora- 
tory calculus course, the uses of Derive in the 
instruction of calculus, antidifferentiation and the 
definite integral, and the experiences and reflec- 
tions of teachers who have used symbolic compu- 
tation in calculus instruction. 


Part III consists of papers on sophomore-level 
courses on linear algebra and differential equa- 
tions. The articles in Part IV describe what can be 
done in using symbolic computation in teaching 
combinatorics, probability and statistics courses. 
The articles and references in Part V will help you 
get started in using some of these ideas at your 
own institution. 


200 pp., 1992, Paperbound 
ISBN 0-88385-082-6 

List: $22.00 

Catalog Number NTE-24 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-800-331-1622 Fax (202) 265-2384 


Total $ 
Payment: [ ] Check [ ] VISA [ ] MASTERCARD 


Credit Card No. 


Signature Exp. Date 


The Concept of Function 
Aspects of Epistemology and Pedagogy 


Guershon Harel and Ed Dubinsky, Editors 


The contributors of this volume probe the idea of 
what it means to learn the concept of function and 
how instruction, based on research, could assist 
teachers in finding ways of helping their students 
understand this all-important mathematical con- 
cept. 


The concept of function is one that will appear 
again and again in a student’s mathematics train- 
ing. Arithmetic in the early grades, algebra in 
junior high school, and transformational geometry 
in high school are all largely based on the idea of 
function. Moreover, people involved in calculus 
reform know that understanding the idea of func- 
tion is an indispensable part of the background 
students need to understand calculus. As math- 
ematical education is being renewed and reformed 
throughout the world, this movement requires that 
we learn more about the concept of function both 
from epistemological and pedagogical points of 
view. 


There are several major themes that emerge in 
the pages of this volume. They are theoretical 
perspectives of development of the function con- 
cept, theory-based teaching experiments, con- 
ceptions held by students and teachers, and the 
use of pedagogical software. The volume begins 
with a summary and overview of the subject and 
is followed by a brief glossary of terms. 


The development of the papers presented in the 
volume began with a conference held in West 
Lafayette, Indiana in October 1990 with the sup- 
port of Purdue University and the Exxon Founda- 
tion. This volume is, however, much more than 
just a conference proceedings. It is a truly coop- 
erative writing effort by a group of dedicated 
researchers and educators. 


350 pp., 1992, Paperbound 
ISBN 0-88385-081-8 


List: $22.00 


Catalog Number NTE-25 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

(202) 387-5200 Fax (202) 265-2384 


Total $ 
Payment: [ ] Check [ ] VISA [ ] MASTERCARD 


Credit Card No. 


Signature Exp. Date 


JOURNEY INTO 
GEOMETRIES 


Marta Sved 
Foreword by H.S.M. Coxeter 


All prospective teachers of secondary mathemat- 
ics should be required to study this book—both 
content and style. —CHOICE 


...an excursion into the wonderland of advanced 
geometry. Experts will frolic, amateurs will grin 
and eventually understand, and beginners will 
smile right up to the point (pun intended) where 
they get lost. A joy to read and even better the 
second time through, it is a fitting sequel to the 
original adventures. 

—Journal of Recreational Mathematics 


This is definitely a book for the school library, and 
for the teacher's shelf too. 
—The Australian Mathematics Teacher 


This charming book introduces us to topics in 
hyperbolic geometry in a delightfully informal style. 
Early in the 19th century, János Bolyai created 
“non-Euclidean” geometry, discovered indepen- 
dently by two other mathematicians of Bolyai’s 
day, Gauss and Lobachevsky. At the time these 
concepts were too revolutionary to make a serious 
impact. However, later developments in relativity 
theory and twentieth century perceptions made 
hyperbolic geometry an integral part of geometry, 
logically as perfect as classical geometry, yet still 
strangely surprising. 


Journey into Geometries can be read at two 
levels. It can be studied as an informal introduc- 
tion to post-Euclidean geometry, brought to life in 
dialogues between three fictitious figures: a some- 
what grown up Alice, Lewis Carroll and their visitor 
from the Twentieth century, Dr. Whatif. It also can 
serve as background material for university stu- 
dents, for the material presented in the text is 
extended by carefully selected problems. The 
background required is minimal, standard high 
school geometry, yet the serious student, aided by 
problems attached to each chapter, should ac- 
quire a deeper understanding of the subject. 


192 pp., Paperbound, 1991 

ISBN 0-88385-500-3 

List: $22.00 MAA Member: $16.00 
Catalog Number JOG 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-800-331-1622 Fax (202) 265-2384 


Total $ 
Payment: [ ] Check [ ] VISA [ ] MASTERCARD 


Credit Card No. 


Signature Exp. Date 


LURE OF THE INTEGERS 


Joe Roberts 


A joy to read and ponder, this book is a welcome 
addition to the body of mathematical literature. It 
belongs in every mathematical library. 
—Journal of Recreational Mathematics 


Will enrich library collections serving curricula 
with theory of numbers courses. —Choice 


In some small way, this book is an introduction to 
a mythical book which might go under the name 
of The Book of Integers. This mythical book has 
on page n all of the interesting properties of the 
integer n. This introduction stems from many 
years’ casual accumulation of numerical facts. 
Most of the material presented belongs to el- 
ementary mathematics in the sense that no deep 
or profound mathematical background is required 
in order to understand what is said. Much of the 
material is drawn from the theory of numbers. 


Many of the topics touch on contemporary re- 
search and most of the results are stated without 
proof. As a general rule, one cannot tell from the 
statements of the results whether or not their 
proofs will be elementary. Indeed, this is a hall- 
mark of mathematics and is one of the things that 
gives the subject a special flavor and interest. 
Until one knows that expert practitioners have 
been unable to solve a problem, one does not 
know that the problem is difficult. Even then it 

Some of the material will be familiar to people 
having only a small acquaintance with math- 
ematics. Even in those cases, the author pro- 
vides something new. On the other hand, much 
of the material is sufficiently out of the main 
stream of concern that even professional math- 
ematicians may be unfamiliar with the results. 
The many references to the literature will almost 
always enable a reader to track down further 
information. In Lure of the Integers the author 
has presented a body of material which will prove 
interesting to the enlightened layman as well as 
to the professional. 


300 pp., Paperbound, 1992 
ISBN 0-88385-502-X 

List: $28.50 MAA Member: $19.50 
Catalog Number LURE 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-800-331-1622 Fax (202) 265-2384 


Foreign Orders: Please add $3.00 per item ordered 
to cover postage and handling fees. The order will be 
sent via surface mail. If you want your order sent by 
air, we will be happy to send you a proforma invoice 
for your order. 


Total $ 
Payment: [ ] Check [ ] VISA [ ] MASTERCARD 


Credit Card No. 


Signature Exp. Date 


The Search for E.T. Bell 


also known as John Taine 


Constance Reid 


No one today writes about mathematics and mathemati- 
cians with more grace, knowledge, skill, and clarity, and 
no one is going to produce a more delightful, informative, 
accurate account of Eric Temple Bell and his work, and 
that of his alter-ego, the prolific pioneer of science fiction, 
John Taine. This is a fine book. —Martin Gardner 


Eric Temple Bell has been one of my heroes for 60 years...I 
congratulate Constance Reid on a remarkable achieve- 
ment. I hope it is greeted with the success it deserves, and 
revives interest in an extraordinary and multi-talented 
man. —A. C. Clarke 


Eric Temple Bell (1883-1960) was a distinguished 
mathematician and a best selling popularizer of 
mathematics. His Men of Mathematics, still in print 
after almost sixty years, inspired scores of young 
readers to become mathematicians. Under the 
name “John Taine,” he also published science 
fiction novels (among them The Time Stream, Before 
the Dawn, and The Crystal Horde) that served to 
broaden the subject matter of that genre during its 
early years. 

In The Search for E.T. Bell, Constance Reid has 
given us a compelling account of this complicated, 
difficult man who never divulged to anyone, not 
even to his wife and son, the story of his early life 
and family background. Her book is thus more of 
a mystery than a traditional biography. It begins 
with the discovery of an unexpected inscription in 
an English churchyard and a series of cryptic 
notations in a boy’s schoolbook. Then comes an 
inadvertent revelation, by Bell himself, in a re- 
spected mathematical journal...You will have to 
read the book to learn the rest. 

Originally agreeing to write only a profile of 
Bell, Mrs. Reid soon found herself involved in a 
full-length biography. The discoveries she made in 
the course of her five years of research will neces- 
sitate a fresh evaluation of his extensive math- 
ematical work and his science fiction novels as well 
as the revision of almost every statement currently 
in print about his family background and early life. 
Mrs. Reid is already well known as the author of 
acclaimed biographies of David Hilbert, Richard 
Courant, and Jerzy Neyman. 


Includes a collection of over 75 photographs. 


384 pp., Hardbound, 1993 

ISBN 0-88385-508-9 

List: $35.00 MAA Member: $28.00 
Catalog Number BELL 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

(202) 387-5200 1-800-331-1622 


Game Theory and Strategy 


Philip D. Straffin, Jr. 


This valuable addition to the New Mathematical 
Library series pays careful attention to applica- 
tions of game theory in a wide variety of disci- 
plines. The applications are treated in consider- 
able depth. The book assumes only high school 
algebra, yet gently builds to mathematical think- 
ing of some sophistication. Game Theory and 
Strategy might serve as an introduction to both 
axiomatic mathematical thinking and the funda- 
mental process of mathematical modelling. It 
gives insight into both the nature of pure math- 
ematics, and the way in which mathematics can be 
applied to real problems. 

Since its creation by John von Neumann and 
Oskar Morgenstern in 1944, game theory has 
contributed new insights to business, politics, eco- 
nomics, social psychology, philosophy, and evolu- 
tionary biology. In this book, the fundamental 
ideas of game theory share the stage with applica- 
tions of the theory. How might strategic business 
decisions depend on information about a rival 
company, and how much would such information 
be worth? When is it advantageous to vote for a 
candidate who is not your favorite? What are the 
optimal strategies for teams in the football draft, 
and what paradoxes can result from following 
those strategies? What is a fair way to share the 
costs of a development project? What can we learn 
about the problem of “free will” by imagining 
playing a game with an omnipotent Being? How 
might natural selection lead to altruistic behavior 
in animal species? Game theory gives insight into 
all of these questions. 

The book includes many exercises, with answers, 
which allow the reader to try out calculations, and 
explore alternative formulations of game-theo- 
retic ideas. 


200 pp., 1993, Paperbound 
ISBN 0-88385-637-9 

List: $27.50 MAA Member: $22.00 
Catalog Number NML-36 


Proofs Without Words 


Exercises in Visual Thinking 


Roger B. Nelsen 


Just what are “proofs without words?” First of all, 
most mathematicians would agree that they cer- 
tainly are not “proofs” in the formal sense. Indeed, 
the question does not have a simple answer. Proofs 
without words are generally pictures or diagrams 
that help the reader see why a particular math- 
ematical statement may be true, and how one could 
begin to go about proving it. While in some proofs 
without words an equation or two may appear to 
help guide that process, the emphasis is clearly on 
providing visual clues to stimulate mathematical 
thought. Proofs without words bear witness to the 
observation that often in the English language to 
see means to understand, as in “to see the point of an 
argument.” 


Proofs without words have a long history. In this 
collection you will find modern renditions of proofs 
from ancient China, classical Greece, twelfth-cen- 
tury India—even one based on a published proof 
by a former President of the United States! How- 
ever, most of the proofs are more recent creations, 
and many are taken from the pages of MAA jour- 
nals. 


The proofs in this collection are arranged by topic 
into six chapters: Geometry and Algebra; Trigo- 
nometry, Calculus and Analytic Geometry; Inequali- 
ties; Integer Sums; Sequences and Series; and 
Miscellaneous. Teachers will find that many of the 
proofs in this collection are well suited for class- 
room discussion and for helping students to think 
visually in mathematics. 


The readers of this collection will find enjoyment 
in discovering or rediscovering some elegant visual 
demonstrations of certain mathematical ideas that 
teachers will want to share with their students. 
Readers may even be encouraged to create new 
“proofs without words.” 


160 pp., Paperbound, 1993 

ISBN 0-88385-700-6 

List: $27.50 MAA Member: $22.00 


Count on Houghton Mifflin for 
the finest in math textbooks 


Applied Finite Mathematics, Second Edition New! 
Alan Hoenig, John Jay College, City University of New York 


Hoenig offers comprehensive, balanced coverage of the math topics that 
business, economics, and life- and social-science majors need to know. 
Clear, careful exposition of the topics, a wealth of real-world applications, 
and continuous skill practice make this text outstanding. 


740 pages • hardcover • complete support package • just published (©1994) 


Calculus and Its Applications 


Brief Calculus and Its Applications 
Daniel D. Benice, Montgomery College 


These texts provide a clear conceptual framework that reveals the “hows” 
and “whys” of calculus, with excellent exercises and realistic applications. 
Students develop their critical-thinking skills through writing exercises and 
an emphasis on problem-solving strategies. 


Both hardcover • complete support packages • now available for review (©1993) 


Fundamentals of College Mathematics New! 
Introduction to Algebra 
Intermediate Algebra 
Sandra Clarkson and Barbara Barone 
Both of Hunter College, City University of New York 
All paperback • complete support packages • just published (©1994) 


Prealgebra New! 
Richard N. Aufmann, Palomar College 
Vernon C. Barker, Palomar College 
Joanne Lockwood, Plymouth State College 
672 pages • paperback • complete support package • just published (©1994) 


Algebra for College Students: A Functions Approach New! 
Richard N. Aufmann, Palomar College 
Joanne Lockwood, Plymouth State College 
768 pages • hardcover • complete support package • just published (©1994) 


Business Mathematics, Second Edition New! 
Richard N. Aufmann, Palomar College 
Vernon C. Barker, Palomar College 
Joanne Lockwood, Plymouth State College 
704 pages • paperback • complete support package • just published (©1994) 


Also available in a Brief Edition 
384 pages • paperback • complete support package • just published (©1994) 


To request an examination copy, contact your Houghton Mifflin sales 
representative or call 1-800-733-1717. In Canada, call 1-800-268-4404. 


AN OFFICIAL PUBLICATION OF THE MATHEMATICAL ASSOCIATION OF AMERICA 


NOTICE TO AUTHORS 


The Monthly publishes articles, notes, and other fea- 
tures about mathematics and the profession. The 
readership of the Monthly is intended to include ev- 
erybody who is mathematically inclined, including of 
course professional mathematicians and students of 
mathematics at all collegiate levels. While no single 
article or feature is likely to appeal to everyone, mate- 
rial should interest and be accessible to a large num- 
ber of readers. This is the most important criterion for 
acceptance. 


Articles may be expositions of old results or presenta- 
tions of new ones. They may concern all of mathe- 
matics or one small area, a broad development or a 
single application, historical reminiscences or one 
important event. While some articles may contain the 
author’s new research, the novelty of material and 
generality of the results is far less important than the 
clarity of exposition and general interest. Discussing 
one illuminating case of a well known result is far 
better than providing all the details of an obscure but 
new proposition. Articles in the Monthly are sup- 
posed to inform and to entertain; they are meant to 
be read rather than archived. 


Notes are short and possibly informal articles. A note 
may concern a clever new proof of an old theorem, a 
novel way to present tired material, or a lively discus- 
sion of a philosophical (but still mathematical) issue. 
Also, any topic is suitable, so long as it is related to 
mathematics. Because a note is short, the first few 
sentences are the most important part: They should 
explain the purpose and invite the reader in. Pho- 
tographs or diagrams often will attract the reader’s 
attention. 


All articles and notes should be sent to the editor: 


JOHN EWING 

Department of Mathematics 
Indiana University 
Bloomington, IN 47405 


Please send 3 copies, typewritten on only one side of 
the paper. Illustrations should be carefully drawn on 
separate sheets of paper in black ink; the original 
should be without lettering and two copies should 
have appropriate captions and lettering indicated. 


Proposed problems or solutions should be sent to: 


RICHARD BUMBY, 
P.O. Box 10971 
New Brunswick, NJ 08906-0971. 


Please send 2 copies of all material, typewritten if 
possible. 


Letters to the Editor, both for publication and for 
private reading, should be sent to the Editor at the 
address given above. Comments, including criti- 
cisms, are welcome, as are all suggestions for mak- 
ing the Monthly a lively, entertaining, and informative 
journal. 


COMMENTS 


The only way to get rid of responsibilities is to discharge them. 
—Walter S. Robertson 


Dear Professor X: 

I was disappointed that you refused to write a letter for our candidate’s tenure case. I 
was even more disappointed because you waited to tell me until after the deadline. 

You said you were too busy to write. I know you are busy this summer. That’s why I 
wrote to you more than two months in advance to ask whether you would write. I sent you 
the Curriculum Vitae along with detailed instructions about the nature of the letter and the 
date it was due. I promised to send you copies of the candidate’s papers as well. When you 
agreed (after a telephone call), I sent those papers promptly. That’s proper etiquette for 
such requests—ample time and ample information. 

That was my responsibility. Now here is yours: When you agree to write, you should do 
so. Your letter ought to contain enough information to make it clear you have read and 
judged at least a small portion of the candidate’s work. It should contain a short opening 
paragraph that sets the stage for your specific comments, and it should end with one or two 
summary sentences that help to guide the reader (and give people preparing the case some 
quotable sentences). Such letters do not have to be long, nor do they have to contain 
exquisite detail. They should contain honest opinion, based either on your knowledge of the 
candidate’s written research or your personal contact. And the “letter” needs to be a signed 
letter...not an email message or a paragraph dictated to my secretary. 

Are there too many requests for such letters? Absolutely—and I’m doing my best to 
persuade deans and college committees that fewer letters will serve the purpose just as well. 
But some letters will always be necessary in a profession with a system of tenure and 
promotion like ours. 

Writing letters has been a part of our profession for most of this century. Quite likely, a 
half dozen or so mathematicians took time to write such letters for you in the past. Quite 
likely, they were busy people as well. 

Our profession has been infected by the me-generation philosophy of the 1980’s. Writing 
letters of recommendation, refereeing papers, reviewing grant proposals—for many these 
are activities without personal profit, and hence (according to the philosophy) without 
reward. You are not alone; increasing numbers of mathematicians view professional service 
as the sort of activity other (less busy) people ought to do. Requests for recommendations 
and referee reports often draw no response—or worse, no results. When most people 
subscribe to this philosophy, our profession will face a crisis. 

I will make phone calls and send email to solicit another letter. It will be hard to find 
someone else to write, especially since I will have to ask that person to write in a matter of 
weeks or even days. I will find someone, however. And you might find it surprising that the 
person who finally does write the letter turns out to be...just as busy as you. 

—John Ewing 


Yueh-Gin Gung and Dr. Charles Y. Hu 
Award for Distinguished Service 
to J. Sutherland Frame 


David W. Ballew 


“Sud” Frame’s primary interest has always been in his students, their professional 
growth and their eventual success. He has been instrumental in the growth of Pi 
Mu Epsilon, personally installing more than fifty Chapters, and in creating and 
developing in 1952 the highly successful Pi Mu Epsilon Summer Student Paper 
Conferences in conjunction with the American Mathematical Society and the 
Mathematical Association of America. He is widely respected as “Dr. Pi Mu 
Epsilon.” Professor Frame faithfully attends the student presentations, raising 
questions and making personal comments to most of the young members on how to 
improve their papers and on the future directions their research might follow. 

These innovative efforts spanning more than half a century have given the 
professional mathematics community a new understanding and appreciation of the 
research capabilities of undergraduate and Master’s level students. After Dr. 
Frame established the first National Pi Mu Epsilon Student Paper Conference at 
Michigan State University in the early 50’s, there came a recognition that more 
should be done to encourage our best undergraduates to seek careers in research 
and teaching. In the 1970’s and 1980’s many MAA sections followed his lead and 
initiated student paper sessions, several in joint sponsorship with Pi Mu Epsilon. 
The success of student presentations caught attention at the national level and led 
to the establishment of MAA Student Chapters and additional student paper 
sessions at national meetings. 

James Sutherland Frame earned his Doctorate (1933), Master’s (1930), and 
Bachelor’s (1929) at Harvard; his main research interest is the Theory of Repre- 
sentation of Finite Groups, a field where he has published over forty of his one 
hundred and seven papers. He has taught at Harvard, Brown, Allegheny College, 
and Michigan State University and has appointments at the Institute for Advanced 
Study and as a Consultant for Graduate Mathematics Programs in Thailand. He 
has served at the local, regional, national and international levels holding positions 
on the Board of Governors of the MAA and as Chair or member of many, many 
scholarly and civic organizations ranging from the Presidency of the Michigan 
Academy of Arts and Letters to membership on The East Lansing Board of 
Education and The National Council of the AAUP. He was President of Pi Mu 
Epsilon for nine years (1957-66) and served as secretary during the organization’s 
period of most rapid growth (1951-54). He was instrumental in the founding of the 
Pi Mu Epsilon Journal in 1949 and the first Student Paper Conference in 1952. His 
many Honors include Phi Beta Kappa, Who’s Who in America and the Senior 
Research Award of Sigma Xi at Michigan State. 




Almost every summer, ‘Sud’ attends as many of the student paper presenta- 
tions as possible. He immediately grasps the student’s message and, afterwards, 
speaks to each student in a non-intimidating way to help the student gain deeper 
insight to improve his or her results. The students are amazed that this grand old 
gentleman, at 86, can see to the core of the problem so quickly and then see 
further and clearer than they, who worked so long and so hard. As one student 
told me after the 1992 Summer Conference, “That man really loves students, 
doesn’t he?” At the bottom, that says it all. J. Sutherland Frame simply loves 
students and has spent his life proving that. May he do so for another 86 years! 


Department of Computer Science 
Western Illinois University 
Macomb, IL 61455 


Counting 


God made the integers, all else is man’s 
invention, said Kronecker, but I 

prefer to follow Genesis 

where God made one and just one 
more. Dissatisfied with 

only one and two, 

Adam and Eve 

began the 

chain to 

more. 


One 

added 

forever, 

Joined by zero, 

matched by opposites, 

constructs the integers, 

inspires invention of more 

new numbers from old—ratios, 

radical roots and transcendentals, 
transfinite cardinals. Man’s mind is bold. 


- JoAnne S. Growney 

Department of Mathematics and 

Computer Science 

Bloomsburg University, Bloomsburg, PA 17815 
A New Look at Euler’s Theorem for 
Polyhedra 


Branko Grünbaum and G. C. Shephard 


1. INTRODUCTION. Euler’s Theorem for Polyhedra is one of the most beautiful 
results of elementary geometry. If v, e and f are, respectively, the number of 
vertices, edges and faces of a polyhedron P, then the relation 

v − e + f = 2    (1)

is true for cubes, pyramids, prisms, octahedra, and many other polyhedra. One 
might be tempted to think (as Euler himself apparently did) that this equality holds 
for all polyhedra, but it is easily seen that it fails for the picture frame of Figure 
1(a). Here v = 16, e = 32 and f = 16, so v − e + f = 0. The discrepancy is usually 
dealt with by saying that (1) holds only for polyhedra without any “holes”, and 
then rewriting it in the form 


v − e + f = 2 − 2g    (2)


Figure 1. (a) A polyhedron to which Euler’s theorem in its elementary form v − e + f = 2 does not 
apply. (b) A polyhedron of genus g = 1 to which Euler’s theorem in the form v − e + f = 2 − 2g does 
not apply. 


for a polyhedron of “genus” g (that is, “with g holes passing through it”, or “with 
g handles”). Note that here by “polyhedron” we mean the 2-dimensional manifold 
P which is the boundary of the “solid polyhedron”. The quantity on the right-hand 
side of (2) is usually called the Euler characteristic of that manifold and denoted 
by χ(P), so that (2) can be restated as 

v − e + f = χ(P).    (3)
Equations (1), (2) and (3) relate the numbers of vertices, edges and faces of the 
polyhedron to the topological properties of the polyhedron itself. As a picture 
frame has just one hole (that is, g = 1), relation (2) holds for the polyhedron of 
Figure 1(a). However, this simple solution is not applicable in all cases. The 
polyhedron shown in Figure 1(b) also has g = 1, but v = 16, e = 24 and f = 10, 
so v − e + f = 2; hence relations (2) and (3) are no longer true. Also, how should 
one deal with a polyhedron like the funnel-shaped one shown in Figure 2 (the 
boundary of a cube with a pyramid attached to its base and with a pyramidal cavity 
drilled into it until the apex of the cavity just meets the apex of the attached 
pyramid)? Does this have a “hole” or not? In this case no integer g fits equation 
(2)! How should the right side of (1) be modified to deal with this situation, or with 
the polyhedra of Figure 3? What appeared at first sight to be a simple numerical 
identity is now seen to be hedged with additional conditions or exceptional cases. 
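The arithmetic behind these three examples can be checked with a short sketch. The function names are illustrative, not part of the paper, and the counts used for the funnel of Figure 2 (9 vertices, 20 edges, 12 faces) are the ones quoted for it in Section 4.

```python
from fractions import Fraction

def alternating_sum(v, e, f):
    """Return v - e + f for given counts of vertices, edges and faces."""
    return v - e + f

def genus_from_counts(v, e, f):
    """Solve v - e + f = 2 - 2g for g; the result need not be an integer."""
    return Fraction(2 - alternating_sum(v, e, f), 2)

examples = {
    "picture frame, Figure 1(a)": (16, 32, 16),
    "Figure 1(b)": (16, 24, 10),
    "funnel, Figure 2": (9, 20, 12),
}

for name, counts in examples.items():
    print(name, "v - e + f =", alternating_sum(*counts),
          "g from (2) =", genus_from_counts(*counts))
```

For the picture frame the value of g obtained from (2) agrees with its single hole; for Figure 1(b) the formula returns 0 although the polyhedron has g = 1; and for the funnel it returns 1/2, so no integer g fits.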


Figure 2. A funnel-shaped polyhedron for which the elementary forms of Euler’s theorem are not valid. 


The present paper gives the unexpectedly simple answers to these and related 
questions. We begin by defining a large class of sets called “polyhedral sets” which 
generalize familiar polyhedra; they may be closed, open, or neither, connected or 
not, bounded or not, and their parts may have different dimensions. Examples are 
shown in Figures 1, 2, 3 and later diagrams. For each such polyhedral set P we 
define an integer χ(P) called the Euler characteristic of P, and show how this is 
related to the geometric features of P. While this approach to the Euler characteristic 
is not new (see references given in Section 6), it has the advantage of allowing 
very easy determination of the Euler characteristic even for complicated and 
unusual polyhedral sets. However, although (in the unmatchable words of Richard 
Guy) “this is well known to those who well know it”, the circle of those who know 
it seems to be very small. 

To obtain an analogue for polyhedral sets of relation (3) we shall define subsets 
of P called k-scaffolds (for k = 0,1, 2,3) and use these to calculate integers V, E, 
F and C which, in simple cases, correspond to numbers of vertices, edges, faces 
and cells of P. We then show that 


V − E + F − C = χ(P)    (4)

for every polyhedral set P. Clearly, relation (4) is a true generalization of (1), (2) 
and (3) to polyhedral sets. 
[Figure 3: twelve polyhedra. The (V, E, F, C) vectors printed under the parts are 
(12, 20, 11, 2), (16, 25, 11, 1), (14, 24, 12, 1), (8, 16, 10, 1), 
(20, 33, 14, 1), (8, 16, 9, 0), (20, 32, 12, 0), (16, 24, 10, 1), 
(18, 31, 13, 1), (16, 24, 10, 1), (16, 26, 12, 1), (10, 16, 8, 1).] 


Figure 3. Examples of somewhat unusual polyhedra, after Hajós [8]; we interpret all these solids as 
closed sets. The single integer in the first line of the caption to each part is the Euler characteristic 
χ(P) of the solid P; it can be determined by the methods of Section 3. The second line of each caption 
lists (V, E, F, C); the meaning of these numbers will be explained in Section 4. The Euler characteristic 
of the boundary of each polyhedron P is C + χ(P). 


The present approach is simple and elementary; for clarity we shall restrict 
attention to sets in three-dimensional Euclidean space E³, though generalizations 
to higher dimensions present no difficulties. As defined below, the Euler characteristic 
is fully additive, and may be positive, negative or zero. Sums (rather than 
alternating sums) are used in its definition. Notice also that we regard the faces of 
polyhedral sets as relatively open, in contrast to the traditional approach in which 
they are considered to be closed sets. 

The paper is organized as follows. In Section 2, after some preliminary defini- 
tions which may already be familiar to the reader, and are included here in order 
to avoid any ambiguities, we define polyhedral sets and their dissections. In 
Section 3 we define the Euler characteristic and establish its fundamental proper- 
ties. In Section 4 we define the k-scaffolds, establish the Euler relation in the form 
(4), and present the definition of j-faces. Extensions of these results to unbounded 
polyhedral sets are presented in Section 5. Section 6 includes a short synopsis of 
the historical development of the Euler characteristic and related concepts, as well 
as references to the literature. 
2. DEFINITIONS AND TERMINOLOGY. As usual, we shall use N(x, δ) for the 
δ-neighborhood of a point x in E³. A set is bounded if it is contained in some 
neighborhood of a point. The interior of a set S is denoted by int S, and the 
boundary of S by bd S. A set S is closed if it contains bd S and is open if it 
contains no point of bd S, that is, if S = int S. 

By a flat, or affine subspace, we mean any translate of a linear subspace; if the 
dimension k of a flat is specified, we shall say that it is a k-flat. For any S, we 
denote by aff S the affine hull of S, that is, the smallest flat that contains S. A set 
S is said to be k-dimensional if aff S is k-dimensional. Hence a single point is 
0-dimensional, a line segment is 1-dimensional, etc. The dimension of a flat L is 
denoted by dim L. 
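As a small illustration of the dimension of an affine hull, the following sketch computes dim aff S when S is described by finitely many points (for instance the vertices of a segment, square or cube). The function name and the representation of points as tuples are assumptions made for this example only; the rank is found by exact Gaussian elimination on the difference vectors.

```python
from fractions import Fraction

def affine_dimension(points):
    """Dimension of the affine hull of a finite, non-empty list of points."""
    base = points[0]
    # Rows are the difference vectors p - base; their rank equals dim aff S.
    rows = [[Fraction(a) - Fraction(b) for a, b in zip(p, base)]
            for p in points[1:]]
    rank = 0
    for col in range(len(base)):
        # Look for a row at or below position `rank` with a non-zero entry.
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        for r in range(rank + 1, len(rows)):
            factor = rows[r][col] / rows[rank][col]
            rows[r] = [x - factor * y for x, y in zip(rows[r], rows[rank])]
        rank += 1
    return rank

point = [(2, 3, 5)]
segment = [(0, 0, 0), (1, 1, 1)]
square = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (0, 1, 0)]
cube = [(x, y, z) for x in (0, 1) for y in (0, 1) for z in (0, 1)]

for name, pts in [("point", point), ("segment", segment),
                  ("square", square), ("cube", cube)]:
    print(name, affine_dimension(pts))
```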

The above definitions of interior and boundary apply to sets of any dimension, 
but for our purposes, relative properties are more important. If E is a k-flat and if 
N(x, δ) is a neighborhood of a point x ∈ E, then the intersection N(x, δ) ∩ E is 
called a k-neighborhood of x. Thus a 2-neighborhood of a point x is a small open 
circular disk of radius δ centered at x, and a 1-neighborhood of x is an open line 
segment (that is, a segment without its endpoints) of length 2δ centered at x. The 
0-neighborhood of a point x is just the point x itself. 

The relative interior of S is the set of those points x ∈ S for which there exists a 
k-neighborhood of x that is contained in S, but for no point y is any (k + 1)-neighborhood 
of y contained entirely in S; clearly, there is a unique value of k 
with this property. Sometimes the relative interior of S will be called its k-interior. 
The 3-interior is the same as the interior for any set in E³. If the k-interior of a set 


[Figure 4: planar polyhedral sets, parts (a) to (g). Panel data surviving extraction: 
0, 1, 4, −2 in the first row; (e) (1, 2, −1), (f) (6, 3, 0), (g) (3, 4, 2).] 


Figure 4. Examples of bounded polyhedral sets P. For simplicity of description and graphical represen- 
tation, these sets are all in the plane, and all polygonal regions and segments are taken as relatively 
open. The polygonal regions are shaded. Solid lines and solid dots indicate segments and points 
included in the set, dashed lines and hollow dots indicate segments and points that are not included. 
The data in the caption to each part are as in Figure 3, except that C is not listed since it is 0 in all 
cases. The Euler characteristic of P ∩ relbd P is χ(P) − F. 
S is nonempty, we define the relative boundary of S as the set of all points of bd S 
which do not belong to the relative interior of S. The relative interior and relative 
boundary of S are denoted by relint S and relbd S, respectively. The set S is said 
to be relatively open if S = relint S. 

A single point, an open ray or segment, a straight line, a plane, or the whole 
space E³ are examples of relatively open sets. If Q is a square region in E³ (so 
aff Q is a plane, and Q is 2-dimensional) then relbd Q consists of the union of four 
open segments (each of which is the relative interior of one of the sides of the 
square) together with the four vertices of the square. Further, relint Q is the part 
of aff Q that lies inside relbd Q. If S is the boundary of a cube then S contains no 
points whose 3-neighborhoods lie in S, hence int S is the empty set. The points 
whose 2-neighborhoods lie in S are the points in the faces of the cube, that is, the 
points of S apart from the edges and vertices. This set is therefore the 2-interior of 
S. Moreover, since int S is empty, the 2-interior of S is, by definition, the relative 
interior of S. Thus relint S is the union of the relative interiors of the six square 


[Figure 5 panel data, χ(P) followed by (V, E, F): (a) 0, (3, 4, 1); (b) −1, (0, 2, 1); 
(c) (3, 7, 2); (d) 3, (5, 4, 2); (e) 0, (3, 4, 1); (f) −2, (2, 5, 1).] 


Figure 5. Examples of unbounded polyhedral sets in the plane. The conventions are the same as in 
Figure 4; in addition, arrowheads are used to distinguish halflines and lines from segments. (a) shows a 
line-free closed convex set, and the set in (b) is closed and convex, but not line-free (L is a line and 
k = 1 in the notation of Section 5). 


Figure 6. An illustration of the dependence of the relative interior of a set on the mutual position of its 
parts. In (a), the set T = S ∪ Q, where S is the boundary of a cube and Q is a closed square not 
coplanar with any face of the cube. Hence relint T = relint S ∪ relint Q. In (b), for the set U = S ∪ Q, 
in which Q is coplanar with the top face of the cube, meeting it along the edge E, we have relint 
U = relint S ∪ relint Q ∪ relint E. The Euler characteristic is 2 in both cases, as can be verified using 
the definition given in Section 3. In the notation introduced in Section 4, scaf₂T consists of seven open 
squares, scaf₁T consists of 15 open segments, and scaf₀T of 10 vertices; hence V = 10, E = 15, and 
F = 7. On the other hand, scaf₂U consists of five open squares and one open rectangle, scaf₁U of 12 
open segments, and scaf₀U of eight points. 


faces of the cube. If T consists of the set S just considered together with a square 
Q which is not coplanar with any face of the cube (see Figure 6(a)) attached to S 
along a common edge E, then relint T = relint S ∪ relint Q; but for a set U (see 
Figure 6(b)) in which Q is coplanar with a face of S we have relint U = relint S ∪ 
relint Q ∪ relint E. A more complicated example is shown in Figure 7 and 
explained in the caption. 

A set S is called convex if, given any two distinct points x, y ∈ S, the closed 
line segment with endpoints x and y lies entirely in S. Thus a line segment is 
necessarily convex; a single point and the empty set are also convex, since the 
definition is vacuous in these cases. A closed half-space is the (unbounded convex) set 
of points that lie to one side of, or on, a plane in E³. Any set which is the 
intersection of a finite number of closed half-spaces is called a closed convex 
polyhedron. Familiar examples of closed convex polyhedra are (closed) cubes, 
(closed) squares, (closed) line segments, and single points. These are of three, two, 
one and zero dimensions respectively. But our definition includes also unbounded 
convex polyhedra, such as the examples in Figures 5(a) and 5(b); their edges can 
be rays (halflines) or straight lines, and faces and cells can be unbounded as well. 
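The definition of a closed convex polyhedron as a finite intersection of closed half-spaces translates directly into a membership test. The sketch below assumes each half-space is given as a pair (a, b) meaning a · x ≤ b; this representation and the names used are illustrative choices, not notation from the paper. It covers a 3-dimensional, a 2-dimensional and an unbounded example.

```python
def in_closed_convex_polyhedron(point, halfspaces):
    """True if the point satisfies a . x <= b for every half-space (a, b)."""
    return all(sum(ai * xi for ai, xi in zip(a, point)) <= b
               for a, b in halfspaces)

# Closed unit cube 0 <= x, y, z <= 1, written as six closed half-spaces.
unit_cube = [
    ((1, 0, 0), 1), ((-1, 0, 0), 0),
    ((0, 1, 0), 1), ((0, -1, 0), 0),
    ((0, 0, 1), 1), ((0, 0, -1), 0),
]

# Closed unit square in the plane z = 0: a 2-dimensional closed convex
# polyhedron, obtained by using both z <= 0 and -z <= 0.
unit_square = [
    ((1, 0, 0), 1), ((-1, 0, 0), 0),
    ((0, 1, 0), 1), ((0, -1, 0), 0),
    ((0, 0, 1), 0), ((0, 0, -1), 0),
]

# Unbounded example: the wedge x >= 0, y >= 0 with z unrestricted.
wedge = [((-1, 0, 0), 0), ((0, -1, 0), 0)]

print(in_closed_convex_polyhedron((0.5, 0.5, 0.5), unit_cube))
print(in_closed_convex_polyhedron((0.5, 0.5, 0.1), unit_square))
print(in_closed_convex_polyhedron((1000, 5, -7), wedge))
```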

Euler’s Theorem in its basic form (1) holds for the boundary of 3-dimensional 
closed and bounded convex polyhedra. Many elementary proofs of this fact are 
known, see Section 6. The case of unbounded polyhedra will be considered in 
Section 5. 

A relatively open convex polyhedron is the relative interior of a closed convex 
polyhedron. It should be noted that whereas an open convex polyhedron (which is 
necessarily 3-dimensional) is the intersection of a finite number of open half-spaces, 
the same is not true for relatively open convex polyhedra of dimension less than 
three. On the other hand, if P is a relatively open convex polyhedron lying in a 
d-flat E = aff P (where d = 1 or 2) then P can be written as the intersection of 
finitely many open half d-flats (each of which is a relatively open set) lying in E. 
Alternatively, P is the intersection of E with a family of open halfspaces. As we 
Figure 7. A polyhedral set P (assumed closed, to simplify the discussion) consisting of two solid cubes 
joined by two triangles and a segment, and of two squares attached to segments which touch the cubes; 
clearly int P = scaf₃P consists of the two open cubes, so C = 2. The polyhedral set Q = bd P consists 
of the 14 squares shown, together with the two triangles and three segments. Its relative interior 
relint Q = scaf₂Q = scaf₂P consists of the relative interiors of the squares and the triangles, together 
with the open segment IJ along which the upper triangle is joined to the cube; hence F = 15. The set 
R = relbd Q consists of the following open segments: all the edges of the two cubes except HJ, the four 
edges of one of the squares and three of the other, and the segments GY, HI, IK, JK, LM, LN, NT, WX; 
moreover, R also contains all the vertices of the cubes and squares, and the point W (but not the points 
G and L). It follows that relint R = scaf₁R = scaf₁Q = scaf₁P consists of the following open 
segments: AB, GY, HI, WZ, twenty edges of the two cubes, seven edges of the two squares, and four 
edges of the two triangles; hence E = 35. Finally, relbd R = scaf₀R = scaf₀Q = scaf₀P consists of all 
the vertices of the two cubes except N, T, X, as well as of the four vertices of one of the squares and 
two of the other, and the points I and W; hence V = 21. It follows that χ(R) = −14, χ(Q) = 1, and 
χ(P) = −1. These values can easily be verified by directly using the definition of the Euler characteristic. 


shall see, relatively open convex polyhedra play a central role in our treatment of 
the Euler characteristic. 

From the definition it is clear that the intersection of any two closed convex 
polyhedra is a closed convex polyhedron. Analogously, though this requires some 
additional reasoning, it may be verified that regardless of their dimensions the 
intersection of any two relatively open convex polyhedra is a relatively open convex 
polyhedron. Following convention, we shall sometimes refer to a bounded convex 
polyhedron of two dimensions as a polygon, and a bounded convex polyhedron of 
one dimension as a line segment or simply segment; in each case we must, of 
course, specify whether the polyhedron is closed, relatively open, or neither. 

If a set P is the union of the members of a finite family 𝒞 = {C₁, C₂, ..., Cₙ} of 
pairwise disjoint sets C₁, C₂, ..., Cₙ, we say that 𝒞 is a dissection of P and we 
write P = ⊍ 𝒞, where the dot in the union symbol indicates that the sets Cᵢ are 
pairwise disjoint. If P is a set that admits a dissection 𝒞 all elements of which are 
relatively open convex polyhedra we shall say that P is a polyhedral set, and 
express this by writing 


P = ⊍_c 𝒞    (5)
where the subscript c is to remind us that each element of 𝒞 is a relatively open 
convex polyhedron. Such a family 𝒞 is called a relatively open convex dissection of 
P and each element of 𝒞 is called an element of the relatively open convex 
dissection. All the sets shown in Figures 1 to 9 are polyhedral sets in this sense. It 
will be observed that the definition is very general in that it does not require P to 
be open or closed, connected, simply connected or even homogeneous in the sense 
that neighborhoods of all points of P are of the same dimension. Nor are there 
any restrictions on the incidences of the closures of the elements. On the other 
hand, a bounded and closed polyhedral set is necessarily compact. The collection 
of all polyhedral sets will be denoted by 𝒫. These polyhedral sets are the sets to 
which relation (4) will apply. 

It is clear that except when P consists of a finite number of points, its 
expression (5) as a disjoint union of relatively open convex polyhedra is not unique. 
In fact, every relatively open convex polyhedron of dimension greater than 0 
admits infinitely many open dissections. 

Let P = ⊍ 𝒞 and P = ⊍ 𝒟 be two dissections of P; we shall say that the 
latter is a refinement of the former if each element of 𝒞 is a disjoint union of 
elements of 𝒟. If we are given any two dissections P = ⊍ 𝒞 and P = ⊍ 𝒟 of P 
then it is possible to find another dissection P = ⊍ 𝒮 which is a refinement of 
each; any such 𝒮 will be called a common refinement of the dissections 𝒞 and 𝒟. 
In fact it suffices to take, as elements of 𝒮, all non-empty sets of the form C ∩ D, 




Figure 8. (a) A polyhedral set P consisting of an open cube S and a relatively open square Q. This 
dissection of P is not complex-like. (b) A relatively open convex dissection of P which is complex-like. 
where C ∈ 𝒞 and D ∈ 𝒟. From the properties of relatively open convex sets it is 
immediate that if the dissections 𝒞 and 𝒟 are relatively open convex dissections, 
then the common refinement 𝒮 is another dissection of the same kind. Moreover, 
among such common refinements of the relatively open convex dissections 𝒞 and 
𝒟 of a polyhedral set P it is possible to find a common refinement 𝒮 which is 
complex-like. By this we mean that whenever an element of 𝒮 meets the relative 
boundary of another element of 𝒮, it is contained in that boundary; see Figure 8. 
These simple observations are basic for all that follows. 
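The construction of a common refinement from all non-empty intersections can be tried out in the simplest setting, a polyhedral set on a line. In this sketch (a one-dimensional model chosen for illustration, with hypothetical names) a relatively open convex piece is a pair (lo, hi): a single point when lo == hi, an open interval when lo < hi. Counting points as +1 and open intervals as −1 anticipates the definition of Section 3.

```python
def contains(piece, x):
    """True if the number x lies in the point or open interval `piece`."""
    lo, hi = piece
    return x == lo if lo == hi else lo < x < hi

def intersect(c, d):
    """Intersection of two pieces, or None if it is empty."""
    if c[0] == c[1]:                       # c is a single point
        return c if contains(d, c[0]) else None
    if d[0] == d[1]:                       # d is a single point
        return d if contains(c, d[0]) else None
    lo, hi = max(c[0], d[0]), min(c[1], d[1])
    return (lo, hi) if lo < hi else None   # two open intervals

def common_refinement(first, second):
    """All non-empty intersections of a piece of `first` with a piece of `second`."""
    pieces = (intersect(c, d) for c in first for d in second)
    return sorted(p for p in pieces if p is not None)

def signed_count(dissection):
    """Points count +1 and open intervals count -1."""
    return sum(1 if lo == hi else -1 for lo, hi in dissection)

# Two dissections of the open interval (0, 4).
C = [(0, 1), (1, 1), (1, 4)]
D = [(0, 3), (3, 3), (3, 4)]
S = common_refinement(C, D)

print(S)
print(signed_count(C), signed_count(D), signed_count(S))
```

All three dissections give the same signed count, which is the behaviour that makes a dissection-independent definition possible.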


3. THE EULER CHARACTERISTIC. Throughout this section we shall restrict 
attention to bounded polyhedral sets; the extension of these results to unbounded 
sets will be given in Section 5. 

For any polyhedral set P ∈ 𝒫 we define an integer χ(P) in the following way: 


(a) χ(∅) = 0. 

(b) If P is a relatively open convex polyhedron of dimension d, then χ(P) = 
(−1)ᵈ. 

(c) If P = ⊍_c 𝒞 is a relatively open convex dissection of P then χ(P) = 
Σ_{C ∈ 𝒞} χ(C). 
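Rules (a), (b) and (c) amount to summing (−1)ᵈ over the dimensions of the elements of a dissection. A minimal sketch, assuming a dissection is recorded simply as a list of the dimensions of its elements (an encoding chosen here for illustration):

```python
def euler_characteristic(dimensions):
    """Sum of (-1)**d over the dimensions d of the elements of a
    relatively open convex dissection; the empty list gives 0."""
    return sum((-1) ** d for d in dimensions)

# Standard dissections into vertices, open edges, open faces and open cells.
empty_set = []
closed_segment = [0, 0, 1]                            # 2 endpoints, 1 open segment
closed_square = [0] * 4 + [1] * 4 + [2]               # 4 vertices, 4 edges, 1 face
closed_cube = [0] * 8 + [1] * 12 + [2] * 6 + [3]      # plus the open cube
cube_boundary = [0] * 8 + [1] * 12 + [2] * 6

for name, dims in [("empty set", empty_set),
                   ("closed segment", closed_segment),
                   ("closed square", closed_square),
                   ("closed cube", closed_cube),
                   ("boundary of a cube", cube_boundary)]:
    print(name, euler_characteristic(dims))
```

The closed segment, square and cube all give 1, and the boundary of the cube gives 2, the value in relation (1).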


This integer χ(P) is called the Euler characteristic of P. Part (c) of the 
definition appears to imply that the definition of χ(P) depends on the dissection 𝒞 
that is used. However, this dependence is only apparent. This, and other important 
properties of χ(P), are given by the following theorems: 


Theorem 1. The Euler characteristic χ(P) is well-defined in the sense that its value, 
as given by (c) above, is independent of the relatively open convex dissection 𝒞 of P 
used in the computation. 


Theorem 2. If P is a closed and bounded convex polyhedron then χ(P) = 1. 


Theorem 3. The Euler characteristic χ(P) satisfies the valuation property: if P₁ ∈ 𝒫 
and P₂ ∈ 𝒫 then 


χ(P₁) + χ(P₂) = χ(P₁ ∩ P₂) + χ(P₁ ∪ P₂).    (6)


It should be noted that Theorem 2 refers to the Euler characteristic of the 
(bounded and closed) convex polyhedron itself, and not—as in the discussion in 
Section 1—to the Euler characteristic of its boundary. 

In some treatments of the Euler characteristic Theorems 2 and 3 are used to 
define χ(P) for closed convex sets and for those sets which can be obtained from 
them by taking finite unions. Thus the definition of χ(P) given here may be 
regarded as an extension of the traditional approach to sets which need not be 
closed. On the other hand, our definition is restricted to polyhedral sets, hence not 
applicable to non-polyhedral sets even if they are convex. 

The proofs of Theorems 1, 2 and 3 are omitted since they follow the usual 
techniques. We note only that for Theorem 1 we rely on the fact mentioned earlier 
that two convex dissections 𝒞 and 𝒟 of a polyhedral set P have a complex-like 
common refinement 𝒮. 
Immediate consequences of these theorems include the following: 


Corollary 1. If 𝒞 is any dissection of the polyhedral set P such that each C ∈ 𝒞 is a 
polyhedral set, then 


χ(P) = Σ_{C ∈ 𝒞} χ(C). 


Corollary 2. (Inclusion-exclusion property). If 𝒞 is any family of polyhedral sets 
such that P = ∪ 𝒞, then 


χ(P) = Σ χ(C) − Σ χ(C₁ ∩ C₂) + Σ χ(C₁ ∩ C₂ ∩ C₃) − + ··· 


where the first sum is over all C ∈ 𝒞, the second sum is over all sets {C₁, C₂} ⊂ 𝒞 of 
two distinct elements of 𝒞, the third sum is over all sets of three distinct elements of 
𝒞, etc. 
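The valuation property (6) and the inclusion-exclusion formula can be checked mechanically once all the sets involved are unions of cells of one fixed dissection. The sketch below uses the boundary of a square, dissected into four vertices and four open edges, and covers it by its four closed sides; the cell names and the encoding of sets as collections of cell names are assumptions made for this example.

```python
from itertools import combinations

# Cells of a fixed dissection of the boundary of a square: name -> dimension.
DIM = {"v1": 0, "v2": 0, "v3": 0, "v4": 0,
       "e12": 1, "e23": 1, "e34": 1, "e41": 1}

def chi(cells):
    """Euler characteristic of a union of cells of the fixed dissection."""
    return sum((-1) ** DIM[c] for c in cells)

# The four closed sides, each an open edge together with its two endpoints.
sides = [
    frozenset({"v1", "e12", "v2"}),
    frozenset({"v2", "e23", "v3"}),
    frozenset({"v3", "e34", "v4"}),
    frozenset({"v4", "e41", "v1"}),
]

def inclusion_exclusion(family):
    """Alternating sum of chi over intersections of 1, 2, 3, ... members."""
    total = 0
    for r in range(1, len(family) + 1):
        sign = (-1) ** (r + 1)
        for subset in combinations(family, r):
            total += sign * chi(frozenset.intersection(*subset))
    return total

boundary = frozenset().union(*sides)
print(chi(boundary), inclusion_exclusion(sides))

# Valuation property for two overlapping paths made of two sides each.
P1 = sides[0] | sides[1]
P2 = sides[1] | sides[2]
print(chi(P1) + chi(P2), chi(P1 & P2) + chi(P1 | P2))
```

Both computations of the boundary give 0: four closed sides of characteristic 1, minus the four vertices shared by adjacent sides.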


Corollary 3. For any polyhedral set P ∈ 𝒫, 

χ(P ∩ relbd P) = χ(P) − χ(relint P); 


hence, if P is closed, χ(relbd P) = χ(P) − χ(relint P). 

Using the above results and suitable dissections, it is easy to verify that each of 
the polyhedral sets in Figures 3, 4 and 5 has the Euler characteristic indicated. 


4. EULER’S THEOREM FOR BOUNDED POLYHEDRAL SETS. We now show 
how the Euler characteristic, as introduced in Section 3, can be used to derive 
analogues of the traditional Euler equations (1) and (2) which are valid for 
bounded but otherwise general polyhedral sets. The basic approach is to express 
each such polyhedral set P ∈ 𝒫 as the disjoint union of four well-determined sets, 
called k-scaffolds of P and denoted scafₖP for k = 0, 1, 2, 3. 

The 3-scaffold scaf₃P is simply int P, the set of all interior points of P. We 
delete scaf₃P from P, leaving P₂ = P \ scaf₃P; clearly, P₂ ⊂ bd P. The 2-scaffold 
scaf₂P is the set of all 2-interior points of the set P₂. (Equivalently, scaf₂P is the 
relative interior of the set P ∩ bd P.) Similarly, scaf₁P is the set of all 1-interior 
points of P₁ = P₂ \ scaf₂P, and scaf₀P is the finite set of points P₀ = P₁ \ scaf₁P. 

Each k-scaffold of P is a relatively open set; in fact, it is a relatively open 
polyhedral set, and S = {scaf₀P, scaf₁P, scaf₂P, scaf₃P} is a relatively open dissection 
of P into polyhedral sets. To verify this claim, we only have to show that each 
scafₖP is a polyhedral set. This follows at once from the observations: 

(i) the definition of scafₖP is independent of the relatively open convex 
dissection 𝒞 which establishes that P is a polyhedral set, and 

(ii) each relatively open convex dissection of P can be refined to one that is 
complex-like. For such a refinement it is clear that each element is contained in 
one and only one scaffold of P, and so defines a relatively open convex dissection 
of each scafₖP. 

As a consequence we have: 


Corollary 4. If P is a bounded polyhedral set then 
χ(P) = χ(scaf₀P) + χ(scaf₁P) + χ(scaf₂P) + χ(scaf₃P). 


We note that if P is a convex polyhedron, then scaf₂P is the union of the 
relative interiors of the faces of P, scaf₁P is the union of the relative interiors of 
the edges of P, and scaf₀P is the set of vertices of P. These remarks motivate the 
following notation: V = χ(scaf₀P), E = −χ(scaf₁P), F = χ(scaf₂P), and C = 
−χ(scaf₃P). Note that we use upper case symbols in order to stress the distinction 
between the precisely defined entities and the somewhat vague quantities mentioned 
in the Introduction and Section 1. Corollary 4 implies: 


Theorem 4. If P is a polyhedral set then 
V − E + F − C = χ(P). 


Thus relation (4) has been established. 
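Relation (4) can be checked against the values of (V, E, F, C) and χ(P) quoted for several examples in this paper. In the sketch below the dictionary layout and function name are illustrative; for the two sets of Figure 6 the value C = 0 is supplied here on the grounds that those sets have empty interior, and F = 6 for the set U counts the five open squares and one open rectangle of its 2-scaffold.

```python
def euler_from_scaffold_counts(V, E, F, C):
    """Left-hand side of relation (4)."""
    return V - E + F - C

# (V, E, F, C) and the Euler characteristic stated for each example.
quoted = {
    "Figure 2 (funnel)": ((9, 20, 12, 0), 1),
    "Figure 6(a), set T": ((10, 15, 7, 0), 2),
    "Figure 6(b), set U": ((8, 12, 6, 0), 2),
    "Figure 7, set P": ((21, 35, 15, 2), -1),
    "Figure 9(a)": ((24, 36, 10, -1), -1),
}

for name, (counts, chi) in quoted.items():
    agrees = euler_from_scaffold_counts(*counts) == chi
    print(f"{name}: V - E + F - C = {euler_from_scaffold_counts(*counts)}, "
          f"stated chi = {chi}, agree = {agrees}")
```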

Theorem 4 will now be illustrated by means of examples. 

Consider first the closed polyhedral set P of Figure 9(a); it is to be understood 
as a solid, the interior of which is scaf₃P. The 2-scaffold scaf₂P consists of 12 




Figure 9. (a) A (closed) polyhedral set; if interpreted as a solid P, it has V = 24, E = 36, F = 10, 
C = −1, and χ(P) = −1. If P is interpreted as a 2-manifold then χ(P) = −2. (b) One of the 
polygons B which occurs as the upper or lower face of the polyhedral set P shown in (a), and (c) a 
dissection of B. Interpreting B as a relatively open set, this shows that χ(B) = −1. (d) A dissection of 
int P, which shows that χ(int P) = 1. 
relatively open rectangular faces (each of which has Euler characteristic equal to 
1) and two other relatively open polygons, namely the top and bottom of P, each 
of which is an open rectangle from which two closed squares have been removed 
(Figure 9(b)). The Euler characteristic of each of these polygons (which we shall 
denote by B) can be easily determined using the open dissection shown in Figure 
9(c). This shows B expressed as the union of seven relatively open convex polygons 
and eight open line segments, so χ(B) = 7 × 1 + 8 × (−1) = −1. We deduce 
that F = 12 × 1 + 2 × (−1) = 10. As scaf₁P consists of 36 open segments, each 
with Euler characteristic −1, it follows that E = 36. The 24 vertices form scaf₀P, 
hence V = 24. To determine C, consider the open convex dissection of int P = 
scaf₃P indicated in Figure 9(d); it consists of seven open 3-dimensional convex 
polyhedra, and of 8 relatively open convex polygons, hence C = −χ(scaf₃P) = 
−(7 × (−1) + 8 × 1) = −1. Therefore 


χ(P) = V − E + F − C = 24 − 36 + 10 − (−1) = −1. 
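The bookkeeping of this example can be replayed step by step. In the sketch below a dissection is summarized as a list of (count, dimension) pairs, an encoding chosen here for illustration; the piece counts are the ones given in the text for Figure 9.

```python
def chi_of_pieces(pieces):
    """Sum of count * (-1)**dimension over (count, dimension) pairs of
    relatively open convex polyhedra in a dissection."""
    return sum(count * (-1) ** dim for count, dim in pieces)

# Top or bottom polygon B: seven open convex polygons, eight open segments.
chi_B = chi_of_pieces([(7, 2), (8, 1)])

# Scaffold integers of the solid P of Figure 9(a).
F = 12 * 1 + 2 * chi_B                   # 12 open rectangles plus top and bottom
E = -chi_of_pieces([(36, 1)])            # 36 open segments
V = chi_of_pieces([(24, 0)])             # 24 vertices
C = -chi_of_pieces([(7, 3), (8, 2)])     # dissection of int P in Figure 9(d)

chi_P = V - E + F - C
print(chi_B, (V, E, F, C), chi_P)
```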


The value of χ(P) can be verified by using the inclusion-exclusion principle 
(Corollary 2). We represent P as the union of seven closed convex sets, all of 
which are prisms (six on quadrangular bases, and one on an octagonal base; Figure 
9(d) can also be interpreted as showing this representation). The intersections 
by pairs are eight closed polygons (rectangles), and every set of three has empty 
intersection. Hence χ(P) = 7 − 8 = −1, as before. 

It should be noted that the flexibility built into the approach followed here 
enables one in many cases to avoid subdivisions when calculating χ(P). For the 
example in Figure 9(a) we could argue that P, together with two open rectangular 
boxes and four relatively open squares, forms a dissection of a closed rectangular 
box. Therefore 1 = χ(P) + 2(−1) + 4, leading to the same value χ(P) = −1. 

The numbers calculated above can also serve to illustrate Corollary 3, since 


χ(relbd P) = 24 − 36 + 10 = −2, 

which coincides with 

χ(P) − χ(relint P) = −1 − 1 = −2, 

since χ(relint P) = −C = 1. 

As a second example, consider the closed polyhedral set P in Figure 2. Using a 
plane through two opposite vertical edges of the cube, partition P into two parts, 
each of which is a closed polyhedron and has Euler characteristic equal to 1. The 
intersection of these parts is the union of two closed triangles with a vertex in 
common, and again this has Euler characteristic 1. Hence, by the inclusion-exclusion 
principle, χ(P) = 2 − 1 = 1. Considering scaffolds, we can verify V = 9, 
E = 20, F = 12, and so, from Corollary 4, 


C = −χ(scaf₃P) = −χ(int P) = 1 − 12 + 20 − 9 = 0. 


This can be verified using a dissection of int P analogous to that just used for P. 
Each half of int P has Euler characteristic −1, but their intersection now has 
characteristic 2, since it consists of two disjoint, relatively open triangles. Thus 
χ(int P) = 2 × (−1) + 2 = 0, as before. For another verification we may consider 
the dissection consisting of P, an open 4-sided pyramid, and an open square, from 
which we have 1 = χ(P) + (−1) + 1, hence χ(P) = 1. 
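Both "complement" arguments used above follow one pattern: P and a few relatively open convex pieces together dissect a set whose Euler characteristic is known, so χ(P) is obtained by subtraction. A minimal sketch of that pattern, with a hypothetical helper name and with pieces recorded as (count, dimension) pairs:

```python
def chi_by_complement(chi_whole, other_pieces):
    """Solve chi_whole = chi(P) + (sum of chi over the other pieces) for chi(P).

    other_pieces lists (count, dimension) pairs of relatively open convex
    polyhedra which, together with P, dissect the larger set.
    """
    return chi_whole - sum(count * (-1) ** dim for count, dim in other_pieces)

# Figure 9(a): closed box = P + two open boxes + four relatively open squares.
chi_fig9 = chi_by_complement(1, [(2, 3), (4, 2)])

# Figure 2: a set of characteristic 1 = P + one open pyramid + one open square.
chi_funnel = chi_by_complement(1, [(1, 3), (1, 2)])

# With V = 9, E = 20, F = 12 for the funnel, relation (4) then gives C.
V, E, F = 9, 20, 12
C_funnel = V - E + F - chi_funnel

print(chi_fig9, chi_funnel, C_funnel)
```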

Additional examples illustrating our theorems and corollaries appear in FIGURE 
3. The quantities V, E, F,C are indicated in the form of a vector (V, E, F,C) 
below each part of the diagram. Each of the polyhedral sets is considered to be 
solid, that is, to have non-empty interior, and to be equal to the closure of its 


120 A NEW LOOK AT EULER’S THEOREM FOR POLYHEDRA [February 


interior. Other examples are shown in Figures 4, 6, 7, 9 and described in the 
captions. 

Finally, we note that the use of scaffolds leads to a definition of j-faces 
(j = 0, 1, 2, 3) which is applicable to all polyhedral sets. We define a j-face of a 
polyhedral set P to be a connected component of scafⱼP. For each j, these 
components are well determined. In the case of convex polyhedra (and many 
others as well) these j-faces are the relative interiors of faces and edges in the 
traditional approach to the subject. However, in more complicated situations 
rather unusual sets can appear as faces. For example, in Figure 4, one 1-face of 
the first polyhedral set in (a) consists of the union of three open segments, namely 
the upper segment and the two contiguous sides of the deleted square. In Figure 
4(g), one 1-face is the union of two open segments. In Figure 7, one 2-face 
consists of the open segment IJ and the two open squares and one open triangle 
that contain IJ in their boundary; another 2-face consists of the open square that 
contains the point L, and the open triangle with vertex L, while one 1-face 
consists of the open segment AB together with the five additional open segments 
that have N or T as an endpoint. It remains to be seen whether this generalization 
of the usual concept of “face” will lead to interesting mathematics. 


5. UNBOUNDED POLYHEDRAL SETS. Now we shall extend the results of the 
previous sections to unbounded polyhedral sets. As we pointed out in Section 2, 
our definition of a closed polyhedron applies equally to the unbounded case. We 
may also allow the elements in the definition of a polyhedral set (5) to be 
unbounded, and thus extend the family P to include unbounded sets. From now 
on, P will be used in this wider sense. We shall assume that the reader has some 
familiarity with the structure of unbounded convex sets as explained, for example, 
in Grünbaum [1967], Section 2.5 and the references in Section 2.7. 

For any non-empty convex polyhedron P in the d-dimensional space $E^d$, let L 
represent a k-flat, contained in P, and chosen to have maximal possible dimension 
k. Then P is called line-free if and only if k = 0. A bounded convex polyhedron is 
necessarily line-free, as is also a set such as that shown in FIGURE 5(a). For the 
convex polyhedron in FIGURE 5(b), k takes the value 1. It is known (see Grünbaum 
[1967], page 24) that if P is a closed, convex polyhedron then it may be written as a 
“direct product” or “vector sum” 


$$ P = L \oplus \hat{P}, \tag{7} $$


where L is the maximal k-flat defined above and $\hat{P}$ is a line-free polyhedron 
whose dimension is k less than the dimension of P. The sign $\oplus$ means that every 
point $x \in P$ can be written uniquely in the form x = y + z (vector addition) where 
$y \in L$ and $z \in \hat{P}$. We may take for $\hat{P}$ any set of the form $P \cap L'$, where $L'$ is a 
(d − k)-flat orthogonal to L in $E^d$. We shall refer to (7) as the standard linear 
decomposition of P. In the example of FIGURE 5(b), L is a line and $\hat{P}$ is a segment. 

For the boundary of a 3-dimensional unbounded line-free convex set P a 
relation analogous to (1) holds, namely 


$$ v - e + f = 1, \tag{8} $$


where v, e, f are, respectively, the numbers of vertices, edges and faces of P (see 
Grünbaum [1967], Section 8.5). If P is 2-dimensional, the corresponding result for 
the relative boundary is 


$$ v - e = -1. \tag{9} $$

An example is provided by the line-free set in FIGURE 5(a), where v = 3 and 
e = 4; two of the edges are line segments, and the other two are rays (unbounded 
half-lines). 

Just as the definition of the family P in Section 2 was formulated in such a way 
as to apply also to unbounded sets, so were Theorems 1 and 3, together with the 
three corollaries in Section 3, phrased in such a way as to apply also in the 
unbounded case. The result analogous to Theorem 2 is as follows: 


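Theorem 2*. Let P be a closed convex polyhedron, and let $P = L \oplus \hat{P}$ be a standard 
linear decomposition of P with dim L = k. Then 


$$ \chi(P) = \begin{cases} 0 & \text{if } \hat{P} \text{ is unbounded,} \\ (-1)^k & \text{if } \hat{P} \text{ is bounded.} \end{cases} $$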


Proof of Theorem 2*. The case where P is bounded has been dealt with in Section 
3, Theorem 2. Now let P be unbounded but line-free. If P is a closed half-line 
(ray), then $\chi(P) = 1 - 1 = 0$. If P is 2-dimensional then, by (9), 


$$ \chi(P) = v - e + 1 = 0, $$

and if P is 3-dimensional then, by (8), 

$$ \chi(P) = v - e + f - 1 = 0. $$


If P is not line-free, then to each j-dimensional element of P there corresponds 
a (j − k)-dimensional element of $\hat{P}$ (note that all elements of P have 
dimensions ≥ k). Hence $\chi(P) = (-1)^k \chi(\hat{P})$ and equals $(-1)^k$ or 0 depending on 
whether $\hat{P}$ is bounded or not. This completes the proof of the theorem. 
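
The statement of Theorem 2* can be checked by hand on small cases. The following is a minimal sketch, assuming that the Euler characteristic of a closed convex polyhedron is computed as the alternating sum of the numbers of its relatively open j-dimensional elements; the function names and the list of examples are illustrative and not part of the original text.

```python
def euler_characteristic(face_counts):
    """Alternating sum chi = sum_j (-1)**j * f_j.

    face_counts[j] is the number of relatively open j-dimensional
    elements (vertices, open edges or rays, open faces, ...) of a
    closed convex polyhedron.
    """
    return sum((-1) ** j * f for j, f in enumerate(face_counts))


def predicted_by_theorem(k, factor_bounded):
    """Value given by Theorem 2*: (-1)**k if the line-free factor is bounded, else 0."""
    return (-1) ** k if factor_bounded else 0


# (name, face counts by dimension, k = dim L, is the line-free factor bounded?)
EXAMPLES = [
    ("closed ray", [1, 1], 0, False),
    ("unbounded plane region with v = 3, e = 4", [3, 4, 1], 0, False),
    ("closed half-plane", [0, 1, 1], 1, False),
    ("closed strip (line plus segment)", [0, 2, 1], 1, True),
    ("cube", [8, 12, 6, 1], 0, True),
]

for name, counts, k, bounded in EXAMPLES:
    print(name, euler_characteristic(counts), predicted_by_theorem(k, bounded))
```

In each row the two printed numbers should coincide; the strip, for instance, has two boundary lines and one open 2-dimensional element, so its alternating sum is −2 + 1 = −1.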

The definitions of the scaffolds of a polyhedral set P, together with Theorem 4 
and Corollary 4, hold with trivial modifications in the unbounded case. Examples 
of the application of Theorem 4 to unbounded sets appear in FIGURE 5. Details 
are given in the caption to the figure. 


6. HISTORICAL REMARKS AND COMMENTS. The history of Euler’s Theorem 
and concepts related to it is both interesting and voluminous. It involves many of 
the ideas that led to modern algebraic topology, and also many of the errors which 
were committed in that development. At least two books have been devoted to the 
early history and attempts at clarification of Euler’s Theorem (Lakatos [13], 
Federico [4]), and countless books and articles contain short accounts. Here only a 
very brief survey will be given. 

Euler first published his theorem in 1750, stating that he had no satisfactory 
proof but was convinced of its general validity by a wealth of examples. (The 
frequently encountered assertion that Euler’s Theorem was known to Descartes a 
century before Euler is unsupported by any evidence, and based on erroneous 
interpretation of some of Descartes’ writings. For a discussion of this and other 
historical errors concerning Euler’s Theorem see Malkevitch [17] and the refer- 
ences given there, and Federico [4].) Euler’s formulation was, essentially, that “for 
every solid bounded by flat surfaces, the number of surfaces increased by the 
number of vertices exceeds by two the number of edges.” Later, Euler presented a 
proof of the theorem, as did several other mathematicians. Early in the nineteenth 
century it was observed that the assertion cannot be true in the generality claimed 
(L’Huilier [16], Hessel [9]). In a masterpiece of understatement, Hessel remarks: 


Other excellent mathematicians (Legendre, Cauchy, Gergonne, Rothe and Steiner) supplied 
proofs for the general validity of the theorem. But in fact, it suffers from exceptions. 


In order to illustrate such exceptions and the difficulties in removing them, 
Hessel shows pairs of polyhedra such that “for each shape for which Euler’s rule is 
not valid, a rather similar one can be found for which the rule is valid.” Hessel’s 
examples are reproduced in FIGURE 10; we shall return to them shortly. 

These developments led to a number of reformulations of the basic version of 
Euler’s Theorem (given by relation (1)) through the introduction of various 
parameters that replaced the value 2 of the right-hand side. All the discussions 
were dogged by two difficulties. 

On the one hand, no precise definitions were given for the polyhedra under 
consideration or for their faces, edges and vertices. It was more or less generally 
assumed that one is dealing with solids and considering features on their surfaces 
—but how to determine faces (or edges) was illustrated by examples rather than 
defined by unambiguous rules. A glance at the collection of examples of polyhedra 
in FIGURE 3, taken from Hajós [8], should convince the reader that the concepts of 
face, edge and vertex are not very straightforward to define even for those special 
polyhedra which are compact and such that the boundary of the polyhedron 
coincides with the boundary of its interior. This observation explains why, in the 
later decades of the nineteenth century, Euler-type relations with more and more 
complicated right-hand sides were appearing in the literature. 

On the other hand, in its original formulation and in the minds of many early 
workers, it was the hallmark of Euler’s theorem that it involved only the numbers 
of vertices, edges and faces—without any consideration of the nature of the faces. 
Thus, in Hessel’s presentation, his even-numbered examples are “good” because 
for them v — e + f = 2, and the odd-numbered polyhedra are “bad” because the 
numbers to be inserted in the left-hand side do not yield 2. Remarkably, some of 
this attitude survived even to our days: in Seydel [30, page 322] the same example 
as in Hessel’s diagram labelled 2 (and our FIGURE 1(b)) is given, with the comment 
that this illustrates the validity of relation (1) for all polyhedra! We shall return to 
the faces of polyhedra later in this discussion. 

The second half of the nineteenth century saw the gradual clarification of the 
difficulties; one step was the insight that relations such as (1) deal with the surface, 
and not directly with the solid. The concept of genus of those special surfaces 
which are called orientable manifolds helped to reach the relation (2). In particu- 
lar, this led to the understanding that the original formulation of Euler’s theorem 
applies to the boundary of 3-dimensional convex polyhedra and, more generally, to 
maps on the sphere and other closed surfaces. 

At the same time, Euler’s theorem was extended to higher dimensions as one of 
the first results in the emerging discipline of algebraic topology. It was established 
(or, at least, stated!) that the Euler characteristic equals the alternating sum of 
so-called “Betti numbers”. But again, there were more good intentions than 
mathematically proved results. In the words of Dieudonné [3]: 


... the mathematicians of the second half of the nineteenth century which were busy with 
these questions [of algebraic topology] speak freely of curves, of surfaces, of deformations, ..., 
without ever saying what they mean by these words. 


Figure 10. Plate taken from Hessel [9]. These diagrams illustrate the difficulties that early authors had 
in extending the simple Euler relation (1) to more complicated polyhedra. Parts 2 to 5 show a cube with 
a hole drilled through it, and parts 6 and 7 show a cube with an indentation. Part 10 shows a block with 
two prismatic holes through it. In parts 3, 6 and 10 some faces have been subdivided. 

Starting with Poincaré [26] and completed, simplified and extended by others (in 
particular, Alexander [1]) these concepts involved in the definition of Euler 
characteristic acquired precise meaning for simplicial complexes and for more 
general objects; see, for example, Hilton-Wylie [11], Singer-Thorpe [32], or any 
other modern text on algebraic topology. However, this development led to a loss 
of the connection to the origins of Euler’s theorem as a relation involving the 
vertices, edges and faces of a polyhedral object. Apart from these developments, in 
the comparatively simple case of convex polytopes of arbitrary dimensions, the 
analogue of Euler’s theorem was discovered by Schläfli in 1850 (but not published 
till 1901), and independently discovered and published in the 1880’s by several 
mathematicians. But all these proofs—including the one in Sommerville [33]—were 
grossly incomplete; as noted in Grünbaum [5, p. 141], to validate these proofs one 
would have to show that the faces of the polytopes in question can be arranged in 
a special order known as shelling. Although it was later shown that every convex 
polytope can be shelled (Bruggesser-Mani [2]), the absence of this concept in the 
proofs shows that they were invalid in an essential aspect. (For elementary proofs 
of the Euler theorem for convex polytopes of all dimensions, which avoid the use 
of shelling, see Grünbaum [5, p. 134], McMullen-Shephard [19, p. 94].) 

Returning to the situation dealing with polyhedra, a new direction opened up 
with the work of Hadwiger [6]. Hadwiger observed that the Euler characteristic can 
be defined consistently by assigning to each compact convex set C the value 
$\chi(C) = 1$, and proceeding to extend this to the “Konvexring” (family of sets each 
of which is a union of finitely many compact convex sets) by using the valuation 
property. Later, Klee [12] simplified and generalized this approach by putting it in 
a lattice setting, and observing that it also worked for unions of open convex sets 
(of a fixed dimension) if one starts by assigning to all open convex sets the Euler 
characteristic 1. Without mentioning Hadwiger or Klee, Shashkin [31] gave a 
detailed elementary exposition of Hadwiger’s approach to the Euler characteristic, 
restricted to closed and bounded polyhedral sets in the plane. However, Shashkin’s 
assertion (on page 82) that certain types of such sets admit a unique decomposition 
into “components” of a particularly ‘‘simple” type is incorrect. 

In a later paper, Hadwiger [7] considered polyhedral sets, defined as those 
admitting relatively open convex dissections, but—due to his insistence on defining 
$\chi(C) = 1$ for every relatively open convex polyhedron—failed to obtain the 
general version of Theorem 3. 

The approach followed here, to assign to a relatively open convex set of 
dimension d the Euler characteristic $(-1)^d$, is due to Lenz [15]. An account of the 
relevant writings by Lenz, Groemer and others appears in McMullen-Schneider 
[18]. Related developments, and in particular the relations between the Euler 
characteristics and valuations on appropriate families of sets are presented in 
Schneider [28], with extensive references to earlier literature. It should be stressed 
that, in all these works, the definition of the Euler characteristic by a relation such 
as (c) in Section 3 does not lead to any result of the nature of Theorem 4. 
Although involving the relatively open convex elements of dissections, these 
formulae apply equally to all such dissections of the given set (just as the 
topological approach applies to all simplicial complexes that represent a given set); 
as a consequence, they do not reflect in any way the particular facial structure of a 
polyhedral set. In fact, it seems that objects like the k-scaffolds have not been 
considered in the literature at all. 

Neither McMullen and Schneider [18] nor Schneider [28] mention the work of 
Nef [20-25], which our note parallels to some extent. Nef’s definition of polyhedral 
sets differs from ours, but is equivalent to it; his approach to the Euler characteristic 
is the same as ours. However, we believe our definitions lead to simpler proofs, 
and there is a significant difference between Nef’s treatment of the faces of 
polyhedral sets and that given here. In view of its importance in the history of the 
subject, it seems appropriate to devote a few lines to a discussion of this topic. 

In the case of convex polyhedra it is generally accepted that the “faces” are the 
intersections of the polyhedra with supporting planes. The only differences be- 
tween the treatments of various authors is in deciding whether to regard the 
intersections themselves, or their relative interiors, as “faces”, and whether the 
convex set itself and/or the empty set should also be included. Whatever choices 
are made, the faces of a convex polyhedron form a well-determined family; there is 
a bijection between the family of relatively open faces, and the family of closed 
faces. However, when more general polyhedra are considered there seems to be no 
agreement at all! In fact, most writers seem content to avoid the topic altogether. 

One of the few writers devoting some attention to this question is Hajós [8]. His 
“polyhedra” are closed, bounded, polyhedral sets which coincide with the closure 
of their interiors. To find the “faces” of such a polyhedron P, he proceeds as 
follows: If L is a plane that meets bd P in a set with non-empty relative interior, 
then the closure of any connected component of the relative interior of $L \cap \mathrm{bd}\, P$ 
is a face of P. A vertex of P is any point that belongs to some three different faces 
such that their planes do not pass through one line. If the intersection S of two 
non-coplanar faces of P contains a segment, then S contains two or more vertices 
of P; segments determined by these vertices, contained in S and not containing 
any vertices in their relative interior, are called edges of P. Hajós appears not to 
realize that these definitions lead to such oddities as an edge that is in the relative 
interior of every face that contains it, or a vertex that is a relatively interior point 
of every face that contains it (see Figure 11(a)). Rather unsurprisingly, Hajós 
reports no results concerning these definitions, and it is reasonable to expect that 
there is little hope for establishing any connection between the Euler characteristic 
of P and the vertices, edges and faces defined in this way. The calculation in the 
caption to FIGURE 11 bears out this interpretation. 

Nef [1978] defines as faces of a polyhedral set P any family of disjoint, relatively 
open convex sets that is a dissection of P; hence a relation analogous to Theorem 
4 holds if V, E, F, C denote the numbers of “faces” of dimension 0, 1, 2, 3, 
respectively. However, these ‘‘faces” are, in general, not uniquely determined, and 
have only a limited geometric significance due to the following fact. There exist 
closed polyhedral sets homeomorphic to a closed ball, for which any convex 
dissection must use as vertices points that cannot be regarded as vertices of P in 
any reasonable sense. Such polyhedral sets were first described by Lennes [14]; the 
simplest of these, shown in FIGURE 11(b), is due to Schönhardt [29]. For this set P 
our definitions yield six vertices, twelve edges and eight faces. In any of the 
dissections of P used by Nef there are at least seven vertices (and correspondingly 
larger numbers of edges and faces), hence at least one of them is devoid of 
geometric meaning. The same polyhedron P is also a counterexample to Lemma 2 
of Szabó [34], which asserts that each polyhedron (according to a definition that 
includes P) has a simplicial decomposition in which all vertices of the tetrahedra 
involved are also vertices of the polyhedron. For interesting results concerning 
polyhedra that lack simplicial decompositions free of additional vertices see 
Ruppert and Seidel [27]. 

Figure 11. (a) A closed polyhedral set P, which is the union of four cubes. According to the definitions 
of Hajós [8, p. 37], this P has 23 vertices, 42 edges and 15 faces, hence the Euler characteristic of its 
boundary would be 23 − 42 + 15 = −4. According to the definitions adopted here, C = 4, V = 16 
(four vertices such as A, and 12 like B), E = 30 (twelve edges like AB, and six edges in the shape of 
two crossing open segments such as those containing the point C, each of Euler characteristic −3), and 
F = 19 (twelve faces are small squares, and one face, of Euler characteristic 7, consists of the three 
large squares that contain the point D), hence $\chi(\mathrm{bd}\, P) = 5$, and $\chi(P) = 1$, in agreement with the 
topological interpretation of the Euler characteristic. (b) A nonconvex octahedron P, which has 
the property that in every relatively open convex dissection of P, some vertices of the polyhedra in the 
dissection are not vertices of P. 


The results concerning polyhedral sets presented here can be extended to more 
general sets. For example, one could admit as “basic constituents”, besides 
relatively open convex polyhedra, also spheres, open balls, closed balls, and the 
(unbounded) complements of closed balls. Extending property (b) by assigning to 
these sets the Euler characteristic 2, −1, 1, and −2, respectively, and to circles 
and open circular disks the values 0 and 1, and considering sets obtainable as finite 
unions of intersections of “basic constituents”, results analogous to Theorems 1 to 
4 can be obtained. Various other generalizations are also possible; their investigation 
is left to the reader. 
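
The values assigned to the curved “basic constituents” are forced by additivity over disjoint unions. A small sketch of that bookkeeping follows, assuming only the values quoted in the paragraph and the convention that a relatively open convex set of dimension d has Euler characteristic (−1)^d; the dictionary keys and the helper name are illustrative.

```python
# Euler characteristics quoted for the curved "basic constituents".
CHI = {
    "sphere": 2,
    "open ball": -1,
    "closed ball": 1,
    "complement of closed ball": -2,
    "circle": 0,
    "open disk": 1,
}


def chi_open_convex(d):
    """Euler characteristic of a relatively open convex set of dimension d."""
    return (-1) ** d


# Each entry: (description, value from additivity over a disjoint union, expected value).
CHECKS = [
    ("closed ball = open ball + sphere",
     CHI["open ball"] + CHI["sphere"], CHI["closed ball"]),
    ("3-space = closed ball + its complement",
     CHI["closed ball"] + CHI["complement of closed ball"], chi_open_convex(3)),
    ("closed disk = open disk + circle (compact convex, so 1)",
     CHI["open disk"] + CHI["circle"], 1),
]

for description, computed, expected in CHECKS:
    print(description, computed, expected)
```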

We hope that the present account will lead to a better understanding of Euler’s 
Theorem, and possibly also to analogous results for other valuations on polyhedral 
sets. 


Otto Neugebauer: 
Reminiscences and Appreciation 


Philip J. Davis 


I first met Otto Neugebauer in the fall of 1963 when I joined the faculty of Brown 
University. For many years prior to his return to the Institute for Advanced 
Study in the mid ’80s, I ate lunch with him fairly regularly in the Brown cafeteria 
known as the Ivy Room. We were not infrequently joined by other members of the 
History of Mathematics Department or their visitors. The conversation was relaxed 
but lively, often very general, and was terminated promptly when the last person 
had finished lunch. Neugebauer was not one to twiddle his spoon leisurely in a 
second cup of coffee. 

He had that soft outward appearance and low-decibel manner that I associate 
with Austrians. Inwardly, he held firm opinions and prejudices, and would occa- 
sionally burst out in anger and irritation. Not unlike Mark Twain, he perceived the 
human world as consisting largely of fools, knaves, and dupes; and when he was 
overwhelmed by this perception, he took refuge in his love of animals which was 
tender and deep. 

He had his roster of The Greats in his profession, and though his ratings were 
not as finely tuned as those of G. H. Hardy, anyone who ate lunch with him would 
find out after a week which of the great names were really great and which were 
asses. As regards the past, he thought that Copernicus was overrated—he called 
him Koppernickel. Kepler was much better, and he loved Arthur Koestler’s 
popularization of Kepler in The Sleepwalkers. Ptolemy was a great hero. As regards 
contemporaries, he expressed his views candidly. 

He could not abide philosophy; he thought it a great waste of time and rarely 
discussed it. I would guess, though, that his largely unspoken philosophy of science 
was that of the logical positivism promulgated by the Vienna Circle in the 1920s. 
He parodied Hegel. He could not abide religious ritual or theological dogmas, and 
though he was a considerable student of these matters, he parodied them all 
mercilessly. As a relaxation, he read the Lives of the Ethiopic Saints, and made a 
dossier of the individual Devils those fellows had to contend with. 

He was given to playful irony and loved Anatole France’s ironic fantasies. I 
recall his being greatly amused by this: he said he found in some ancient Near 
Eastern medical text that crocodile droppings were prescribed for such and such a 
medical condition. The patient applied it without success and went back to the 
practitioner with a complaint. The practitioner asked “What was the sex of 
the crocodile?” “A male, I think.” “Then try with a female.” To this story, 
Neugebauer commented with a twinkle in his eye, “You see, the ancients knew all 
about hormones!” 

Otto Neugebauer in his office at Brown. Photograph by John Forasté/Brown University 


The work that made him world famous was, of course, the reconstruction of, the 
understanding of, the interpretation of ancient scientific texts. The text was the 
thing. Correspondingly, I would guess that his philosophy of history, if he ever 
expressed it openly, would have been that of Ranke: “die Vergangenheit wie es 
eigentlich gewesen” (the reconstruction of the past as it really was). 

I suspect that his philosophy of mathematics (rarely made explicit) was that 
mathematics exists and has a reality independently of the people who created it. 
After all, for a number of years he was associated with Richard Courant and the 
Goettingen group. I could see at the very beginning of our friendship that his 
Platonic idealism was far removed from my philosophy of mathematics, and in the 
interests of that friendship, I stayed away from the topic at lunch time. 

In line with a pervasive skepticism, and in spite of his appreciation of Koestler’s 
reconstruction, he was suspicious of and had little patience with attempts to link 
the history of mathematics with general history. He relegated such attempts pretty 
much to the world of the unprovable, at best, or to the world of fiction, at worst. 
The text, the symbol, and the recoverable platonic meaning thereof; these were his 
touchstones and his standards. 

And yet, he knew that history was more than text, that it could not be reduced 
to a few symbols. He once remarked “If you never heard the sound of Nazi boots 
below you in the street, you cannot understand the history of the period.” 

I’m sure he would have agreed with Macbeth, that human history “is a tale told 
by an idiot, full of sound and fury, signifying nothing.” The very separation he was 
able to make between the human world and the world of platonic ideas was a 
source of strength, a basis for his faith and his ability to pursue his work with gusto 
to an advanced age. 

Human life and character display the particular and the universal. But as no 
man is an island, what seems most striking in an individual is often characteristic of 
a generation. Otto Neugebauer was an extraordinary individual; he also served as 
the emblem of an age of scholarship. 


Correction to the Temple/Tracy article “From Newton to Einstein,” June/July 1992. 

Richard Kavinoky, Mathematics Dept., U.C. Davis 


The authors’ main result in Te/Tr is that the perihelion shift $\Delta\theta$ in one revolution 
of Mercury about the Sun can be estimated by $2\pi\varepsilon$, where $\varepsilon$ is a small dimensionless 
constant determined by the Sun-Mercury system.* They use an asymptotic expansion 
(in $\varepsilon$) of the function for the reciprocal of the radius of the orbit, together with a 
general theorem (given on p. 521 of Te/Tr) to get $\Delta\theta = 2\pi\varepsilon$ + terms of order $\varepsilon^2$. 
However, Godfrey L. Issacs pointed out that the theorem does not in fact apply. 

In the spirit of Te/Tr we would like to formulate a new theorem that does apply. 
The function under consideration in Te/Tr is of the form $G_\varepsilon(\theta) = F(\theta - \varepsilon\theta) + \varepsilon f(\theta)$, 
where $0 < \varepsilon < 1$, F and f are $2\pi$-periodic, i.e. $G_\varepsilon$ is a slight perturbation of a 
periodic function. (The perihelion occurs initially at $\theta = 0$ and subsequently when 
$G_\varepsilon(\theta) = G_\varepsilon(0)$.) The Te/Tr theorem does not apply to their specific $G_\varepsilon$ because 
$F'(0) = 0$. Thus for such $G_\varepsilon$ (i.e. with $F'(0) = 0$) we want to know: when does $G_\varepsilon(\theta)$ 
first return to its initial value $G_\varepsilon(0)$, and what is the error in making a linear (in $\varepsilon$) 
estimate at this point? 

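Answer: If we add the conditions that $F, f \in C^4$, $f(0) = f'(0) = 0$ and $F''(0) \neq 0$, 
we may conclude that $\Delta\theta = 2\pi\varepsilon$ + terms of order $\varepsilon^{3/2}$, where $\theta = 2\pi + \Delta\theta$ is the 
first point at which $G_\varepsilon(\theta) = G_\varepsilon(0)$. More precisely, 


$$ |\Delta\theta - 2\pi\varepsilon| = C\varepsilon^{3/2} + o(\varepsilon^{3/2}), \qquad \text{where } C = 2\pi\sqrt{|f''(0)/F''(0)|}. $$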


This is proved using Taylor’s Theorem and a “bootstrap” type argument, i.e. we 
first get a crude estimate of the error, which enables us to then get a more precise 
estimate, etc., sort of pulling ourselves up by the bootstraps, as it were. 

The specific Te/Tr function $G_\varepsilon$ does satisfy these conditions and thus they are still 
justified in using $2\pi\varepsilon$ as an estimate for the perihelion shift. 

A particularly remarkable point is that although there may be more than one small 
$\Delta\theta$ satisfying $G_\varepsilon(2\pi + \Delta\theta) = G_\varepsilon(0)$, they all will satisfy $\Delta\theta = 2\pi\varepsilon + O(\varepsilon^{3/2})$. 

* $\varepsilon = 3G/(c^2 a(1 - d^2)) = 7.8 \times 10^{-8}$, where G is the gravitational constant for the 
sun, a is the semi-major axis and d is the eccentricity of Mercury’s orbit, and c is the 
speed of light. 
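
The size of the constant and of the resulting shift can be checked numerically. The sketch below assumes that “the gravitational constant for the sun” means the product GM of the Sun, and it uses commonly tabulated values for GM, Mercury’s semi-major axis, eccentricity and orbital period; none of these numerical inputs appear in the note itself, and the function names are illustrative.

```python
import math

# Assumed reference values (not taken from the note).
GM_SUN = 1.32712440018e20    # m^3 / s^2, gravitational parameter of the Sun
C_LIGHT = 299_792_458.0      # m / s
SEMI_MAJOR_AXIS = 5.7909e10  # m, Mercury
ECCENTRICITY = 0.2056        # Mercury
PERIOD_DAYS = 87.969         # Mercury's orbital period


def epsilon(gm=GM_SUN, c=C_LIGHT, a=SEMI_MAJOR_AXIS, d=ECCENTRICITY):
    """Dimensionless constant 3 GM / (c^2 a (1 - d^2))."""
    return 3.0 * gm / (c ** 2 * a * (1.0 - d ** 2))


def shift_per_orbit(eps):
    """Leading-order perihelion shift 2*pi*eps, in radians per revolution."""
    return 2.0 * math.pi * eps


def shift_arcsec_per_century(eps, period_days=PERIOD_DAYS):
    """The same shift accumulated over a Julian century, in arcseconds."""
    orbits = 36525.0 / period_days
    return math.degrees(shift_per_orbit(eps) * orbits) * 3600.0


eps = epsilon()
print(eps, shift_per_orbit(eps), shift_arcsec_per_century(eps))
```

With these inputs the constant comes out at roughly 8 × 10^-8, the same order as the value quoted above, and the accumulated shift is close to the familiar figure of about 43 arcseconds per century.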


[1] Issacs, Godfrey L., Unpublished letter to Editor, American Math. Monthly, 
June 20, 1992. 

[2] “A Quantitative Analysis of the Precession in the Perihelion of the Orbit of 
Mercury (Addendum to the Temple/Tracy article “From Newton to Einstein,” 
June/July 1992),” Richard Kavinoky, U.C. Davis Preprint. 


From the Buffon Needle Problem to the 
Kreiss Matrix Theorem 


Elias Wegert and Lloyd N. Trefethen 


In this paper we present a theorem concerning the arc length on the Riemann 
sphere of the image of the unit circle under a rational function. But our larger 
purpose is to tell a story. We thought at first that the story began in 1962 with the 
Kreiss matrix theorem, the application that originally motivated us. However, our 
arc length question turns out to be more interesting than that. The story goes back 
to the famous “‘Buffon needle problem” of 1777. 


SPIJKER’S LEMMA IN THE COMPLEX PLANE. Let r be a rational function of 
order n, that is, a quotient of two polynomials of degree at most n. Let S denote 
the unit circle $\{z \in \mathbb{C} : |z| = 1\}$, and let $\|\cdot\|_1$, $\|\cdot\|_2$, and $\|\cdot\|_\infty$ denote the 1-, 2-, 
and ∞-norms on S, 


$$ \|f\|_1 = \int_S |f(z)|\,|dz|, \qquad \|f\|_2 = \Bigl(\int_S |f(z)|^2\,|dz|\Bigr)^{1/2}, \qquad \|f\|_\infty = \sup_{z \in S} |f(z)|. $$


Then the arc length of the curve r(S) in the complex plane, which we denote by 
$L_{\mathbb{C}}(r(S))$, can be represented compactly by the formula 


$$ L_{\mathbb{C}}(r(S)) = \|r'\|_1. $$


If r is multiplied by a constant a, $L_{\mathbb{C}}(r(S))$ changes by the factor |a|. However, 
this scale-dependence can be eliminated by considering the ratio 


$$ L_{\mathbb{C}}(r(S)) / \|r\|_\infty. \tag{1} $$


In 1984, building on earlier work by Laptev, Strang, and Tadmor, LeVeque and 
Trefethen [9] observed that a bound on (1) could be used to derive a sharp form of 
the Kreiss matrix theorem (which we shall discuss at the end). They therefore 
posed the question, what is the maximum possible value of (1)? 

It is easy to see that the value $2\pi n$ can be attained; just take r(z) to be $z^n$ or 
$z^{-n}$. If r is restricted to be a polynomial, it follows from Bernstein’s inequality that 
$2\pi n$ is the maximum possible. It is also easy to see that $2\pi n$ is the maximum value 
for rational functions in the special case n = 1 (the reader can supply the proof!). 
Based on these facts and on computer experiments, it was conjectured in [9] that 
$2\pi n$ is the maximum value of (1) for all rational functions r and all n. However, only 
the bound $4\pi n$ was proved, and the task of eliminating this gap of a factor of 2 
was presented as an Advanced Problem in this Monthly [10]. 
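
The ratio (1) is easy to explore numerically. The following is a rough sketch, assuming the arc length is the 1-norm of r' on the unit circle and approximating both the integral and the supremum on an equally spaced grid; the function names and the two test functions are illustrative choices, not taken from [9].

```python
import cmath
import math


def circle_points(m):
    """m equally spaced points on the unit circle."""
    return [cmath.exp(2j * math.pi * k / m) for k in range(m)]


def planar_ratio(r, dr, m=4096):
    """Approximate (arc length of r(S)) / (sup of |r| on S).

    r and dr are callables for the rational function and its derivative.
    The integral of |r'| uses the trapezoid rule, which is very accurate
    for smooth periodic integrands.
    """
    zs = circle_points(m)
    length = sum(abs(dr(z)) for z in zs) * (2.0 * math.pi / m)
    sup = max(abs(r(z)) for z in zs)
    return length / sup


# r(z) = z^3 has order 3 and should attain the value 2*pi*3.
ratio_power = planar_ratio(lambda z: z ** 3, lambda z: 3 * z ** 2)

# r(z) = 1/(z - 1/2) has order 1; its image is a circle of diameter 8/3.
ratio_moebius = planar_ratio(lambda z: 1 / (z - 0.5), lambda z: -1 / (z - 0.5) ** 2)

print(ratio_power, 2 * math.pi * 3)
print(ratio_moebius, 2 * math.pi * 1)
```

The second example stays strictly below 2π, as expected for n = 1: the image circle has circumference 8π/3 and the supremum of |r| is 2, giving a ratio of 4π/3.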

Just one response to the Monthly problem was received, from James C. Smith, 
of the University of South Alabama, who improved the bound to $2(2 + \pi)n$ [16]. 
Five years later, Marc Spijker of the University of Leiden finally settled the 
conjecture in the affirmative [17]: 


Theorem 1. $L_{\mathbb{C}}(r(S)) / \|r\|_\infty \le 2\pi n$. (“Spijker’s lemma”) 


SPIJKER’S LEMMA ON THE RIEMANN SPHERE. The simplicity of Theorem 1 
is marred by the need for the normalization by $\|r\|_\infty$. In looking for a cleaner 
formulation one may ask, what is the analogous result for the Riemann sphere? 
Let $\mathbb{S}$ denote the Riemann sphere $\{x \in \mathbb{R}^3 : \|x\| = 1\}$, with the north and south 
poles corresponding to the points ∞ and 0 in $\mathbb{C}$, respectively, according to the usual 
stereographic projection, and the equator corresponding to the unit circle S. This 
identification of $\mathbb{C}$ and $\mathbb{S}$ is discussed in many books on complex analysis [1], and it 
is readily shown that a unit of arc length |dz| at a position $z \in \mathbb{C}$ is expanded by 
the factor $2/(1 + |z|^2)$ in being projected onto $\mathbb{S}$. It follows that if r(S) is 
considered as a closed curve on $\mathbb{S}$, with $L_{\mathbb{S}}(r(S))$ denoting its arc length on $\mathbb{S}$, then 
we have 


$$ L_{\mathbb{S}}(r(S)) = \bigl\| 2r'/(1 + |r|^2) \bigr\|_1. \tag{2} $$


Now, the trivial scale-dependence has been eliminated from the problem. It makes 
sense simply to ask, what is the maximum possible value of $L_{\mathbb{S}}(r(S))$? 
The new result of this paper is the following answer to this question: 


Theorem 2. $L_{\mathbb{S}}(r(S)) \le 2\pi n$. (“Spijker’s lemma on the Riemann sphere”) 


The proof of Theorem 2 will emerge in the following pages. For the moment, 
we note first of all that like Theorem 1, Theorem 2 is obviously sharp, with equality 
attained for any r that maps S with winding number n onto a great circle of $\mathbb{S}$. 
(For example, $r(z) = z^n$ maps S with winding number n onto the equator, and 
$r(z) = i^n (z - 1)^n/(z + 1)^n$ maps S with winding number n onto the Greenwich 
meridian.) A more important observation is that for any r with $\|r\|_\infty \le 1$, we have 
$L_{\mathbb{C}}(r(S)) \le L_{\mathbb{S}}(r(S))$. This follows from (2), since $2/(1 + |r|^2) \ge 1$ when $|r| \le 1$. 
Consequently, Theorem 2 implies Theorem 1 as a corollary. Thus Spijker’s lemma 
on the Riemann sphere is both simpler and stronger than Spijker’s lemma in the 
complex plane, and perhaps it should be considered the more fundamental result. 
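
Formula (2) and the comparison between planar and spherical arc length can be tried out numerically. This is a minimal sketch, assuming the integrals are approximated by the trapezoid rule on an equally spaced grid on the unit circle; the function names and the three example functions are illustrative and are not taken from the paper.

```python
import cmath
import math


def planar_length(dr, m=4096):
    """Approximate the 1-norm of r' on the unit circle."""
    step = 2.0 * math.pi / m
    return sum(abs(dr(cmath.exp(1j * k * step))) for k in range(m)) * step


def spherical_length(r, dr, m=4096):
    """Approximate the 1-norm of 2 r' / (1 + |r|^2) on the unit circle, as in (2)."""
    step = 2.0 * math.pi / m
    total = 0.0
    for k in range(m):
        z = cmath.exp(1j * k * step)
        total += 2.0 * abs(dr(z)) / (1.0 + abs(r(z)) ** 2)
    return total * step


# r(z) = z^3 maps the unit circle three times around the equator.
sphere_power = spherical_length(lambda z: z ** 3, lambda z: 3 * z ** 2)

# r(z) = z^2 / 2 has sup norm 1/2 <= 1, so planar length <= spherical length <= 2*pi*2.
plane_half = planar_length(lambda z: z)
sphere_half = spherical_length(lambda z: 0.5 * z ** 2, lambda z: z)

# r(z) = 1/(z - 1/2) has order 1, so its spherical length should not exceed 2*pi.
sphere_moebius = spherical_length(lambda z: 1 / (z - 0.5), lambda z: -1 / (z - 0.5) ** 2)

print(sphere_power, plane_half, sphere_half, sphere_moebius)
```

No normalization by the sup norm is needed here, which is the point of passing to the sphere.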


FROM THE NEEDLE PROBLEM TO POINCARÉ’S FORMULA. The reader has 
undoubtedly encountered the Buffon needle problem, published by the Comte de 
Buffon in 1777. Suppose a needle of length 1 is thrown at random on a plane ruled 
by parallel lines at a distance 1 apart. What is the probability that the needle will 
land in a position that crosses a line? Easy calculus shows that the answer is 


$$ \text{Probability of intersection} = \frac{2}{\pi}. $$


Buffon, incidentally, was the leading French naturalist of the eighteenth century 
and also a translator of Newton. He worked on his “problème de l’aiguille” long 
before publishing it as an appendix on “moral arithmetic” in his 44-volume treatise 
on natural history [3]. 
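
The probability 2/π can be checked by simulation. The following is a minimal Monte Carlo sketch, assuming the usual reduction of the problem to two uniform random variables: the distance from the needle’s midpoint to the nearest line, and the acute angle between the needle and the lines. The function name, the seed and the number of throws are illustrative.

```python
import math
import random


def buffon_estimate(throws, seed=0):
    """Estimate the probability that a unit needle crosses a line of a unit-spaced grid."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(throws):
        # Distance from the needle's midpoint to the nearest grid line.
        y = rng.uniform(0.0, 0.5)
        # Acute angle between the needle and the direction of the lines.
        phi = rng.uniform(0.0, math.pi / 2.0)
        # The needle reaches the line when its half-length projection covers y.
        if y <= 0.5 * math.sin(phi):
            hits += 1
    return hits / throws


estimate = buffon_estimate(200_000)
print(estimate, 2.0 / math.pi)
```

The estimate fluctuates around 2/π with a standard error of about 0.001 for this number of throws.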

The needle problem became well known, especially among the French, and was 
generalized. Laplace, without referencing Buffon, solved the analogous problem 
for a square grid (Théorie Analytique des Probabilités, 1812). A more important 
generalization was to consider the slightly modified question: if the needle has 
length L, possibly greater than 1, what is the expected number of intersections? 
The answer is easily seen to be 


$$ \text{Expected number of intersections} = \frac{2L}{\pi}. \tag{3} $$


And from here it is a small step mathematically, but a big one conceptually, to 
note that the same formula (3) is valid also for a paper clip. Various steps in this 
direction were taken by Cauchy, Lamé, and Barbier, among others [2]. In fact, if 
any rectifiable curve $\Gamma$ of arc length $L$ is thrown at random on the parallel grid,
the expected number of intersections is (3). (A curve is rectifiable if its real and
imaginary parts are functions of bounded variation [1].) The idea behind this result
is that $\Gamma$ can be thought of as a concatenation of infinitesimal straight segments,
each satisfying (3) for an appropriate infinitesimal value of $L$. Now it may seem at
first that the expected number of intersections for $\Gamma$ should be more complicated
than the sum of the expected numbers for the segments $\Gamma$ is composed of, since
after all, the segments do not fall on the grid independently. However, independence
is not relevant unless one cares about the efficiency of (3) as a method for
approximating $\pi$. It is a basic fact of statistics that the expectation of a sum of
random variables is equal to the sum of the expectations, regardless of whether or 
not they are independent. This observation seems elementary to us now, but its 
application to the needle problem was evidently not obvious in the nineteenth 
century. 

Taking the paper clip to be a circle of radius $\tfrac{1}{2}$ gives an easy way to remember
Buffon’s result and its generalization (3). For this choice of $\Gamma$, $L$ is $\pi$ and the
number of intersections is exactly 2, no matter how the paper clip falls.
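
Formula (3) lends itself to a quick numerical experiment. The following is a minimal Monte Carlo sketch, not part of the original argument: the function name `expected_crossings`, the three test shapes, the trial counts, and the random seed are all illustrative assumptions. It throws a polyline at random on the grid of lines y = integer and averages the number of crossings.

```python
import math
import random

def expected_crossings(vertices, trials=40000, seed=0):
    """Monte Carlo mean number of crossings of the lines y = integer by a
    polyline thrown at random (uniform rotation, uniform vertical offset)."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        theta = rng.uniform(0.0, 2.0 * math.pi)
        shift = rng.random()
        s, c = math.sin(theta), math.cos(theta)
        # Only the heights of the rotated, shifted vertices matter.
        ys = [x * s + y * c + shift for x, y in vertices]
        # A segment from height y0 to y1 crosses |floor(y1) - floor(y0)| lines.
        total += sum(abs(math.floor(y1) - math.floor(y0))
                     for y0, y1 in zip(ys, ys[1:]))
    return total / trials

# Hypothetical test shapes (not from the article).
needle = [(0.0, 0.0), (1.0, 0.0)]                         # length 1
clip = [(0.0, 0.0), (1.0, 0.0), (1.0, 0.5), (0.0, 0.5)]   # bent wire, length 2.5
m = 200
circle = [(0.5 * math.cos(2 * math.pi * j / m), 0.5 * math.sin(2 * math.pi * j / m))
          for j in range(m + 1)]                          # polygonal circle, radius 1/2

est_needle = expected_crossings(needle)
est_clip = expected_crossings(clip)
est_circle = expected_crossings(circle, trials=2000)

print("needle:", est_needle, "formula:", 2 * 1.0 / math.pi)
print("clip:  ", est_clip, "formula:", 2 * 2.5 / math.pi)
print("circle:", est_circle, "formula:", 2.0)
```

The segments of the bent wire are certainly not independent, yet the average still agrees with $2L/\pi$ to within sampling error, which is the point made above about sums of expectations.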

We now want to move from the plane to the sphere, a step taken as early as 
1860 by Barbier [2]. Consider a “spherical paper clip”, that is, a curve $\Gamma$
embeddable in the Riemann sphere $\mathbb{S}$. Suppose $\Gamma$ is oriented at random on $\mathbb{S}$. What
is the expected number of intersections with the equator? The answer is again
essentially a matter of combining calculus with elementary statistics:


Expected number of intersections on the sphere = $L/\pi$. (4)


Or one can skip the calculus and remember this result by thinking of the case in 
which $\Gamma$ is itself a great circle. In this case $L = 2\pi$ and the number of intersections
is again exactly 2 unless $\Gamma$ happens to land exactly on the equator, an event
of probability zero. 

A final development completes this brief history. After Barbier, other mathe- 
maticians generalized these results further, including Poincaré, who referenced 
neither Buffon nor Barbier (Calcul des Probabilités, 1896 [12]). By this time it was 
clear that although the needle problem and its generalizations had conventionally 
been formulated as problems of probability, that interpretation could be dispensed 
with. Instead of orienting $\Gamma$ at random on $\mathbb{S}$ and asking for the expected number
of intersections with a fixed equator, one can consider $\Gamma$ to be fixed on $\mathbb{S}$ and
compute its arc length $L_{\mathbb{S}}(\Gamma)$ as an integral of the number of intersections with all
great circles. To be precise, for any rectifiable curve $\Gamma \subset \mathbb{S}$ and any $x =
(x_1, x_2, x_3) \in \mathbb{S}$, let $\nu(\Gamma, x)$ denote the number of points of intersection of $\Gamma$ with
the great circle on $\mathbb{S}$ consisting of points equidistant from the antipodes $\pm x$.
(When this number is infinite, the definition of $\nu(\Gamma, x)$ does not matter, for the set
of such points has measure zero.) One obtains the following elegant result:

Lemma 1. $L_{\mathbb{S}}(\Gamma) = \frac{1}{4} \int_{\mathbb{S}} \nu(\Gamma, x)\,dx$. (“Poincaré’s formula”)


The integral is taken with respect to area measure on $\mathbb{S}$.

Lemma 1 can be expressed in words as follows. To find the arc length of a curve 
on the Riemann sphere, integrate its numbers of intersections over all great
circles, then divide by 4. Or, equivalently, since the sphere has surface area $4\pi$,
take the average number of intersections and multiply by $\pi$. This latter paraphrase
of Lemma 1 makes plain its equivalence to (4). 
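
The "average and multiply by $\pi$" reading of Lemma 1 can be tried numerically. The sketch below is an illustration under stated assumptions, not the article's method: the test curve (a circle of latitude, whose length $2\pi \sin\theta$ is known in closed form), the helper names, the sample sizes, and the seed are all hypothetical choices. It samples the pole $x$ uniformly on the unit sphere and counts sign changes of $x \cdot p$ along a polygonal approximation of the curve.

```python
import math
import random

def random_unit_vector(rng):
    """Uniform point on the unit sphere (normalized Gaussian vector)."""
    while True:
        v = (rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1))
        norm = math.sqrt(sum(c * c for c in v))
        if norm > 1e-12:
            return tuple(c / norm for c in v)

def latitude_circle(colatitude, m=360):
    """Closed polygon approximating the circle at the given colatitude."""
    s, c = math.sin(colatitude), math.cos(colatitude)
    return [(s * math.cos(2 * math.pi * j / m), s * math.sin(2 * math.pi * j / m), c)
            for j in range(m)]

def count_intersections(points, x):
    """Crossings of the closed polygon with the great circle {p : x . p = 0}."""
    dots = [sum(a * b for a, b in zip(p, x)) for p in points]
    return sum(1 for d0, d1 in zip(dots, dots[1:] + dots[:1]) if d0 * d1 < 0)

def arc_length_estimate(points, trials, rng):
    """Lemma 1 as an average: pi times the mean number of intersections."""
    mean = sum(count_intersections(points, random_unit_vector(rng))
               for _ in range(trials)) / trials
    return math.pi * mean

colat = math.pi / 3
curve = latitude_circle(colat)
estimate = arc_length_estimate(curve, 3000, random.Random(1))
exact = 2 * math.pi * math.sin(colat)
print("estimate:", estimate, "exact length:", exact)
```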

Poincaré’s formula has far-reaching generalizations described in the book by 
Santaló [15], which the reader may consult for a wealth of related ideas as well as
for the rigor lacking in the discussion above. It forms a centerpiece of the field 
known earlier as “geometric probability” but now as “integral geometry.” 


PROOF OF SPIJKER’S LEMMA. Is it obvious now how to prove Theorem 2? All 
we need is the following lemma, whose proof we shall spell out though it might 
equally well have been left as an exercise. As above, $\nu(r(S), x)$ denotes the
number of intersection points of the curve $r(S)$ with the great circle on the
Riemann sphere $\mathbb{S}$ defined by the points $\pm x$.


Lemma 2. If $r$ is a rational function of order $n$, then $\nu(r(S), x) \le 2n$ for all $x \in \mathbb{S}$
with the possible exception of a single pair $x = \pm x_0$, $x_0 \in \mathbb{S}$.


Proof: Since any point of $\mathbb{S}$ can be rotated to any other by a Möbius transformation,
leaving the set of rational functions of order $n$ invariant, we are free to
choose a particular value of $x$ for convenience. Let us take $x$ to be the north pole,
so that the great circle in question is the equator, i.e., the image of the unit circle $S$ on $\mathbb{S}$. If
$r(z) = p(z)/q(z)$ for polynomials $p$ and $q$ of degree $\le n$, then for $z \in S$,


$|r(z)|^2 = \left|\frac{p(z)}{q(z)}\right|^2 = \frac{p(z)\overline{p(z)}}{q(z)\overline{q(z)}} = \frac{p(z)p^*(z)}{q(z)q^*(z)},$


where $p^*(z) := z^n \bar{p}(z^{-1})$ and $q^*(z) := z^n \bar{q}(z^{-1})$. The condition $|r(z)|^2 = 1$ is thus
a polynomial equation in $z$ of degree at most $2n$. Therefore $r(S)$ intersects the
equator in at most $2n$ points, counted with multiplicity, unless it lies along the
equator exactly. In the latter case it is obviously only the north and south poles for
which the intersection number is infinite. ∎


Since the surface area of $\mathbb{S}$ is $4\pi$ and since $\frac{1}{4} \cdot 2n \cdot 4\pi = 2\pi n$, Theorem 2 is an
immediate consequence of Lemmas 1 and 2.
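
The bound $2\pi n$ can be observed numerically. The sketch below is only an illustration: it assumes the sphere has radius 1, so that the spherical metric is $2|dw|/(1 + |w|^2)$ as in (2), and it approximates the spherical arc length of $r(S)$ by summing geodesic distances between images of closely spaced points on the unit circle. The function names and the second test function are hypothetical, and the test functions are chosen with no poles on the unit circle.

```python
import cmath
import math

def spherical_distance(w1, w2):
    """Geodesic distance on the radius-1 Riemann sphere between finite w1, w2."""
    half_chord = abs(w1 - w2) / math.sqrt((1 + abs(w1) ** 2) * (1 + abs(w2) ** 2))
    return 2.0 * math.asin(min(1.0, half_chord))

def spherical_length(r, samples=20000):
    """Length of the inscribed spherical polygon through r(z), z on the unit circle."""
    ws = [r(cmath.exp(2j * math.pi * j / samples)) for j in range(samples + 1)]
    return sum(spherical_distance(a, b) for a, b in zip(ws, ws[1:]))

# r(z) = z^3 wraps the equator three times: the bound 2*pi*n is attained (n = 3).
len_power = spherical_length(lambda z: z ** 3)

# A hypothetical rational function of order 2 with its pole at z = 2.
len_rational = spherical_length(lambda z: (z ** 2 + 0.5) / (z - 2))

print(len_power, 2 * math.pi * 3)
print(len_rational, "bound:", 2 * math.pi * 2)
```

An inscribed polygon never overestimates the length of the curve, so the second value must stay below $4\pi$ if Theorem 2 holds.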

Spijker’s original proof of Theorem 1, though derived independently, can be 
interpreted as a planar version of the same argument just given to establish 
Theorem 2. In particular, equation (6) of [17] is a kind of Poincaré formula for the 
complex plane, expressed in terms of lengths of one-dimensional projections 
instead of numbers of intersections. Apparently this formula was first worked out 
by Cauchy in 1832 and published by him in 1841 [5].


THE KREISS MATRIX THEOREM. What does all this have to do with the Kreiss
matrix theorem? Let $A$ be an $n \times n$ matrix, and let $\|\cdot\|$ denote the matrix norm
induced by the vector norm $\|\cdot\|_2$. The Kreiss matrix theorem, originally published
in 1962 [8], concerns the problem of characterizing matrices and families of
matrices that are power-bounded. Let us define


$p(A) = \sup_{k \ge 0} \|A^k\|, \qquad r(A) = \sup_{|z| > 1} (|z| - 1)\,\|(zI - A)^{-1}\|.$


The current, sharp form of the theorem reads as follows [17]: 
Theorem 3. $r(A) \le p(A) \le e\,n\,r(A)$. (“Kreiss matrix theorem”)


In words, a matrix $A$ is power-bounded ($p(A) < \infty$) if and only if the norm of its
resolvent $(zI - A)^{-1}$ increases at most inverse-linearly as $z$ approaches the unit
circle from the outside ($r(A) < \infty$). Moreover, the gap between $p(A)$ and $r(A)$ is
a factor of at most $e$ (= 2.718...) times $n$, so the same conclusion applies to
families of matrices $\{A_\nu\}$ of fixed dimension that satisfy uniform bounds on $p(A_\nu)$
and $r(A_\nu)$.
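
To make the two quantities concrete, here is a small sketch for a single $2 \times 2$ matrix. Everything in it is an assumption made for illustration: the test matrix (a Jordan-like block with eigenvalue 0.9), the helper names, and the sampling grid. The supremum defining $r(A)$ is only sampled, so the value computed is a lower estimate of $r(A)$.

```python
import cmath
import math

def norm2(m):
    """Spectral norm of a 2x2 complex matrix from its singular values."""
    (a, b), (c, d) = m
    frob = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2
    det = abs(a * d - b * c)
    return math.sqrt((frob + math.sqrt(max(frob * frob - 4 * det * det, 0.0))) / 2)

def matmul(x, y):
    return tuple(tuple(sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def resolvent(m, z):
    """(zI - A)^{-1} for a 2x2 matrix, by the adjugate formula."""
    (a, b), (c, d) = m
    det = (z - a) * (z - d) - b * c
    return (((z - d) / det, b / det), (c / det, (z - a) / det))

def power_bound(m, kmax=200):
    """max ||A^k|| over k = 0..kmax (equals p(A) once the powers have decayed)."""
    power = ((1.0, 0.0), (0.0, 1.0))
    best = norm2(power)
    for _ in range(kmax):
        power = matmul(power, m)
        best = max(best, norm2(power))
    return best

def kreiss_lower_estimate(m, angles=360):
    """Sampled lower estimate of r(A) = sup (|z| - 1) ||(zI - A)^{-1}||."""
    best = 0.0
    for j in range(41):
        delta = 10.0 ** (-j / 10.0)            # |z| - 1 from 1 down to 1e-4
        for t in range(angles):
            z = (1.0 + delta) * cmath.exp(2j * math.pi * t / angles)
            best = max(best, delta * norm2(resolvent(m, z)))
    return best

A = ((0.9, 1.0), (0.0, 0.9))                   # hypothetical test matrix, n = 2
p_est = power_bound(A)
r_est = kreiss_lower_estimate(A)
print("r(A) >=", r_est, " p(A) =", p_est, " e*n*r >=", math.e * 2 * r_est)
```

For this matrix the powers first grow and then decay, so $p(A)$ exceeds 1 even though the spectral radius is below 1; the three printed numbers should appear in increasing order, as Theorem 3 requires.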

The first inequality of Theorem 3 asserts that if $\|A^k\| \le C$ for $k \ge 0$, then
$\|(zI - A)^{-1}\| \le C/(|z| - 1)$ for $|z| > 1$. This is easy to prove by making use of the
power series $(zI - A)^{-1} = z^{-1}I + z^{-2}A + z^{-3}A^2 + \cdots$. The more interesting
inequality is the second one, which asserts that if $\|(zI - A)^{-1}\| \le C/(|z| - 1)$ for
$|z| > 1$, then $\|A^k\| \le enC$ for $k \ge 0$. According to Tadmor’s remarks in [18] for the
earlier developments, the history of successive improvements toward this constant 
en involves no fewer than nine steps, though the earlier authors in the list were 
certainly not concerned with optimizing the constant: 


Kreiss ’62:                  ~ $[r(A)]^{n^n}$
Morton ’64:                  ~ $6^n (n + 4)^{5n}$
Miller & Strang ’66:         ~ $n^n$
Miller ’67:                  ~ $e^{9n^2}$
Laptev ’75 / Strang ’78:     $32en^2/\pi$
Tadmor ’81:                  $32en/\pi$
LeVeque & Trefethen ’84:     $2en$
Smith ’85:                   $(1 + 2/\pi)en$
Spijker ’91:                 $en$


History, thank goodness, stops here. It is shown in [9] that the constant en is best 
possible. 

As the estimates have become sharper, the proofs have become mercifully 
simpler and have ceased to depend upon the explicit manipulation of eigenvalues 
and normal forms of matrices. We reproduce now the argument from [9] that 
shows how the constant en follows from Spijker’s lemma. 


Proof of the second inequality of Theorem 3. According to the calculus of resolvents
described for example in [7], the matrix $A^k$ can be written as the Cauchy integral


$A^k = \frac{1}{2\pi i} \int_G z^k (zI - A)^{-1}\,dz,$


where $G$ is any curve enclosing the eigenvalues of $A$, which must lie in $\{z \in \mathbb{C} :
|z| < 1\}$ if $r(A) < \infty$. Let $u$ and $v$ be arbitrary $n$-vectors with $\|u\|_2 = \|v\|_2 = 1$.
Then


$v^*A^k u = \frac{1}{2\pi i} \int_G z^k q(z)\,dz,$


where $v^*$ denotes the conjugate transpose of $v$ and $q(z)$ is the function
$v^*(zI - A)^{-1}u$, which can be shown to be a rational function of order $n$. Integration by
parts gives


$v^*A^k u = -\frac{1}{2\pi i (k + 1)} \int_G z^{k+1} q'(z)\,dz.$


Let the contour of integration be taken as $G = \{z \in \mathbb{C} : |z| = 1 + (k + 1)^{-1}\}$. On
this contour we have $|z^{k+1}| \le e$ and hence


$|v^*A^k u| \le \frac{e}{2\pi(k + 1)} \int_G |q'(z)|\,|dz|.$


This integral can be interpreted as the arc length of the image of the circle $G$ under $q$. By a
trivial change of variables it might as well be an arc length over the unit circle $S$.
Theorem 1 therefore implies


$|v^*A^k u| \le \frac{e}{2\pi(k + 1)}\,(2\pi n) \sup_{z \in G} |q(z)|,$


and therefore, since the supremum of $|q(z)|$ on $G$ is at most $(k + 1)r(A)$, by the
definition of $r(A)$, we have


$|v^*A^k u| \le e\,n\,r(A).$


Finally, we note that since $\|A^k\|$ is the supremum of $|v^*A^k u|$ over all vectors $u$ and
$v$ with $\|u\|_2 = \|v\|_2 = 1$, this last inequality proves the theorem. ∎
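
The two contour integrals in the proof can be checked against a direct matrix power. The sketch below is a hypothetical illustration, not taken from [9]: it assumes an upper-triangular $2 \times 2$ matrix with entries `a`, `b`, `d`, and the vectors $u = (0, 1)^T$, $v = (1, 0)^T$, for which $q(z) = b/((z - a)(z - d))$. The integrals over the circle $|z| = 1 + (k + 1)^{-1}$ are approximated by the trapezoid rule in the angle.

```python
import cmath
import math

def contour_values(a, b, d, k, samples=400):
    """Approximate (1/2 pi i) int z^k q dz and its integrated-by-parts form."""
    radius = 1.0 + 1.0 / (k + 1)
    plain = 0.0
    by_parts = 0.0
    for j in range(samples):
        z = radius * cmath.exp(2j * math.pi * j / samples)
        q = b / ((z - a) * (z - d))
        dq = -b * ((z - a) + (z - d)) / ((z - a) ** 2 * (z - d) ** 2)
        # With z = R e^{i theta}, dz = i z d(theta), so the factor 1/(2 pi i)
        # turns each integral into a plain average over theta.
        plain += z ** (k + 1) * q
        by_parts += -(z ** (k + 2)) * dq / (k + 1)
    return plain / samples, by_parts / samples

def direct_entry(a, b, d, k):
    """Entry (1, 2) of A^k for A = [[a, b], [0, d]], by repeated multiplication."""
    p11, p12, p22 = 1.0, 0.0, 1.0
    for _ in range(k):
        p11, p12, p22 = p11 * a, p11 * b + p12 * d, p22 * d
    return p12

a, b, d, k = 0.9, 1.0, 0.9, 5
plain, by_parts = contour_values(a, b, d, k)
exact = direct_entry(a, b, d, k)
print(plain, by_parts, exact)
```

Because the integrand is analytic in an annulus around the contour, the trapezoid rule converges geometrically, so a few hundred sample points are ample here.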


The Kreiss matrix theorem has been a fixture of numerical analysis since its 
appearance in 1962 and dissemination in the well-known book by Richtmyer and 
Morton [14]. It is one of the fundamental results available for establishing numeri- 
cal stability of discrete processes. 


CONCLUSION. From Buffon to Spijker to Kreiss, the pieces of our story fit 
together so neatly that it may seem there can be nothing more to say. Nevertheless, 
matters related to the Kreiss matrix theorem are subjects of active interest today, 
and in conclusion, we would like to mention a recent generalization of Theorem 3 
and an open question. 

The generalization concerns the problem of numerical stability of the “method 
of lines.” When time-dependent partial differential equations are solved numeri- 
cally by discretization, it is common to simplify the process by constructing the 
space discretization and the time discretization independently. For example, the 
Crank-Nicolson formula for solving parabolic PDEs, of which the prototype is
the heat equation $u_t = u_{xx}$, can be viewed as a second-order centered finite
difference with respect to $x$ coupled with the “trapezoid formula” with respect to
t. In more realistic problems the space discretization might involve more compli- 
cated finite difference, finite element, or spectral approximations and the time 
discretization might be accomplished by any of the familiar methods for ODEs 
such as Runge-Kutta or Adams-Bashforth formulas [6]. 
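
As a concrete instance of this splitting, here is a minimal Crank-Nicolson sketch for $u_t = u_{xx}$. The setting is an assumption chosen so that the exact solution is known: the interval $[0, \pi]$, zero boundary values, and initial data $\sin x$, for which $u(x, t) = e^{-t} \sin x$. The grid sizes and the function name are illustrative.

```python
import math

def crank_nicolson_heat(m=50, dt=0.01, steps=100):
    """Advance u_t = u_xx on [0, pi] with u = 0 at both ends, starting from sin x."""
    dx = math.pi / m
    lam = dt / dx ** 2
    u = [math.sin(j * dx) for j in range(m + 1)]
    u[0] = u[m] = 0.0
    n = m - 1                                  # number of interior unknowns
    diag, off = 1.0 + lam, -0.5 * lam          # entries of I - (lam/2) D
    for _ in range(steps):
        # Explicit half of the trapezoid rule: (I + (lam/2) D) u at interior nodes.
        rhs = [u[j] + 0.5 * lam * (u[j - 1] - 2.0 * u[j] + u[j + 1])
               for j in range(1, m)]
        # Implicit half: solve the tridiagonal system by the Thomas algorithm.
        cp = [0.0] * n
        dp = [0.0] * n
        cp[0] = off / diag
        dp[0] = rhs[0] / diag
        for i in range(1, n):
            denom = diag - off * cp[i - 1]
            cp[i] = off / denom
            dp[i] = (rhs[i] - off * dp[i - 1]) / denom
        w = [0.0] * n
        w[-1] = dp[-1]
        for i in range(n - 2, -1, -1):
            w[i] = dp[i] - cp[i] * w[i + 1]
        u = [0.0] + w + [0.0]
    return dx, u

dx, u = crank_nicolson_heat()
t_final = 0.01 * 100
max_error = max(abs(u[j] - math.exp(-t_final) * math.sin(j * dx))
                for j in range(len(u)))
print("max error at t = 1:", max_error)
```

Here the matrix $D$ is the centered second difference in $x$, and the averaging of old and new time levels is the trapezoid formula in $t$.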




According to the celebrated Lax Equivalence Theorem, the numerical solution 
computed by a consistent discretization of a well-posed linear partial differential 
equation will converge to the solution of the PDE as the mesh size shrinks to 0 if 
and only if the discretization is numerically stable [14]. (We ignore the effects of 
rounding errors.) But how does one test for numerical stability? It has recently 
been shown that for method of lines calculations, one can do it by a transplanta- 
tion of the Kreiss matrix theorem from the unit disk to the subset of C known as 
the stability region of the ODE formula [13]. One replaces the monomial A* in the 
term p(A) of Theorem 3 by the solution to a more general matrix recurrence 
relation, and the unit disk in the term r(A) of Theorem 3 by the stability region. 
The condition for stability is that the norm of the resolvent of an appropriate 
spatial discretization matrix must increase at most inverse-linearly as z approaches 
the boundary of the stability region from the outside. For numerical analysts, to 
whom stability regions of ODE formulas are as familiar as simple groups are to 
algebraists, this result provides an easy means of applying the Kreiss matrix 
theorem to a wide range of practical problems. In particular it is applicable to the 
stability analysis of the high-accuracy numerical techniques known as spectral 
methods [4], where the matrices that arise are often far from normal and difficult 
to analyze by more elementary techniques. 

The open question is, what happens to Theorem 3 if r(A) is viewed as a 
constant rather than a variable? If $r(A) = 1$, then it can be shown that the field of
values of $A$, that is, the set of Rayleigh quotients $u^*Au/\|u\|_2^2$, must lie in the
closed unit disk. By a result due originally to Lax and Wendroff and subsequently
sharpened by Halmos, Berger, and Pearcy [14], it follows that when $r(A) = 1$ we
have $p(A) \le 2$, or in other words, the factor $en$ of Theorem 3 can be replaced by
the constant 2, independently of $n$. Now, what if $r(A)$ is a constant greater than 1?
For example, what can be said about $p(A)$ if $r(A) = 2$? It is known that $p(A)$ can
no longer be bounded by a constant [11], but beyond this (for example, whether
$en$ can be improved to a quantity that grows only logarithmically in $n$) nothing is
known.


PICTURE PUZZLE
(from the collection of Paul Halmos) 


He made it possible to study compact 
groups as if they were finite. 
(see page 190.)


The book features leisurely and informative treatments of the three kinds of
geometry considered: Euclidean (curvature = 0), spherical (curvature = 1), and 
hyperbolic (curvature = —1). In each case, one begins with a study of the isome- 
tries of the simply connected space having the desired constant curvature and then 
proceeds with the study of general surfaces by means of a study of the discrete 
subgroups of the group of isometries. Of the three, hyperbolic geometry is perhaps 
the most interesting, since it is the geometry of “most’’ Riemann surfaces, and the 
connections with complex analysis are especially appealing; appropriately enough, 
it gets the lion’s share of attention. The sections labeled ‘‘Discussion” are espe- 
cially noteworthy: they furnish brief heuristic introductions to some fascinating 
connections with other areas, including elliptic functions, modular forms, and 
Riemann surfaces. Later on, the book includes some basic material about the 
topological classification of surfaces and the fundamental group. When pursuing 
the “Riemannian covering” approach to geometry adopted here, it is quite natural 
to inquire what happens if one discards the requirement that the groups of 
isometries act without fixed points. This leads to the idea of a Riemannian orbifold 
of constant curvature, a space which locally looks geometrically like the quotient of 
a simply connected space of constant curvature by a finite group of isometries; this
idea is introduced in the final chapter. It should also be noted that the book reads 
quite smoothly and is liberally furnished with helpful pictures. 

This summary suggests an answer to the second question: Why is a new book 
needed? One answer is that the approach taken here is significantly different from 
that of most available texts written at a comparable level, and it furnishes 
comparatively easy access to some very beautiful mathematics. One of the frustrat- 
ing things about teaching a one-quarter differential geometry course from one of 
the standard texts is that one scarcely has time to introduce the basic machinery 
before the term is over, and hence one never really gets very far into the geometric 
heart of the subject. For such a course, this book fills the bill admirably. Of course, 
there are inevitable drawbacks as well: the book neatly sidesteps the difficulties of 
teaching differential geometry by the rather brutal expedient of excising the 
“differential” to concentrate on the “geometry”, so that a student wishing to go 
further (say, to study traditional Riemannian geometry or general relativity) will be 
ill-equipped to do so; such a student would be well-advised to peruse do Carmo, 
O’Neill, Morgan, or even one of the many more advanced treatments of the 
subject to gain familiarity with the requisite machinery. Thus Stillwell’s book 
should be viewed as complementary to (rather than as a substitute for) the 
standard treatments, and in this role it is a welcome contribution. Most important, 
however, this book is tremendously valuable as a reminder of why geometry is so 
captivating in the first place. 


Department of Mathematics
Hanover, NH 03755


Answer to Picture Puzzle 


(p. 139) 
Alfred Haar.


A Counterexample for Germain


William C. Waterhouse 


In 1804, Sophie Germain (1776-1831) began a correspondence with Carl Friedrich 
Gauss (1777-1855). She began by writing under a male pseudonym [BD, Chap. 3], 
discussing topics in the Disquisitiones Arithmeticae [DA] that he had published in 
1801. When she disclosed her identity to him, in 1807, he wrote a response that is 
famous for its comments on the obstacles women face in learning mathematics. 
But less attention has been paid to the actual mathematical content of the letter, 
where (apart from giving a brief account of some of his current work) Gauss points 
out a mistake in one of the results Germain had sent him and gives a counterexam- 
ple. In a recent little note on the topic [MK], MacKinnon has drawn attention to 
the large size of the number involved (13 digits), saying “When I saw it I was filled
with wonder and suspicion. Is Gauss being less than honest with Germain about
the frequency of counter-examples? Surely he could never have found such a
monster as a lowest counter-example?” MacKinnon found one smaller example
but correctly noted that it has a special nature that might have made it inappropri- 
ate, and he concludes that he cannot see how Gauss could have discovered the 
example he gave. The question was echoed by Wagon [W, 319]. 

What I have written here is a kind of detective story in which we try to discover 
what really happened. We do not have the original note by Germain, so we must 
first reconstruct the relevant part of it from what Gauss says. Then we can go on to 
figure out what reasoning would have led Gauss to a counterexample. There will 
be a fair amount of evidence for the validity of our reconstruction; in particular, 
his actual counterexample will be the first one that this method would produce. I 
shall summarize most of the mathematics needed, so I hope that readers can also 
use this paper as a sort of offbeat introduction to number theory. 


1. THE COUNTEREXAMPLE AS GAUSS GAVE IT. Here is the relevant part of 
Gauss’s letter, in my translation from the original French [G, 70-74].


The taste for abstract science in general, and especially for the mysteries of numbers, is very 
rare; this is no surprise, as the enchanting charms of this lofty science only reveal themselves in 
their full beauty to those with the courage to go deeply into it. Women, by our customs and 
prejudices, must encounter infinitely more obstacles and difficulties than men do to acquaint 
themselves with these thorny investigations; and when a person of that sex is nonetheless able to 
break through these barriers and penetrate the most hidden secrets, she must undoubtedly have 
the most noble courage, quite extraordinary talent, and superior genius. Your favoring this 
science, which has added so much beauty and joy to my life, reflects honor upon it; nothing 
could give me a more flattering and unambiguous proof that its attractions are not chimerical. 

The learned notes with which your letters are so richly filled have given me countless 
pleasures. I have studied them with attention, and I am struck by the ease with which you have 
penetrated all branches of Arithmetic and perceived how to generalize and improve them. I beg 
you to take it as a proof of my attention if I dare to add a remark on one point in your last letter.
It seems to me that the converse proposition, that is, ‘if the sum of the nth powers of two numbers
is of the form hh + nff, the sum of the numbers themselves will be of that same form,’ is stated a
bit too generally. Here is an example where this rule fails:


$15^{11} + 8^{11} = 8\,649\,755\,859\,375 + 8\,589\,934\,592$


$= 8\,658\,345\,793\,967 = 1\,595\,826^2 + 11 \times 745\,391^2.$


Yet 15 + 8 = 23 cannot be reduced to the form xx + 11yy.

The same is true for the proposition: if one of the factors of the formula yy + nzz (where n is a
prime number) is of the form (1, 0, n), the other necessarily belongs to the same form. Your proof
only shows that no other indeterminate form, besides those equivalent to (1, 0, n), can give the
product (1, 0, n) when multiplied by the form (1, 0, n); but this proof does not carry over to
specific numbers. For determinant −n, let C be any class of forms that is not equivalent to the
principal class or to any anceps class; let D be the class resulting from duplication of C (which
will be different from the principal class); and finally let D′ be the class opposite to D. It follows
that the composition C + C + D′ yields the principal class. Thus if the two numbers f, g can be
represented by a form of the class C, and the number h can be represented by a form of the
class D′, the product fg × h can be reduced to (1, 0, n); but it is easy to see that fg reduces not
only to D or D′ but also to (1, 0, n). Thus here we have a case where one factor fg and the
product fg × h are of the form (1, 0, n) without the other factor having to be of that form. One
can also see easily that the first factor must be composite; otherwise, the proposition would be
true. In the example above, the factor $(15^{11} + 8^{11})/23$ contains the divisor 67.
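
The arithmetic in the letter is easy to confirm with exact integers. The sketch below is a modern check, not anything Gauss wrote; the helper `represent` is a hypothetical brute-force search for $x^2 + ny^2 = m$ and makes no attempt at efficiency.

```python
import math

def represent(m, n):
    """Return (x, y) with x*x + n*y*y == m, or None if no such pair exists."""
    y = 0
    while n * y * y <= m:
        rest = m - n * y * y
        x = math.isqrt(rest)
        if x * x == rest:
            return x, y
        y += 1
    return None

total = 15 ** 11 + 8 ** 11

# The decomposition given in the letter.
gauss_pair_ok = (1595826 ** 2 + 11 * 745391 ** 2 == total)

# 15 + 8 = 23 is not of the form xx + 11yy.
sum_representation = represent(15 + 8, 11)

# The cofactor (15^11 + 8^11)/23 is an integer divisible by 67.
cofactor, remainder = divmod(total, 23)

print(total, gauss_pair_ok, sum_representation, remainder, cofactor % 67)
```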


2. RECONSTRUCTION OF THE ORIGINAL ASSERTIONS. There are some 
good clues in this letter that allow us to reconstruct the context of the original 
assertions. First, the end of the paragraph about forms refers back to the 
numerical example. Hence we can suppose that the mistake analyzed in that 
paragraph was in fact the fault in the argument for the earlier assertion. Furthermore,
as Gauss explicitly cites the second result as assuming $n$ prime, we can
suppose that Germain also made that assumption in the first proposition. Indeed, 
as MacKinnon pointed out, there are many simple counterexamples otherwise. 

The second clue is that Gauss describes the faulty result as a converse proposi- 
tion. Thus presumably there was an earlier result stated the other way around; and 
since Gauss did not object to it, that result must be true. The hypotheses, 
therefore, must make the following statement true: 


If $a + b = x^2 + ny^2$ for some $x$ and $y$, then the same is true of $a^n + b^n$.


We already know we want $n$ prime, and the values $a = 1$, $b = 2$ show that we want
$n > 2$. It is (and was then) well known that the product of two numbers of the form
$x^2 + ny^2$ is again of that form; explicitly,


$(x^2 + ny^2)(x_1^2 + ny_1^2) = (xx_1 - nyy_1)^2 + n(xy_1 + yx_1)^2.$ (1)


For odd $n$, we know that $a + b$ divides $a^n + b^n$ formally, and so we probably want
to see when the quotient has the form $x^2 + ny^2$.

Now we can begin to get our bearings. Suppose $n$ is an odd prime. In modern
terms, Gauss had shown that the field generated over the rationals by the $n$-th
roots of unity contains $\sqrt{\pm n}$; here we must take the plus sign when $n$ is of the
form $4k + 1$ and the minus sign when $n$ is of the form $4k + 3$. One explicit
consequence of this is in [DA, Article 357], where Gauss derives a decomposition


$4\,\frac{x^n - 1}{x - 1} = Y^2 \mp nZ^2.$ (2)


These $Y$ and $Z$ are polynomials in $x$ with integer coefficients, and they can be
computed explicitly for any particular odd prime $n$. Formula (2) suggests that the
direct theorem should assume that $n$ is of the form $4k + 3$. Indeed, for (say)
$n = 5$, we can easily check that neither $3^5 + 2^5 = 275$ nor $2^5 + (-1)^5 = 31$ can be
written as $Y^2 + 5Z^2$, though $3 + 2$ and $2 + (-1)$ can be. The direct theorem then
must have been as follows.


Theorem A. Let $n$ be a prime of the form $4k + 3$. If $a + b = x^2 + ny^2$ for some
$x$, $y$, then the same is true of $a^n + b^n$.


The case n = 3 is special, because the quotient is already quadratic (see [MK]), 
so let us see how to prove this theorem for $n > 3$. The Disquisitiones notes that the
highest term in $Y$ is $2x^{(n-1)/2}$, and $Z$ has no term that high. Setting $x = 0$, we
have $4 = Y(0)^2 + nZ(0)^2$; hence the constant term in $Y$ is $\pm 2$ and the constant
term in $Z$ vanishes. Letting $x$ approach 1, we get $4n = Y(1)^2 + nZ(1)^2$, so
$Y(1) = 0$ and $Z(1) = \pm 2$. Hence an even number of terms with odd coefficients
occur in each of $Y$ and $Z$.

Now set $x = -a/b$; clearing denominators, we get a polynomial identity
expressing $4(a^n + b^n)/(a + b)$ as a square plus $n$ times a square. For $n = 11$, for
instance, the formula in (2) is given explicitly in [DA], and it yields the identity


$4\,\frac{a^{11} + b^{11}}{a + b} = (-2a^5 + a^4b + 2a^3b^2 + 2a^2b^3 + ab^4 - 2b^5)^2 + 11(a^4b - ab^4)^2.$ (3)


Our analysis of highest terms and constant terms shows in general that the
terms in the squared quantities involving purely $a$ or $b$ are even, and there are an
even number of others having odd coefficients. It follows that, whether $a$ and $b$ are
even or odd, the two squares are even. Hence we can divide to get $(a^n + b^n)/(a + b)$
as a square plus $n$ times a square. The theorem then follows from (1).
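
Identity (3) and the parity claim can be confirmed by brute force over small integers. This is only a sanity check under stated assumptions (the ranges and the names `Y11`, `Z11`, and `check_identity` are illustrative), not a proof.

```python
def Y11(a, b):
    """First squared quantity in identity (3)."""
    return (-2 * a**5 + a**4 * b + 2 * a**3 * b**2
            + 2 * a**2 * b**3 + a * b**4 - 2 * b**5)

def Z11(a, b):
    """Second squared quantity in identity (3)."""
    return a**4 * b - a * b**4

def check_identity(limit=8):
    """Verify (3) and the evenness of both quantities for |a|, |b| <= limit."""
    checked = 0
    for a in range(-limit, limit + 1):
        for b in range(-limit, limit + 1):
            if a + b == 0:
                continue
            quotient, remainder = divmod(a**11 + b**11, a + b)
            y, z = Y11(a, b), Z11(a, b)
            assert remainder == 0
            assert 4 * quotient == y * y + 11 * z * z
            assert y % 2 == 0 and z % 2 == 0
            checked += 1
    return checked

pairs_checked = check_identity()

# Halving both quantities for a = 15, b = 8 represents the cofactor of 23.
half_y, half_z = Y11(15, 8) // 2, Z11(15, 8) // 2
cofactor = (15**11 + 8**11) // 23
print(pairs_checked, half_y, half_z, half_y**2 + 11 * half_z**2 == cofactor)
```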

This proof was well within Germain’s grasp. Her first letter to Gauss [S, 
298-302; BD, 21] specifically singled out the decomposition of $(x^n - 1)/(x - 1)$
for praise, and at the end of her life she published a further note on that topic [G]. 
Thus we can be fairly sure that Theorem A was indeed her direct theorem. The 
converse theorem therefore must have read something like this: 


(Supposed) Theorem B. Let $n$ be a prime of the form $4k + 3$. If $a^n + b^n$ is of the
form $x^2 + ny^2$, the same is true of $a + b$.


Germain may or may not have realized that even numbers pose a special 
problem. For $n = 7$, for instance, 2 is not of the form $x^2 + 7y^2$, and yet we have
$2^7 + 0^7 = 4^2 + 7 \cdot 4^2$. A related counterexample with $n = 7$ and $a$ and $b$ both
positive was found by MacKinnon [MK, 350]. More generally, if $n$ is of the form
$7 + 16k$, we have $8(1 + 2k) = 1 + n$. We can then write


$2^n + (4k)^n = 2^{n-3} \cdot 8(1 + 2k) \cdot \frac{1 + (2k)^n}{1 + 2k}.$


By (1) and Theorem A, this quantity is of the form $x^2 + ny^2$; but $2 + 4k$ is not, as
it is less than $n$ and is not a square. This difficulty, however, turns out to be
restricted to the prime 2. Indeed, as we shall see, Theorem B is true for $n = 7$ so
long as $a + b$ is odd. To give the theorem the benefit of the doubt, we should
therefore assume that $a + b$ is odd.

To simplify a bit more, we can note that factors equal to $n$ are irrelevant.
Indeed, as $n = 0^2 + n \cdot 1^2$, we know by (1) that a product $nr$ is of the form
$x^2 + ny^2$ whenever $r$ is of that form. Conversely, if $x^2 + ny^2 = nr$, then we see
that $n$ must divide $x$. Setting $z = x/n$, we get $r = y^2 + nz^2$. Hence we may as well
assume that $a + b$ is not divisible by $n$. The same then will be true of $a^n + b^n$, as
$a^n + b^n \equiv (a + b)^n \pmod{n}$. (That is, the difference of the sides is divisible by $n$.)
Thus we may concentrate on the following version of the converse: 


(Supposed) Theorem B′. Let $n$ be a prime of the form $4k + 3$, and suppose $a + b$ is
not divisible by 2 or $n$. If $a^n + b^n$ is of the form $x^2 + ny^2$, the same is true of $a + b$.
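
The even counterexamples that motivate the oddness hypothesis can be reproduced directly. The following sketch assumes the family $a = 2$, $b = 4k$ with $n = 7 + 16k$ prime, and tries $k = 0$ and $k = 1$ (so $n = 7$ and $n = 23$); the brute-force helper `represent` is hypothetical and is defined here only for the check.

```python
import math

def represent(m, n):
    """Return (x, y) with x*x + n*y*y == m, or None if no such pair exists."""
    y = 0
    while n * y * y <= m:
        rest = m - n * y * y
        x = math.isqrt(rest)
        if x * x == rest:
            return x, y
        y += 1
    return None

results = []
for k in (0, 1):
    n = 7 + 16 * k
    a, b = 2, 4 * k
    power_sum = a ** n + b ** n
    # The power sum is of the form x^2 + n y^2 although a + b is not.
    results.append((n, represent(power_sum, n), represent(a + b, n)))

for n, rep_powers, rep_sum in results:
    print(n, rep_powers, rep_sum)
```

In both cases $a + b = 2 + 4k$ is even, so neither is a counterexample to Theorem B′.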


3. QUADRATIC FORMS AND THEIR COMPOSITION. To understand the dis- 
cussion in Gauss’s letter, we need to review a bit of his theory of quadratic forms. I 
shall use the notation common nowadays (see for instance [D], [B], [BS], or [C]); it 
is slightly different from that of Gauss, but it does not introduce any serious 
conceptual differences. 

The forms are polynomials $aX^2 + bXY + cY^2$ with $a, b, c$ all integers. An
integer $t$ is represented by such a form if $t = ax^2 + bxy + cy^2$ for some integers
$x, y$. Such a form is equivalent to the forms obtained as


$a(pX + qY)^2 + b(pX + qY)(rX + sY) + c(rX + sY)^2$


where $p, q, r, s$ are integers and $ps - qr = \pm 1$ (the condition allowing us to
reverse the integral change of variables). Those forms obtained by such a change of
variables with $ps - qr = 1$ are called properly equivalent. Since the change of
variables is invertible, a number represented by one form is also represented by all
equivalent forms. The number $D = b^2 - 4ac$ is called the discriminant of the
form; it is easy to compute that equivalent forms have the same discriminant. A
form is called primitive if there is no nontrivial common divisor of its coefficients, 
and again this property is preserved under equivalence. 

When the form is positive for all nonzero $X$ and $Y$, like $X^2 + nY^2$, then $D$ is
negative. There is then a straightforward procedure (as efficient as the Euclidean 
algorithm) to reduce each form to a properly equivalent form satisfying the 
inequalities $-a \le b < a$ and $a \le c$, with $a \ne c$ when $b > 0$. Using this, you can
easily see that there are only finitely many different proper equivalence classes 
with a given discriminant D. In fact, it is also true that no two of these “‘reduced”’ 
forms are properly equivalent, and so for any D we can routinely determine the 
different proper equivalence classes. For instance, if $D = -44$, there are four
classes, corresponding to the reduced forms $X^2 + 11Y^2$, $3X^2 + 2XY + 4Y^2$,
$3X^2 - 2XY + 4Y^2$, and $2X^2 - 2XY + 6Y^2$. The first three are primitive.
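
The list for $D = -44$ can be reproduced by a short search. The sketch below assumes the normalization $-a \le b < a$, $a \le c$, with $b \le 0$ when $a = c$, which is the convention consistent with the four forms just listed; other texts use the mirror-image convention. It relies on the bound $3a^2 \le |D|$, which follows from $|b| \le a \le c$. The function name is illustrative.

```python
from math import gcd

def reduced_forms(D):
    """Reduced positive definite forms (a, b, c) with b*b - 4*a*c == D < 0."""
    forms = []
    a = 1
    while 3 * a * a <= -D:             # |b| <= a <= c forces 3a^2 <= |D|
        for b in range(-a, a):         # -a <= b < a
            numerator = b * b - D
            if numerator % (4 * a) == 0:
                c = numerator // (4 * a)
                if c > a or (c == a and b <= 0):
                    forms.append((a, b, c))
        a += 1
    return forms

forms_44 = reduced_forms(-44)
primitive_44 = [f for f in forms_44 if gcd(gcd(f[0], f[1]), f[2]) == 1]
print(forms_44)
print(primitive_44)
```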

The simplest way to get a change of variables with $ps - qr = -1$ is to change
the sign of one variable; this amounts to changing the sign of the central coefficient
$b$, and the result is called the opposite form. If two forms are properly equivalent,
so are their opposites, and thus we have an operation on classes. Forms in the class
of the principal form $X^2 + nY^2$ (which is (1, 0, n) in Gauss’s notation) are
properly equivalent to their opposites. If forms in another class have this property, 
the class is called ambiguous, or (in Gauss’s Latin) anceps. 

In the previous section, we wrote out a formal identity (1) giving the product of
$(x^2 + ny^2)$ and $(x_1^2 + ny_1^2)$. For $D = -44$, here are two more identities:


$(3x^2 + 2xy + 4y^2)(3x_1^2 + 2x_1y_1 + 4y_1^2) = 3r^2 - 2rs + 4s^2$ with


$r = xx_1 + 2xy_1 + 2yx_1$ and $s = -xx_1 + xy_1 + yx_1 + 2yy_1$, (4)

and

$(3x^2 + 2xy + 4y^2)(3x_1^2 - 2x_1y_1 + 4y_1^2) = u^2 + 11v^2$ with

$u = 3xx_1 - xy_1 + yx_1 - 4yy_1$ and $v = xy_1 + yx_1$. (5)


We can think of these as ways of composing two forms with the same D to get 
another one. Such expressions exist in general, but there is a very subtle difficulty 
involved. Clearly, for instance, we could take (4) and change the sign of $y_1$, getting
a valid identity with corresponding changes of signs in the expressions for $r$ and $s$.
Thus we would get expressions of the product in (5) by the two inequivalent forms
$X^2 + 11Y^2$ and $3X^2 - 2XY + 4Y^2$. Gauss discovered that if we put suitable sign
restrictions on the coefficients in the expressions of the new variables like r and s, 
then in fact the proper equivalence class of the form giving the product will 
depend only on the proper equivalence classes of the two factors. Thus we get a 
composition of proper equivalence classes. (Formulas (4) and (5) satisfy the 
restrictions.) A composite of primitive classes is primitive, and in modern terms the 
primitive proper equivalence classes form a commutative group under composition. 
The identity element is given by the principal class, and the opposite of a form is in 
the inverse class. Composing a form with itself is what Gauss called duplication. 
Formulas (4) and (5) show that the group for D = — 44 is cyclic of order 3. 
Finally, we need one fact about representations. 


Lemma. Let M be relatively prime to 4n, and write $M = K^2 M_1$, where $M_1$ has no
repeated factors. Then M is represented by some form of discriminant $-4n$ if and
only if there is some number r with $r^2 + n$ divisible by $M_1$.


Proof: If such an r exists, then the form $M_1X^2 + 2rXY + ((r^2 + n)/M_1)Y^2$ has
discriminant $-4n$ and represents M (with X = K, Y = 0). For the converse, say
$M = ax^2 + bxy + cy^2$ with $b^2 - 4ac = -4n$. Clearly then b is even. Let d be the
greatest common divisor of x and y, and find s and t with sx + ty = d. Direct
computation using our expression for M shows that

$$M(at^2 - bst + cs^2) = \bigl(s(xb/2 + yc) - t(xa + yb/2)\bigr)^2 + (ac - b^2/4)(sx + ty)^2.$$

Dividing by $d^2$ and observing that $ac - b^2/4 = n$, we see that n plus a square is
divisible by $M/d^2$ and hence by $M_1$.
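For readers who want to see the lemma in action, here is a small brute-force check for n = 11, offered as a sketch under stated assumptions rather than anything taken from [DA]. It assumes that "represented by some form of discriminant $-44$" may be tested on the four reduced forms listed earlier (every form is equivalent to a reduced one); the helper names are hypothetical.

```python
from math import gcd, isqrt

N = 11  # we test discriminant -4n = -44

# The four reduced forms (a, b, c) of discriminant -44 quoted in the text,
# each standing for a*x^2 + b*x*y + c*y^2.
REDUCED_FORMS = [(1, 0, 11), (3, 2, 4), (3, -2, 4), (2, -2, 6)]


def is_represented(m):
    """True if m is a value of one of the reduced forms (brute force)."""
    bound = isqrt(m) + 1  # |x|, |y| <= sqrt(m) suffices for these forms
    for a, b, c in REDUCED_FORMS:
        for x in range(-bound, bound + 1):
            for y in range(-bound, bound + 1):
                if a * x * x + b * x * y + c * y * y == m:
                    return True
    return False


def squarefree_part(m):
    """Return M_1, where m = K^2 * M_1 and M_1 has no repeated factors."""
    d = 2
    while d * d <= m:
        while m % (d * d) == 0:
            m //= d * d
        d += 1
    return m


def has_root(m1):
    """True if some r has r^2 + n divisible by m1."""
    return any((r * r + N) % m1 == 0 for r in range(m1))


def lemma_holds_up_to(limit):
    """Compare both sides of the lemma for every m <= limit prime to 4n."""
    for m in range(1, limit + 1):
        if gcd(m, 4 * N) != 1:
            continue
        if is_represented(m) != has_root(squarefree_part(m)):
            return False
    return True
```

For example, 7 fails both tests (no r has $r^2 + 11$ divisible by 7, and no form represents 7), while 45 = 9 · 5 passes both.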

The converse part of this argument was given right at the start of Gauss’s 
treatment of forms [DA, Art. 154]. Still earlier material [DA, Art. 105] shows that 
such an r exists if and only if, for every prime $p_i$ dividing $M_1$, there is some $r_i$ with
$r_i^2 + n$ divisible by $p_i$. A simple count of powers now shows the following result:


Theorem C. Let M = BC be a number relatively prime to 4n. If M and B are 
represented by forms of discriminant —4n, so is C. 


This may well have been in Germain’s note. Observe that every number 
represented by a non-primitive form has a factor in common with the discriminant, 
and so only primitive forms will be candidates for representing M. If there is just 
one proper equivalence class of primitive forms (necessarily the principal class), 
then of course it follows that when M and B are of the form $x^2 + ny^2$, the same is
true of the other factor C. This is true for n = 3 and for n = 7, and thus
Germain’s Theorem B’ is true when n is 3 or 7.

4. LOCATING THE ERROR. The lemma above shows that $M_1$ will be represented
by a form of discriminant $-4n$ precisely when the primes in it are so
represented. When a prime number (different from 2 and n) is represented by a 
form of discriminant —4n, it is represented by forms in a unique equivalence class 
[DA, Art. 168]; but that usually means forms in two (inverse) proper equivalence 
classes. Occasionally, of course, only one proper class occurs; this happens pre- 
cisely when the class is (principal or) ambiguous. We can use the representations 
of the primes to build up a representation of a general M by compositions. Since 
we can choose either of the two classes (when they are distinct), we usually get 
several different classes of forms that represent the same number M. It is the 
distinction between equivalence and proper equivalence that makes this happen, 
and thus it is one of the more subtle parts of the theory. And it was here that 
Germain made her mistake. Gauss says explicitly that she tried to prove something 
like this: 


(Supposed) Theorem D. If M = BC is prime to 4n and both M and B are
represented by $X^2 + nY^2$, then so is C.


As we saw, she could correctly have proved that C is represented by some 
primitive form of discriminant —4n. To derive Theorem D (and thus Theorem B’), 
she would then have to show that this form was in the principal class. It appears 
that she tried to do this using the group structure on the classes. As Gauss says, it 
is indeed true that there is no formal composition for any other form; that is, if we 
have an identity 


$$f(x, y)(x_1^2 + ny_1^2) = u^2 + nv^2$$


with u and v bilinear combinations of $x, y, x_1, y_1$ as before, then f(x, y) must be
in the principal class [DA, Art. 249]. But because representations of the same 
number can be built up to come from different classes, the formal argument fails 
to establish a corresponding result for specific numbers. 

Following Gauss’s suggestion, we can easily find explicit counterexamples to 
Theorem D as soon as we find an n where not all the classes are ambiguous. The 
first such case is n = 11. Take, for instance, the first three primes represented by 
$3X^2 + 2XY + 4Y^2$ (and hence not by $X^2 + 11Y^2$); they are

$$\begin{aligned}
3 &= 3(1)^2 + 2(1)(0) + 4(0)^2 \\
5 &= 3(1)^2 + 2(1)(-1) + 4(-1)^2 \\
23 &= 3(1)^2 + 2(1)(2) + 4(2)^2.
\end{aligned} \tag{6}$$


In Gauss’s notation at the end of Section 1, these numbers will be f, g, and h. 
Direct composition of the first two, as in (4), gives us 


$$15 = 3 \cdot 5 = 3(-1)^2 - 2(-1)(-2) + 4(-2)^2.$$
Composition of that result with the expression for 23, as in (5), gives us
$$3 \cdot 5 \cdot 23 = (13)^2 + 11(-4)^2.$$
But we can reverse a sign to get the other expression

$$5 = 3(1)^2 - 2(1)(1) + 4(1)^2;$$
and composing this with the expression for 3, as in (5), gives
$$15 = (2)^2 + 11(1)^2.$$

Thus both 15 and $15 \cdot 23$ are represented by $X^2 + 11Y^2$, but 23 is not.
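The arithmetic of this counterexample is easy to replay. The sketch below (illustrative names, not from the source) applies the substitutions of (4) and (5) to the representations in (6) and then searches directly for representations by $X^2 + 11Y^2$.

```python
from math import isqrt


def compose_4(x, y, x1, y1):
    """(r, s) from identity (4): product of two values of 3X^2+2XY+4Y^2."""
    return (x * x1 + 2 * x * y1 + 2 * y * x1,
            -x * x1 + x * y1 + y * x1 + 2 * y * y1)


def compose_5(x, y, x1, y1):
    """(u, v) from identity (5): (3X^2+2XY+4Y^2)(3X^2-2XY+4Y^2) = u^2+11v^2."""
    return (3 * x * x1 - x * y1 + y * x1 - 4 * y * y1,
            x * y1 + y * x1)


def principal_representation(m):
    """Return (x, y) with x, y >= 0 and x^2 + 11y^2 == m, or None."""
    for y in range(isqrt(m // 11) + 1):
        rest = m - 11 * y * y
        x = isqrt(rest)
        if x * x == rest:
            return (x, y)
    return None


# 3 at (1, 0) and 5 at (1, -1), composed as in (4): 15 = 3r^2 - 2rs + 4s^2.
r, s = compose_4(1, 0, 1, -1)
# 23 at (1, 2) composed with that expression of 15, as in (5).
u, v = compose_5(1, 2, r, s)
# 3 at (1, 0) composed with the sign-reversed expression 5 at (1, 1).
u2, v2 = compose_5(1, 0, 1, 1)
```

Here `principal_representation(23)` returns `None`, while 15 and 345 both have representations, which is exactly the failure of the supposed Theorem D.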

This example also shows that, if we have primes represented by $3X^2 + 2XY + 4Y^2$,
we can combine them either in pairs or in triples to get numbers represented
by $X^2 + 11Y^2$. By (1), then, it is easy to see that we actually have the following
result:


Theorem E. Let M be a number that is not divisible by 2 or 11. Suppose
$M = p_1 p_2 \cdots p_r$ is a product of primes each representable by a form of
discriminant $-44$. Then M is represented by $X^2 + 11Y^2$ except when there is exactly one of
the $p_i$ not represented by $X^2 + 11Y^2$.


5. LOCATING A COUNTEREXAMPLE. We have shown by example that the 
supposed Theorem D is false. It was used in the argument for Theorem B’, and 
thus that proof is invalid, but we do not yet know that Theorem B’ is false. How 
might Gauss have searched for a counterexample? It would be most natural to try 
following the same pattern; that is, we should try to take a + b to be a prime
represented by $3X^2 + 2XY + 4Y^2$. We know $(a^{11} + b^{11})/(a + b)$ is represented
by $X^2 + 11Y^2$, and Theorem E shows that we just need to have it divisible by
some other prime p that is represented by $3X^2 + 2XY + 4Y^2$. We can also recall
Fermat’s result (see [WL]) that such a p will have to be of the form 11k + 1. (The
point is that there is an $x \not\equiv 1$ with $ax \equiv -b \bmod p$; then if p divides $a^{11} + b^{11}$,
we have $x^{11} \equiv 1 \bmod p$. But $x^{p-1} \equiv 1$ by Fermat’s theorem, and so 11 divides
p − 1.) Thus our previous work gives us the following way of searching for an
example: 


1) Take a prime represented by $3X^2 + 2XY + 4Y^2$, and write it in all ways as
a sum a + b.

2) Take another prime p of the form 11k + 1 represented by $3X^2 + 2XY + 4Y^2$.

3) Test whether $a^{11}$ is congruent to $-b^{11}$ modulo p.


When the congruence holds, then both p and a + b will divide $a^{11} + b^{11}$, and
Theorem E will tell us that $a^{11} + b^{11}$ is represented by $X^2 + 11Y^2$.
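The three steps above can be sketched as a short brute-force search. This is only an illustration: the cut-offs `sum_limit` and `p_limit` are our assumptions (chosen to cover the primes 3, 5, 23 and 67 that enter the search), and membership in the class of $3X^2 + 2XY + 4Y^2$ is tested by direct enumeration rather than by any method from [DA].

```python
from math import isqrt


def is_prime(n):
    """Trial-division primality test, adequate for the tiny range used here."""
    return n >= 2 and all(n % d for d in range(2, isqrt(n) + 1))


def represented_by_3_2_4(m):
    """True if m = 3x^2 + 2xy + 4y^2 for some integers x, y (brute force)."""
    bound = isqrt(m) + 1
    return any(3 * x * x + 2 * x * y + 4 * y * y == m
               for x in range(-bound, bound + 1)
               for y in range(-bound, bound + 1))


def search(sum_limit=30, p_limit=70):
    """Return all (a, b, p) with a > b >= 1 passing steps 1) to 3)."""
    primes = [m for m in range(2, p_limit)
              if is_prime(m) and represented_by_3_2_4(m)]
    sums = [q for q in primes if q < sum_limit]       # step 1: candidates for a + b
    moduli = [p for p in primes if p % 11 == 1]       # step 2: primes p = 11k + 1
    hits = []
    for p in moduli:
        for total in sums:
            if total == p:
                continue                              # p must be "another" prime
            for b in range(1, total // 2 + 1):
                a = total - b
                # step 3: is a^11 congruent to -b^11 modulo p?
                if (pow(a, 11, p) + pow(b, 11, p)) % p == 0:
                    hits.append((a, b, p))
    return hits
```

With these cut-offs the search returns two hits, both with a + b = 23 and p = 67.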

The first primes represented by $3X^2 + 2XY + 4Y^2$ are 3, 5, and 23. The first
ones also of the form 11k + 1 are 23 and 67. Obviously a + b = 3 and p = 23
should be tried first. The only decomposition is a = 2, b = 1. It is easy to compute
powers modulo a small number, and we find that $2^{11} \equiv 1 \bmod 23$. Thus there is no
example there. Similarly, we see that $4^{11} \equiv 1 \equiv 1^{11} \bmod 23$ and
$3^{11} \equiv 1 \equiv 2^{11} \bmod 23$, so we have no examples with p = 23 and a + b = 5. We next move to
p = 67, trying first a + b = 3 and then a + b = 5. There are still no solutions. But
we have a wider range of possibilities with a + b = 23; and if we start from b = 1 
and work our way up, we do find an example, at b = 8, a = 15. I repeat that 
checking these facts requires only computations of powers modulo 67, which are 
quite easy. (They are trivial if you have once computed a “table of indices” modulo
67, as described in [DA, Art. 58] and given in [DA, Table 1].) Thus relatively simple 
computation has led us to the following result, which we have fully established: 


Theorem F. The number $15^{11} + 8^{11}$ is represented by $X^2 + 11Y^2$, but 15 + 8 = 23
is not.

To confirm that we have been on the right trail, we can now observe that the
one thing Gauss mentioned about his counterexample was that $(15^{11} + 8^{11})/23$ is
divisible by 67.


6. COMPUTING THE COUNTEREXAMPLE. Gauss could have stopped at this 
point in his work, but it is not surprising that he did not. He was always fond of 
computation, and the counterexample will clearly be more immediately convincing 
if it is displayed rather than deduced. Furthermore, he had available a quite simple 
method for finding the expression of $15^{11} + 8^{11}$ as a square plus 11 times a square;
it merely applies suitable compositions to the formula that started the whole 
discussion. 
Let us recall what we know. First, we have 


$$15^{11} + 8^{11} = 23 \cdot 67 \cdot M$$
for some integer M. (You can compute that M = 5 618 653 987.) If we can
represent M as $3X^2 + 2XY + 4Y^2$, then we can compose that representation with
representations of 23 and 67 to get $15^{11} + 8^{11}$ as $X^2 + 11Y^2$.

It is in fact possible to take the value of M and solve this representation 
question from scratch (see Section 7.2). But we can do much better here, because 
we already have an expression for 67M. Indeed, setting a = 15 and b = 8 in
formula (3) above, we get

$$67M = (15^{11} + 8^{11})/23 = (-227\,723)^2 + 11(171\,780)^2. \tag{7}$$


Now we can work backwards: if we have $M = 3x^2 + 2xy + 4y^2$, and
$67 = 3(3)^2 - 2(3)(4) + 4(4)^2$, composition gives us the expression $67M = u^2 + 11v^2$ with
$u = 5x - 13y$ and $v = 4x + 3y$. We can solve to get $x = (3u + 13v)/67$ and
$y = (-4u + 5v)/67$. Having (7), we try $u = \pm 227\,723$ and $v = \pm 171\,780$ to find
values making x and y integers. Taking both u and v positive, we get the solution


$$M = 3(43\,527)^2 + 2(43\,527)(-776) + 4(-776)^2. \tag{8}$$

All we need to do now is to compose the terms differently, first combining 

$$23 = 3(1)^2 + 2(1)(2) + 4(2)^2$$
and
$$67 = 3(3)^2 + 2(3)(-4) + 4(-4)^2$$
to get
$$23 \cdot 67 = 3(7)^2 - 2(7)(-17) + 4(-17)^2.$$
Composing this then with (8), we get
$$15^{11} + 8^{11} = 23 \cdot 67 \cdot M = X^2 + 11Y^2,$$
where
$$X = 3(43\,527)(7) - (43\,527)(-17) + (-776)(7) - 4(-776)(-17) = 1\,595\,826$$
and
$$Y = (43\,527)(-17) + (-776)(7) = -745\,391.$$


Thus we have recovered exactly the example Gauss gave, and we have done it by 
methods all drawn from [DA]. 
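The whole computation of this section fits in a few lines of exact integer arithmetic. The sketch below simply replays the steps described above; the function names are ours, and the values of u and v are the ones quoted in (7).

```python
def f_plus(x, y):
    """3x^2 + 2xy + 4y^2."""
    return 3 * x * x + 2 * x * y + 4 * y * y


def f_minus(x, y):
    """3x^2 - 2xy + 4y^2."""
    return 3 * x * x - 2 * x * y + 4 * y * y


def compose_4(x, y, x1, y1):
    """(r, s) of identity (4)."""
    return (x * x1 + 2 * x * y1 + 2 * y * x1,
            -x * x1 + x * y1 + y * x1 + 2 * y * y1)


def compose_5(x, y, x1, y1):
    """(u, v) of identity (5)."""
    return (3 * x * x1 - x * y1 + y * x1 - 4 * y * y1,
            x * y1 + y * x1)


def gauss_counterexample():
    """Replay Section 6 and return (M, (x, y), (r, s), (X, Y))."""
    n = 15 ** 11 + 8 ** 11
    m, rem = divmod(n, 23 * 67)
    assert rem == 0

    # Work backwards from (7): invert u = 5x - 13y, v = 4x + 3y.
    u, v = 227723, 171780            # both signs taken positive, as in the text
    x, rx = divmod(3 * u + 13 * v, 67)
    y, ry = divmod(-4 * u + 5 * v, 67)
    assert rx == 0 and ry == 0 and f_plus(x, y) == m      # this is (8)

    # Compose 23 at (1, 2) with 67 at (3, -4), as in (4).
    r, s = compose_4(1, 2, 3, -4)
    assert f_minus(r, s) == 23 * 67

    # Compose (8) with that expression of 23 * 67, as in (5).
    big_x, big_y = compose_5(x, y, r, s)
    assert big_x ** 2 + 11 * big_y ** 2 == n
    return m, (x, y), (r, s), (big_x, big_y)
```

The internal assertions check each intermediate claim, so a successful call confirms the chain from (7) through (8) to the final representation.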


7. SIDE ISSUES 


7.1. We can find a substantially smaller example if we are willing to allow one of a
and b to be negative (there is nothing gained by taking them both negative).
Indeed, if we resume the search in Section 5 with p = 23 and a + b = 3, we
observe that $4^{11} \equiv 1 \bmod 23$. Then we deduce as before that $4^{11} + (-1)^{11}$ is
represented by $X^2 + 11Y^2$, while of course 3 is not. Readers might enjoy using the
method from Section 6 to find the explicit representation

$$4^{11} + (-1)^{11} = 3 \cdot 23 \cdot 60\,787 = 82^2 + 11(617)^2.$$

It is also possible to find another positive example by continuing the search in
Section 5 with a + b = 23 and p = 67; one finds that a = 13, b = 10 also works,
and we have

$$13^{11} + 10^{11} = 1\,892\,160\,394\,037.$$

This is a little smaller than our previous example, but still of the same order of
magnitude. The procedure of Section 6 shows us that

$$\frac{13^{11} + 10^{11}}{23} = (125\,212)^2 + 11(77\,805)^2,$$

whence

$$\frac{13^{11} + 10^{11}}{23 \cdot 67} = 3(20\,703)^2 + 2(20\,703)(-1669) + 4(-1669)^2$$

and finally

$$13^{11} + 10^{11} = (661\,539)^2 + 11(363\,634)^2.$$


7.2. We can use the value M = 5 618 653 987 from Section 6 to illustrate the
method Gauss gives [DA, Art. 322] for finding representations by $3X^2 + 2XY + 4Y^2$
from scratch. The first step is rather like that in Section 6; we observe that
such an expression will give us

$$3M = Z^2 + 11Y^2 \quad\text{with}\quad Z = 3X + Y.$$

Thus it will suffice to find such representations of 3M. There is an obvious bound

$$|Y| < \sqrt{3M/11} < 39\,146,$$

and any particular Y can be checked to determine whether $3M - 11Y^2$ is a
square, but there are too many cases to check by hand.

Gauss’s method now is a sort of sieve which he called the use of “eliminating
numbers” and which is now called Gaussian exclusion [B, p. 194]. We take various
small moduli E and determine conditions on Y modulo E that follow from the
fact that $3M - 11Y^2$ is congruent to a square modulo E. For instance, in our case,
take E = 8. We have $Y^2 \equiv 0, 1, 4 \bmod 8$, while we have $3M \equiv 1 \bmod 8$ and
$-11 \equiv 5 \bmod 8$. Thus $3M - 11Y^2 \equiv 1, 6, 5 \bmod 8$. As only 1 among these is a square
modulo 8, we see that $Y^2 \equiv 0 \bmod 8$, which tells us that Y is divisible by 4.
Similarly, if we take E = 25, we have $3M - 11Y^2 \equiv 6^2(1 - Y^2) \bmod 25$, so we
want $1 - Y^2$ to be congruent to a square. The squares modulo 25 are $0, \pm 1,
\pm 4, \pm 6, \pm 9, \pm 11$. The condition on $1 - Y^2$ forces $Y^2 \equiv 0, 1 \bmod 25$, so we see
that either Y is divisible by 5 or $Y \equiv \pm 1 \bmod 25$. Combining this result with
divisibility by 4, we see that Y must have one of the following three forms:


Y = 20W, Y = 100W + 24, or Y = 100W + 76.


Clearly this analysis has reduced the number of values to be checked; in the last 
two cases, for instance, we have W < 320. We can continue exclusion with other 
moduli until we feel the number of values is reasonable for checking. Gauss 
indicates that he was comfortable using up to 9 or 10 moduli, and that number of
steps will suffice here. For Y = 100W + 24, for instance, we may use moduli 3, 7,
13, 17, 19, 23, and 29; that leaves 10 values of W. The test at modulus 31 
eliminates all W except 222, 239, and 304. These can be checked individually, and 
it turns out that none of them work. Similarly, for Y = 100W + 76, the moduli 3, 7, 
13, 17, 19, 23, and 29 leave 12 values of W, and the test at modulus 31 eliminates 
all W but 7, 33, 126, 239, and 315, which we can check. It turns out that W = 7 
works; we get 


$$3M = (\pm 129\,805)^2 + 11(\pm 776)^2,$$


whence
$$M = 3(43\,527)^2 + 2(43\,527)(-776) + 4(-776)^2.$$


That, of course, is the solution we found earlier. If we want all solutions, we can 
attack the other case. For Y = 20W, we have W < 1859, and testing with the 
primes through 31 still leaves 25 values. The test at 37 cuts this down to 12 values, 
and the one at 41 then restricts W to be 538, 943, 944, 1541, or 1553. Again we can 
check these, and it turns out that 1553 works; we get 


$$3M = (\pm 79\,019)^2 + 11(\pm 31\,060)^2,$$


whence
$$M = 3(36\,693)^2 + 2(36\,693)(-31\,060) + 4(-31\,060)^2.$$


Thus in fact there is also another representation of M. If we run through the 
computations for it, we get another expression for our basic counterexample: 


$$15^{11} + 8^{11} = (935\,166)^2 + 11(841\,201)^2.$$
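The exclusion method of this subsection is easy to mimic on a machine. The following is a rough sketch, not Gauss's own bookkeeping: it sieves every Y in the admissible range at once instead of splitting into the three cases, and the list of moduli is an assumption modelled on the ones named in the text.

```python
from math import isqrt

M = 5618653987
TARGET = 3 * M  # we look for Y with TARGET - 11*Y^2 a perfect square


def allowed_residues(modulus):
    """Residues y mod `modulus` for which TARGET - 11*y^2 is a square mod `modulus`."""
    squares = {(z * z) % modulus for z in range(modulus)}
    return {y for y in range(modulus)
            if (TARGET - 11 * y * y) % modulus in squares}


def exclusion_sieve(moduli=(8, 25, 3, 7, 13, 17, 19, 23, 29, 31)):
    """Return (survivors, solutions) among 0 <= Y <= sqrt(TARGET / 11)."""
    y_max = isqrt(TARGET // 11)
    tables = [(e, allowed_residues(e)) for e in moduli]
    # Keep only the Y that pass every "eliminating number".
    survivors = [y for y in range(y_max + 1)
                 if all(y % e in ok for e, ok in tables)]
    # The few survivors are then tested exactly.
    solutions = []
    for y in survivors:
        rest = TARGET - 11 * y * y
        if isqrt(rest) ** 2 == rest:
            solutions.append(y)
    return survivors, solutions
```

The residue sets for E = 8 and E = 25 reproduce the hand analysis (Y divisible by 4; Y divisible by 5 or congruent to ±1 modulo 25), and the exact test on the survivors leaves only Y = 776 and Y = 31 060.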


7.3. The computations in 7.2 could be shortened (using composition of forms) if 
we knew a factorization of M. But, as Gauss pointed out [DA, Art. 333], we can 
actually reverse that procedure and use our two different representations by 
$3X^2 + 2XY + 4Y^2$ to find factors of M. The proof of the lemma before Theorem
C shows that each such representation can be rewritten to give a number whose
square is congruent to $-11$ modulo M; specifically, if rX + sY = 1, then


$$[r(X + 4Y) - s(3X + Y)]^2 \equiv -11 \bmod M.$$


For the solution X = 43 527, Y = −776, the Euclidean algorithm gives r = −153,
s = −8582, and we get the congruence


$$(1\,107\,801\,791)^2 \equiv -11 \bmod M.$$
The solution X = 36 693, Y = −31 060 similarly gives the congruence


$$(3\,256\,684\,733)^2 \equiv -11 \bmod M.$$


Now whenever $u^2 \equiv v^2 \bmod M$, each prime dividing M divides either u + v or
u − v; and so long as $u \not\equiv \pm v \bmod M$, neither of $u \pm v$ is divisible by M itself.
Thus we can find nontrivial factors of M as the greatest common divisors of u + v,
M and of u − v, M. In our case, the Euclidean algorithm shows that the greatest
common divisor of 3 256 684 733 − 1 107 801 791 and M is 235 159, while that of
3 256 684 733 + 1 107 801 791 and M is 23 893. And indeed you can check that


$$M = 235\,159 \cdot 23\,893.$$
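As an illustration of this factoring trick, here is a minimal Python sketch. The helper names are hypothetical; the extended Euclidean routine may return a different pair (r, s) from the one quoted above, but the resulting square root of $-11$ is the same modulo M, because changing (r, s) by a multiple of (Y, −X) changes the bracket by a multiple of $3X^2 + 2XY + 4Y^2 = M$.

```python
from math import gcd

M = 5618653987


def ext_gcd(a, b):
    """Return (g, r, s) with r*a + s*b == g and g = gcd(a, b) >= 0."""
    old_r, r = a, b
    old_s, s = 1, 0
    old_t, t = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_s, s = s, old_s - q * s
        old_t, t = t, old_t - q * t
    if old_r < 0:  # normalise the sign so that g is non-negative
        old_r, old_s, old_t = -old_r, -old_s, -old_t
    return old_r, old_s, old_t


def sqrt_of_minus_11(x, y, modulus=M):
    """From modulus = 3x^2 + 2xy + 4y^2, produce w with w^2 = -11 (mod modulus)."""
    g, r, s = ext_gcd(x, y)
    if g != 1:
        raise ValueError("x and y must be coprime")
    return (r * (x + 4 * y) - s * (3 * x + y)) % modulus


def split(modulus=M):
    """Return (u, v, gcd(u - v, M), gcd(u + v, M)) for the two representations."""
    v = sqrt_of_minus_11(43527, -776, modulus)
    u = sqrt_of_minus_11(36693, -31060, modulus)
    return u, v, gcd(u - v, modulus), gcd(u + v, modulus)
```

Calling `split()` recovers the two square roots quoted above and the two factors of M.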
Department of Mathematics
The Pennsylvania State University 
University Park, PA 16802 


Is Algebra Useful? 


I would like to share with you (and, if you deem it useful, with the AMM readership) some
considerations that occurred to me upon reading an article in the latest AMM issue.

At the end of his interesting article “Two-Year Magazine Subscription Rates” (Vol 100, #1, January 
1993), Underwood Dudley states that “‘...it is better to present mathematics to students as a glorious 
adventure for the mind... That it has uses is important, but incidental. Few students will use it, but all 
can see some of its glory.” 

My initial reaction to this was one of incredulity: hasn’t the author noticed that the majority of 
people fail to identify the glory of mathematics and are in fact proud to profess a deep disdain for it? 
That most people tolerate the math requirements in school only because of educators’ assurances that 
one day those mathematical skills will be useful? Expecting students to take mathematics courses to
appreciate the glory of the subject would not be much different from expecting us to take pottery 
courses to appreciate the glory of human manipulative creation. Some would enjoy it (including me) 
but most would object to the rationale for imposing such an experience. 

My dismay, however, slowly turned to keen interest in what this statement really means: finally 
somebody is realizing, and admitting, that much of what we mathematicians have imposed on society as 
an essential part of knowledge can in fact be dispensed with; that people can live meaningful, 
productive and happy lives without using any mathematics beyond grade 3; that we should start 
thinking seriously about who needs what mathematical knowledge; and that we should start providing 
real rationales for each of the topics we want to have taught at any level. 

Debates have raged recently between popular media and mathematicians about whether algebra is 
really useful. The arguments brought forward in defense of algebra were quite convincing to me, until I 
realized that I understood them because I am a mathematician, but they would not have convinced me 
if I weren’t. 

Hopefully some of us have initiated a move away from our ivory tower and towards the real world, 
where people use math, high power math, when needed, but do away with it whenever possible. Do you 
think that I am wrong? Then ask yourself when was the last time you used mildly advanced techniques 
for any purpose that had nothing to do with your job. 

I thank you for your attention and I look forward to hearing from you.


Dr. Roberto Bencivenga 
Learning Assistance Centre 
Red Deer College 

Box 5005 

Red Deer, Alberta 

Canada, T4N 5H5


Cubic Equations, or Where Did the
Examination Question Come From?


H. B. Griffiths and A. E. Hirst 


1. INTRODUCTION. Cubic equations can give rise to a great deal of interesting 
elementary mathematics, and at one time 18 year-old students were expected to 
know something about them. This paper arises from thinking about a (very old) 
examination question on a cubic, and how the Examiner came to set it. One of us 
approached it with an interaction method, using a micro, and the other a “bare 
hands” traditional technique, and we then compared our efforts. This led us to 
find infinite families of related examination questions. As a by-product we found 
infinite families of explicit solutions to the Diophantine problem: find integers p, q 
for which $4p^3 - 27q^2$ is the square of an integer.

One measure of the importance of cubic equations is suggested in the following 
paragraph from the book [2] (p. 95) of the physicist Richard Feynman, who was 
commenting on some educational attitudes he had met while on a visit to Greece 
in 1980: the italics are ours. 


They were very upset when I said the development of the greatest importance to mathematics in
Europe was the discovery by Tartaglia that you can solve a cubic equation: although it is of little 
use in itself, the discovery must have been psychologically wonderful because it showed that a 
modern man could do something no ancient Greek could do. It therefore helped in the 
Renaissance, which was freeing man from the intimidation of the ancients. What the Greeks are 
learning in school is to be intimidated into thinking they have fallen so far below their ancestors. 


Tartaglia’s method dates from 1530, more than 1000 years after the ancient 
Greeks. There is a lot of interesting history connected with all this, but that is not 
our concern here, except to recommend the general topic as a good one for 
students in higher education. 

The stimulus for the present work arose from the need to discuss with Honours 
Mathematics students the point raised on p. 330 of Griffiths-Howson [3]. There, 
two examination questions aimed at 18 year-olds are printed, on cubic equations; 
the first was set (as a “Higher Certificate” question) in 1910, the second in 1972, 
and they are given as examples of resistance to change in the examination system 
—but the authors were not advocating the complete disappearance of this type of 
work! 

One change is displayed in the questions: the second is more structured and 
depends on the explicit use of symmetric functions of the roots of the cubic. The 
symmetry can be used by the examinees as a check on their manipulative accuracy. 
But the first, older, question makes one wonder how the Examiner found the 
numbers in it so that it would work at all. 

The question was: Prove that, if $\alpha, \beta, \gamma$ are the three roots of the equation
$x^3 - 21x + 35 = 0$ then $\alpha^2 + 2\alpha - 14$ will be equal to $\beta$ or $\gamma$.

Our interest here is the question: How did the Examiner find the coefficients
−21, 35, 1, 2, −14? Also, what other coefficients are possible? The obvious approach
to solving the examination question is to write X for $\alpha^2 + 2\alpha - 14$ and then show
that

$$(X - \beta)(X - \gamma) = 0 \tag{1.1}$$
using a knowledge of such facts as $\alpha + \beta + \gamma = 0$.
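Before attempting the algebra, one can at least confirm the examination question numerically. The sketch below is an illustration only: it computes the three real roots of $x^3 - 21x + 35$ with the standard trigonometric formula (our choice of method, valid when $4p^3 > 27q^2$) and records which root the quadratic $x^2 + 2x - 14$ sends each one to.

```python
import math


def real_roots(p, q):
    """Roots of x^3 - p*x + q = 0, assuming three distinct real roots."""
    amplitude = 2 * math.sqrt(p / 3)
    theta = math.acos((-q / 2) / (p / 3) ** 1.5)
    return [amplitude * math.cos((theta - 2 * math.pi * k) / 3)
            for k in range(3)]


def quadratic(x, a=1, b=2, c=-14):
    """The Examiner's quadratic a*x^2 + b*x + c."""
    return a * x * x + b * x + c


def images(roots, tol=1e-9):
    """For each root, the index of the root its image equals (None if no match)."""
    result = []
    for x in roots:
        y = quadratic(x)
        match = [j for j, r in enumerate(roots) if abs(y - r) < tol]
        result.append(match[0] if match else None)
    return result


roots = real_roots(21, 35)
mapping = images(roots)
```

In floating point, every root is sent to a different root, and the three images are distinct, so in this example the quadratic permutes the roots cyclically.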

There seems to be no way of checking possible arithmetic errors, no “‘concep- 
tual” way through the question. Great manipulative ability and accuracy would be 
needed by the examinees; they would be unable to detect a printing error in the 
question. 

We can, however, use the same approach in the general case, with a view to 
designing the original question. [Is it the most likely one a person would use in 
1910?] Thus we start with the cubic polynomial 


$$C(x) = x^3 - px + q, \tag{1.2}$$
and then ask for which quintuples (p, q, a, b, c) we have a quadratic
$$Q(x) = ax^2 + bx + c \tag{1.3}$$


such that, for any root $\alpha$ of C, $Q(\alpha)$ is one of the other roots of C. It turns out
that, to have a sensible problem we need the quintuple (p, q, a, b, c) to be real,
with

$$p \ne 0 \ne a, \qquad \Delta = 4p^3 - 27q^2 > 0 \tag{1.4}$$


so that the three roots $\alpha, \beta, \gamma$ are all distinct and real. ($\Delta$ is called the discriminant
of the cubic.) We then call (C, Q) a related pair, and show:


Theorem 1.1. All related pairs (C, Q) can be obtained by the following process.
Choose real non-zero numbers $a, \sigma$. Then take $b = (\sigma - 1)/2$, $c = (s - \sigma^2)/2a$ with
s = −1 or −3; and then


$$p = \frac{(s - \sigma^2)(s - 3\sigma^2)}{4a^2(1 + \sigma^2)}, \qquad q = \frac{\sigma(s - \sigma^2)(2 + s - \sigma^2)}{4a^3(1 + \sigma^2)}. \tag{1.5}$$
It turns out that if s = −3, the correspondence $Q \to C$ is a bijection, and Q
permutes the roots of C; but if s = −1 and (C, Q) is a related pair, then Q does
not permute the roots, and if $a\sigma = \pm 1$ two Qs may correspond to the same C.

From (1.5), if $a, \sigma$ lie in the field $\mathbb{Q}$ of rational numbers, the quintuple
(p, q, a, b, c) is rational. Moreover, if $a^2 = 1$ and $\sigma$ is an odd integer, then
p, q, a, b, c are all integers, so we can generate an infinity of examination questions
like the one above (which itself corresponds to taking s = −3, $\sigma = 5$, a = 1).
Somewhat miraculously, if (1.5) is used to evaluate $\Delta$ in (1.4), an algebraic
manipulative program shows that the only real values of s, for which $\Delta$ is an
algebraic perfect square, are s = −1 and s = −3. Hence if p, q are integers, so is
$\sqrt{\Delta}$. Thus we are able to give infinite families of solutions to the Diophantine
problem of finding integers p, q for which $\Delta$ is a squared integer. More precisely,
we show:


Theorem 1.2. Let s = −1. Then (1.5) gives integer values for p, q, c only when
$a = \pm 1$ and $b \in \mathbb{Z}$. When s = −3, let $\overline{W}$ denote the multiplicative closure of the set
$$W = \{\text{prime numbers of the form } 6n + 1,\ (n \in \mathbb{Z})\}. \tag{1.6}$$
Then (1.5) gives integer values for p, q, c only when $b \in \mathbb{Z}$, and either $|a| = 1$,
$|a| = 3$, or $|a| \in \overline{W} \cup 3\overline{W}$.


2. ELIMINATING THE ROOTS. With C(x), Q(x) as above, we now go through 
the same process as the examinees, and find (as with (1.1)) when (p, q, a, b, c) are 
such that 


$$F_\alpha = (k - \beta)(k - \gamma) = 0, \qquad k = Q(\alpha). \tag{2.1}$$


Assume, for the moment, that $\alpha \ne 0$. The blanket assumption that $\Delta$ in (1.4) is
strictly positive will ensure that $\alpha, \beta, \gamma$ are real and distinct (see, for example
Birkhoff & Maclane [1] p. 432, Theorem 20). 

If now we multiply out (2.1), we have 


$$F_\alpha = k^2 + \alpha k - q/\alpha \tag{2.2}$$


since the sum and the product of the roots of C(x) are 0 and −q respectively.
Since $\alpha^3 = p\alpha - q$, we can express $F_\alpha$ as a quadratic $F_\alpha = L\alpha^2 + M\alpha + N$, where


$$L = a^2p + 2ac + b^2 + b + 1, \qquad M = ap(2b + 1) - a^2q + (2b + 1)c,$$
$$N = -p - (2b + 1)aq + c^2.$$


The equations $F_\alpha = F_\beta = F_\gamma = 0$ yield 3 linear equations in L, M, N with non-zero
coefficient determinant, since we are assuming $\alpha, \beta, \gamma$ distinct. Therefore
L = M = N = 0.

If, however $\alpha = 0$, then q = 0 and by (1.2) $\beta = -\gamma = \pm\sqrt{p} \ne 0$. So with
$a = 1/\sqrt{p}$, b = 0, $c = -\sqrt{p}$ we have $Q(0) = -\sqrt{p}$ which is one of the two
remaining roots, while $Q(\beta) = Q(\gamma) = 0$, thus ensuring that the required
conditions are satisfied.

To summarise: if we assume (1.4), then “all” our Examiner has to do, in order 
to produce the examination question, is to find a quintuple (p,q,a,b,c) that 
satisfies the equations L = M = N = 0; for then, $F_\alpha$ in (2.1) will be zero (and only
such quintuples will work). Of course, the Examiner would hope to find a 
quintuple of integers, but we shall deal with that point later. 

If for brevity we write $\sigma = 2b + 1$ then the equations L = M = N = 0 become
respectively:


$$\text{(i)}\ \ 1 + 2ac + (\sigma^2 - 1)/4 = -a^2p, \qquad \text{(ii)}\ \ a\sigma p - a^2q = -c\sigma,$$
$$\text{(iii)}\ \ p + a\sigma q = c^2. \tag{2.3}$$

The Examiner’s quintuple is (21, 35, 1, 2, −14), which satisfies the equations (with
$\sigma = 5$). We also observe that if $\beta = Q(\alpha)$ in (1.3), then


$$\gamma = -(\alpha + Q(\alpha)) = -a\alpha^2 - (b + 1)\alpha - c. \tag{2.4}$$


So (p, q, −a, −(b + 1), −c) must also be a solution (with $\sigma$ changed to $-\sigma$).
Therefore we have a check on our algebra, for we see that equations (2.3) are
indeed invariant under the transformation $(p, q, a, \sigma, c) \to (p, q, -a, -\sigma, -c)$. The
work of section 3 suggests (as is easily checked), that (p, −q, −a, b, −c) is also a
solution.
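These checks are mechanical, so a few lines of exact arithmetic can stand in for the hand verification. The sketch below uses a hypothetical helper `residuals` that returns the three quantities which equations (2.3) require to vanish, and applies it to the Examiner's quintuple and to the two transformed quintuples just mentioned.

```python
from fractions import Fraction


def residuals(p, q, a, b, c):
    """Left side minus right side of (2.3)(i), (ii), (iii), with sigma = 2b + 1."""
    p, q, a, b, c = (Fraction(v) for v in (p, q, a, b, c))
    sigma = 2 * b + 1
    e1 = 1 + 2 * a * c + (sigma * sigma - 1) / 4 + a * a * p
    e2 = a * sigma * p - a * a * q + c * sigma
    e3 = p + a * sigma * q - c * c
    return (e1, e2, e3)


def is_solution(p, q, a, b, c):
    """True if the quintuple satisfies all three equations (2.3)."""
    return all(e == 0 for e in residuals(p, q, a, b, c))


examiner = (21, 35, 1, 2, -14)
p, q, a, b, c = examiner
via_gamma = (p, q, -a, -(b + 1), -c)   # the quadratic (2.4) giving the third root
via_sign = (p, -q, -a, b, -c)          # the symmetry suggested by Section 3
```

All three quintuples pass `is_solution`, whereas changing the sign of $\sigma$ alone, as in (21, 35, 1, −3, −14), does not.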

Before going on to find solutions of the equations (2.3), we observe that since 
C(x) has at least one real root $\alpha$, then $Q(\alpha)$ ($\ne \alpha$) is also a real root, so the third
root must be real. A corollary is therefore that, to have a real quintuple
(p, q, a, b, c), we are forced to assume that $\Delta$ in (1.4) is positive. (If $\Delta = 0$, the
roots are real, but at least two coincide.) Thus $\sqrt{\Delta}$ is also real.

If, moreover, $\sqrt{\Delta}$ is rational, then by a standard result (see Birkhoff & Maclane
[1] Theorem 21, p. 433), $\beta$ and $\gamma$ then lie in the field V obtained by adjoining $\alpha$
and $\sqrt{\Delta}$ to $\mathbb{Q}$. Hence if our examiner had started with a rational cubic C(x), there
always is a suitable rational quadratic Q(x). But, the algebraic theory does not 
explicitly tell us how to calculate a, b, and c. We therefore need the following 
work, which establishes the algebraic theory as a payoff. 


3. INVESTIGATION BY MICROCOMPUTER. So, let us consider, at first, only 
the integer values satisfying the given conditions. Then the most useful starting 
point is equation (2.3)(iii), with $\sigma = 2b + 1$. This equation tells us that
$c^2 = p + (1 + 2b)aq$.

A computer program was written to run through values (< 100) of p, q, a and
b, and to test for c being an integer. Those values which satisfied both this
condition and equations (2.3)(i) and (ii) were then printed out to give the table
below.


p     1     3     7     9    19    21    37    39     7
q     0     1     6     9    30    35    84    91     7
a     1     1     1     1     1     1     1     1     3
b     0     0     1     1     2     2     3     3     4
c    -1    -2    -5    -6   -13   -14   -25   -26   -14
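A search of this kind is easy to recreate. The sketch below is a reconstruction under our own assumptions, not the authors' program: the ranges for a and b (a from 1 to 3, b from 0 to 4) are guesses chosen to match the table, since the text only says that values below 100 were tried.

```python
from math import isqrt


def search(p_max=100, q_max=100, a_values=(1, 2, 3), b_values=range(5)):
    """Integer quintuples (p, q, a, b, c) satisfying all of equations (2.3)."""
    found = []
    for a in a_values:
        for b in b_values:
            sigma = 2 * b + 1
            for p in range(p_max):
                for q in range(q_max):
                    # (2.3)(iii): c^2 = p + (1 + 2b) a q must be a perfect square.
                    c_squared = p + sigma * a * q
                    root = isqrt(c_squared)
                    if root * root != c_squared:
                        continue
                    for c in {root, -root}:
                        # (2.3)(i), multiplied through by 4 to stay in integers.
                        eq_i = 4 + 8 * a * c + sigma * sigma - 1 == -4 * a * a * p
                        # (2.3)(ii).
                        eq_ii = a * sigma * p - a * a * q == -c * sigma
                        if eq_i and eq_ii:
                            found.append((p, q, a, b, c))
    return sorted(found)
```

With these ranges the search returns exactly the nine columns of the table, including the "rogue" entry with a = 3.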


There is one rogue result here where a = 3, but in all the other cases a = 1. 
The cases where a = 1 appear in pairs where, in each pair, the p-values differ by 2 
and the b-values are the same. 

Suppose we now look at the cases where a = 1 and consider the first p of each 
pair. The method of differences suggests the following results. 


p                  q                       a    b    c
1 + 3n + 3n^2      n + 3n^2 + 2n^3         1    n    -(1 + 2n + 2n^2)     (3.1)
3 + 3n + 3n^2      1 + 3n + 3n^2 + 2n^3    1    n    -(2 + 2n + 2n^2)     (3.2)


These values satisfy equations (2.3), even if we replace n by any real number b. 
To summarise we have the following lemma, using the notation of Section 1. It 
concerns a cubic C(x) and a quadratic Q(x) of the forms (1.2), and (1.3). 


Lemma 3.1. Suppose that in (1.3), the quadratic Q(x) has a = 1, and b is any
specified real number. Then there are two possible values of c, for each of which we
can find a cubic C(x), such that (C, Q) is a related pair.


When $a \ne 1$, a computer search was made, similar to that described at the
beginning of the section, with a = 1/2, 1/3, 1/4, 1/5, 1/6. It suggested that if a
and b are any real numbers with $a \ne 0$,


$$p = 3(1 + b + b^2)/a^2, \qquad q = (1 + 3b + 3b^2 + 2b^3)/a^3,$$
$$c = -2(1 + b + b^2)/a,$$


will satisfy the equations (2.3). 

These solutions correspond to those given in (3.2), and we shall refer to them as
Type 2 solutions. Their form suggests that we might introduce similar denominators
in the sequence of solutions given in (3.1). We call these Type 1 solutions. In
both cases it is easily verified that these are solutions to the equations (2.3), and
we can summarise these findings in the following table.


      Type 1                      Type 2
p     (1 + 3b + 3b^2)/a^2         3(1 + b + b^2)/a^2
q     (b + 3b^2 + 2b^3)/a^3       (1 + 3b + 3b^2 + 2b^3)/a^3          (3.3)
c     -(1 + 2b + 2b^2)/a          -2(1 + b + b^2)/a
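The "easy verification" can be delegated to exact rational arithmetic. The following sketch checks both columns of (3.3) against equations (2.3) for a handful of rational values of a and b. It also checks a by-product that is our own calculation rather than a statement in the paper: for these families the discriminant $4p^3 - 27q^2$ comes out as the square of $(3b + 1)(3b + 2)/a^3$ for Type 1 and of $9(1 + b + b^2)/a^3$ for Type 2.

```python
from fractions import Fraction


def type1(a, b):
    """(p, q, c) from the Type 1 column of (3.3)."""
    a, b = Fraction(a), Fraction(b)
    return ((1 + 3 * b + 3 * b ** 2) / a ** 2,
            (b + 3 * b ** 2 + 2 * b ** 3) / a ** 3,
            -(1 + 2 * b + 2 * b ** 2) / a)


def type2(a, b):
    """(p, q, c) from the Type 2 column of (3.3)."""
    a, b = Fraction(a), Fraction(b)
    return (3 * (1 + b + b ** 2) / a ** 2,
            (1 + 3 * b + 3 * b ** 2 + 2 * b ** 3) / a ** 3,
            -2 * (1 + b + b ** 2) / a)


def satisfies_2_3(p, q, a, b, c):
    """True if (p, q, a, b, c) satisfies equations (2.3)(i)-(iii) exactly."""
    a, b = Fraction(a), Fraction(b)
    sigma = 2 * b + 1
    return (1 + 2 * a * c + (sigma ** 2 - 1) / 4 == -a * a * p
            and a * sigma * p - a * a * q == -c * sigma
            and p + a * sigma * q == c * c)


def discriminant(p, q):
    """The discriminant 4p^3 - 27q^2 of x^3 - p*x + q."""
    return 4 * p ** 3 - 27 * q ** 2


A_VALUES = [Fraction(1), Fraction(-2), Fraction(1, 3), Fraction(5, 7)]
B_VALUES = [Fraction(0), Fraction(1), Fraction(-3), Fraction(2, 5)]
```

For instance, `type2(1, 2)` gives (21, 35, −14), the Examiner's values, with discriminant $63^2$.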


Hence if (p, q, a, b, c) is a solution, then so is (p, −q, −a, b, −c). Using the
remark following (2.4) we may summarise:


Lemma 3.2. For any real number b, and any real non-zero a, there are two
quintuples (p, q, a, b, c) satisfying the equations (2.3). Further solutions are then
(p, q, −a, −(b + 1), −c) and (p, −q, −a, b, −c). Given b, a, the Examiner has
several related pairs (C, Q) available.


As we shall see, the two types of solutions in (3.3) are all solutions of equations 
(2.3). 

Next, by investigating Type 1 solutions when a = 1, it becomes apparent that 
the equation C(x) = 0 has integer roots, namely b,b + 1 and —(2b + 1). 


Observation 3.3. In (1.3), Q need not permute the roots.


In the case under discussion Q(b + 1) = b = Q(−(2b + 1)) showing that Q
does not permute the roots. So, for any related pair (C, Q), $Q(\alpha) = Q(\beta)$ iff b/a
is the third root of C(x); and then it is easily verified that C(b/a) is always 0 or
$1/a^3$ according as the solutions are of Type 1 or Type 2. Thus with Type 2, Q
permutes the roots of C, but not with Type 1. An interesting sidelight in the former 
case: Q has an orbit of period 3, so it is an explicitly constructed quadratic which, 
by Li and Yorke [6], has iterations which exhibit chaos. 


4. SOLVING EQUATIONS (2.3). The previous investigation starts from a and b, 
then finds p, q and c. But suppose we had found any real solution (p, q, a, b, c) of
equations (2.3). Then, by (ii) and (iii), p and q would have to satisfy

$$p = \frac{c(ac - \sigma^2)}{a(1 + \sigma^2)}, \qquad q = \frac{\sigma c(1 + ac)}{a^2(1 + \sigma^2)}, \tag{4.1}$$

and then substitution in (2.3)(i) requires (after simplification)
$$(2ac + t)^2 = 1, \qquad t = \sigma^2 + 2. \tag{4.2}$$
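The simplification leading to (4.2) can be filled in as follows. This is our reconstruction from (2.3) and (4.1), not the authors' own working. Substituting p from (4.1) into (2.3)(i) and multiplying through by $4(1 + \sigma^2)$ gives the first line; collecting terms and completing the square gives the rest.

```latex
\begin{aligned}
(3 + \sigma^2 + 8ac)(1 + \sigma^2) &= -4ac\,(ac - \sigma^2), \\
4a^2c^2 + 4ac(\sigma^2 + 2) + \sigma^4 + 4\sigma^2 + 3 &= 0, \\
(2ac + \sigma^2 + 2)^2 &= (\sigma^2 + 2)^2 - (\sigma^4 + 4\sigma^2 + 3) = 1.
\end{aligned}
```

Hence $2ac + \sigma^2 + 2 = \pm 1$, that is, $2ac + \sigma^2$ equals −1 or −3.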


Conversely, we can compute p,q from (4.1) for any choice of (a,c,a) with 
a # 0; and (2.3)(ii), Gii) will be satisfied. If also (a, c, 7) satisfies (4.2) then we have 
obtained a quintuple (p, q, a, a, c) satisfying all three equations (2.3). 

Observe that (4.2) tells us that the only relevant quadratics Q(x) in (1.3) are
those for which (a, b, c) lies on one or other of the pair $S^+$, $S^-$ of quadric surfaces
in $\mathbb{R}^3$ with equations $2ac + (2b + 1)^2 = s$, where s = -1 for $S^+$ and s = -3 for
$S^-$. For our Examiner, s = -3, but we should consider also the case s = -1.

Starting with s, a, σ in (4.2), c has to be $(s - \sigma^2)/2a$, and then by substitution
in (4.1) we would have the equations (1.5) above. Now, for each real s, the
equations (1.5) define a mapping

$$f_s: (a, \sigma) \mapsto (p, q)$$

of the (a, σ) plane A to the (p, q) plane. Our previous investigation therefore
required us to work out the image of $f_s$ when s = -3 and -1.

First, what could the boundary of this image be? The family of all cubic
polynomials of the form (1.2) separates most basically into those with 3 real roots,
and the rest. We model the former set by the subset D of the (p, q)-plane which is
defined using the discriminant in (1.4):

$$D = \{(p, q) \in \mathbb{R}^2 : \Delta = 4p^3 - 27q^2 > 0\} \tag{4.3}$$

with boundary the semi-cubical parabola

$$K: q = \pm 2(p/3)^{3/2}. \tag{4.4}$$

Here, D lies in the right hand half (p > 0) of the (p, q)-plane, as shown in
FIGURE 1.


Figure 1


Now in (1.5), p > 0 since s and $-\sigma^2$ are negative, so we are led to guess and
then prove the following:

Proposition 4.1. If s < 0 and (p, q) ∈ D, then (p, q) lies in the image of $f_s$.


Proof: Given (p, q) ∈ D, we must find a point (a, σ) with $f_s(a, \sigma) = (p, q)$.
If q = 0, then $(s/2\sqrt{p}, 0)$ will do. If q ≠ 0, let q/p = m ≠ 0; then by (1.5), (a, σ)
satisfies

$$m = \sigma(2 + s - \tau)/a(s - 3\tau) \qquad (\tau = \sigma^2). \tag{4.5}$$

We solve this for a and substitute in the first equation of (1.5) to find some τ > 0
satisfying

$$4p\tau(1 + \tau)(2 + s - \tau)^2 = m^2(s - \tau)(s - 3\tau)^3, \tag{4.6}$$

with m given. The difference between the left and right sides of (4.6) is $-m^2s^4 < 0$
(since ms ≠ 0) at τ = 0, and approximately $(4p - 27m^2)\tau^4 = \tau^4\Delta/p^2 > 0$ when τ is
large. Therefore by the Intermediate Value Theorem, the required τ > 0 exists. ∎
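
The proof is effectively an algorithm, and a numerical sketch may make it concrete. The Python below is our illustration only. It reads (4.1) as $p = c(ac - \sigma^2)/(a(1 + \sigma^2))$, $q = \sigma c(1 + ac)/(a^2(1 + \sigma^2))$, takes $f_s$ to be the result of substituting $c = (s - \sigma^2)/2a$ there (as the text describes for (1.5)), and locates a root τ of (4.6) by bisection rather than by any method the authors specify. The names `f_s` and `preimage` are ours.

```python
import math

def f_s(s, a, sigma):
    """The map f_s, obtained by putting c = (s - sigma^2)/(2a) into (4.1)."""
    tau = sigma * sigma
    c = (s - tau) / (2 * a)
    p = c * (a * c - tau) / (a * (1 + tau))
    q = sigma * c * (1 + a * c) / (a * a * (1 + tau))
    return p, q

def preimage(s, p, q, iterations=200):
    """Find (a, sigma) with f_s(a, sigma) = (p, q), for s < 0 and 4p^3 - 27q^2 > 0."""
    if q == 0:
        return s / (2 * math.sqrt(p)), 0.0
    m = q / p

    def gap(tau):
        # Left side minus right side of (4.6).
        return (4 * p * tau * (1 + tau) * (2 + s - tau) ** 2
                - m * m * (s - tau) * (s - 3 * tau) ** 3)

    lo, hi = 0.0, 1.0
    while gap(hi) <= 0:        # gap(0) < 0, and gap is eventually positive on D
        hi *= 2
    for _ in range(iterations):  # bisection keeps gap(lo) <= 0 < gap(hi)
        mid = (lo + hi) / 2
        if gap(mid) <= 0:
            lo = mid
        else:
            hi = mid
    tau = (lo + hi) / 2
    sigma = math.sqrt(tau)
    a = sigma * (2 + s - tau) / (m * (s - 3 * tau))   # (4.5) solved for a
    return a, sigma
```

With s = -3 and the Examiner's (p, q) = (21, 35), the root of (4.6) is τ = 25, so the sketch should return values close to a = 1, σ = 5.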


We now confine our investigation of (4.6) to the relevant cases s = -1 and the
Examiner's s = -3.

Some facts about $f_{-3}$, $f_{-1}$ are these, and are left as an exercise for the reader.
$f_{-3}$ is an injection (one-to-one) if we restrict its domain to the region $A_+$: a > 0 of
A. For each $a_0 > 0$, $f_{-3}$ maps the line $L_{a_0}$: $a = a_0$ to the curve $\Gamma_{a_0}$ given with
parameter σ by

$$p = 3(3 + \sigma^2)/4a_0^2, \qquad q = -\sigma(3 + \sigma^2)/4a_0^3,$$

so

$$dq/dp = \dot{q}/\dot{p} = -(1 + \sigma^2)/2a_0\sigma$$

except when σ = 0, when the tangent is vertical. The curves $\Gamma_{a_0}$ are as shown in
FIGURE 2.


Figure 2 


Thus $f_{-3}: A_+ \to D$ is a bijection, and a diffeomorphism if σ > 0. Following
(3.3) we noted that if (p, q, a, b, c) is a solution there, then so is (p, -q, -a, b, -c);
hence $f_{-3}$ is also an injection at points of A with a < 0. Thus, in Theorem 1.1,
the correspondence Q → C is a bijection (when s = -3).

If again we confine the mapping $f_{-1}$ to the half-plane $A_+$ of the (a, σ)-plane,
we see that $f_{-1}: A_+ \to D$ is onto, but it is no longer an injection; for $f_{-1}$ maps the
line $L_{a_0}$ to the curve $\Gamma_{a_0}$:

$$p = (1 + 3\sigma^2)/4a_0^2, \qquad q = \sigma(1 - \sigma^2)/4a_0^3, \qquad dq/dp = (1 - 3\sigma^2)/6a_0\sigma.$$

Thus $f_{-1}$ maps as shown in FIGURE 2. Therefore in Theorem 1.1, if s = -1, then
at most two quadratics Q can be paired with the same cubic C.


5. ALGEBRAIC FORMULAE: THE CASE s = -3. Here (4.6) reduces through
cancellation to a linear equation in τ, yielding $\tau = 81q^2/\Delta$. The formulae (1.5)
reduce to

$$p = 3(3 + \tau)/4a^2, \qquad q = \sigma(3 + \tau)/4a^3, \qquad \text{with } \Delta = 81(3 + \tau)^2/16a^6. \tag{5.1}$$

Thus we choose a, σ arbitrarily, then take $c = -(3 + \tau)/2a$ to compute p, q
from (5.1) and obtain the required quintuple (p, q, a, b, c) with b = (σ - 1)/2. It
is easily verified that these are exactly the “solutions of Type 2” in (3.3). Choosing
the sign of the root relative to that of a, since 3 + τ > 0, we then have

$$a = 3p/\sqrt{\Delta}, \qquad \sigma = 9q/\sqrt{\Delta}, \qquad c = -2p^2/\sqrt{\Delta} \tag{5.2}$$

which confirms the statement made in Section 2 that if p, q, $\sqrt{\Delta}$ are all rational, so
are a, b and c.
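
As a small illustration of (5.2), the following Python sketch recovers the quadratic's coefficients from integers p, q whose discriminant is a perfect square. It reads (5.2) as $a = 3p/\sqrt{\Delta}$, $\sigma = 9q/\sqrt{\Delta}$, $c = -2p^2/\sqrt{\Delta}$ together with b = (σ - 1)/2; the function name is ours, and the positive square root is taken throughout.

```python
from fractions import Fraction
from math import isqrt

def quadratic_from_cubic(p, q):
    """Apply (5.2) to integers p, q with 4p^3 - 27q^2 a positive perfect square.

    Returns (a, b, c) as exact fractions.
    """
    delta = 4 * p**3 - 27 * q**2
    root = isqrt(delta) if delta > 0 else 0
    if root == 0 or root * root != delta:
        raise ValueError("discriminant is not a positive perfect square")
    a = Fraction(3 * p, root)
    sigma = Fraction(9 * q, root)
    c = Fraction(-2 * p * p, root)
    b = (sigma - 1) / 2          # because sigma = 2b + 1
    return a, b, c
```

For the Examiner's p = 21, q = 35 this gives $\sqrt{\Delta} = 63$ and (a, b, c) = (1, 2, -14).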


Integer solutions when s = -3. Now, our Examiner found integers as coefficients,
and since his p = 21, q = 35, then $\sqrt{\Delta} = 63$, another integer. Certainly, if we
require p, q, a, c ∈ Z, then (5.1) also requires $\sqrt{\Delta}$ to be rational. The problem
then arises of finding solutions of this type. In particular, for which integral p, q is
$\sqrt{\Delta}$ in Z? We therefore have now a Diophantine problem.

Recall that $\tau = \sigma^2$ and σ = 2b + 1. If $B = b^2 + b + 1$ then from (5.1), (5.2) we
obtain:

$$p = 3B/a^2, \qquad q = (2b + 1)B/a^3, \qquad c = -2B/a, \qquad \sqrt{\Delta} = 9B/a^3. \tag{5.3}$$

Therefore, if a = 1, we have an infinite sequence of solutions in Z as b runs
through Z:

$$(p, q, a, b, c) = (3B, (2b + 1)B, 1, b, -2B), \qquad \sqrt{\Delta} = 9B \tag{5.4}$$

and, when b = 2, this gives the Examiner’s solution. In Section 7 other solutions
are discussed.


6. THE CASE s = -1. It is technically more difficult to have s = -1 because
equation (4.6) becomes a cubic. The parametric formulae (1.5) now become, when
$\tau = (2b + 1)^2$ as before,

$$p = \frac{1 + 3\tau}{4a^2} = \frac{1 + 3b + 3b^2}{a^2}, \qquad q = \frac{\sigma(\tau - 1)}{4a^3} = \frac{(b + b^2)(2b + 1)}{a^3}, \qquad c = -\frac{1 + \tau}{2a} = -\frac{1 + 2b + 2b^2}{a}. \tag{6.1}$$

It is easily checked that these are the “solutions of Type 1” in (3.3). For $\sqrt{\Delta}$ we
obtain

$$\sqrt{\Delta} = (9\tau - 1)/4a^3 = (9b^2 + 9b + 2)/a^3. \tag{6.2}$$

Therefore, if a = ±1 and b runs through Z, we obtain a further family of
solutions to the Diophantine problem of finding (p, q) in Z for which Δ is a
square. In Lemma 7.1 below we show that there are no Type 1 integral solutions if
|a| > 1.

How accidental is it that Δ is a perfect square when s = -1, -3? Adrian
Oldknow gave us an explanation which we summarise as follows. Using an
algebraic manipulation package, he substituted p, q from (1.5) in $\Delta = 4p^3 - 27q^2$
to express Δ in the form $X^2 \cdot (u(s)(1 + x)^{-1} + v(x, s))$, where $v(x, s) = 81x^2 + 9x \cdot g(s) + h(s)$,
and u(s), g(s), h(s) are polynomials in s of degree 4, 2, 3 respectively.
Because of the denominator 1 + x, Δ cannot be a perfect square unless
u(s) = 0, which occurs for real s only when s = -1, -3. In those cases it turns
out that

$$v(x, -1) = (9x - 1)^2, \qquad v(x, -3) = 81(x + 1)^2,$$

and Δ is then a perfect algebraic square.


7. INTEGER SOLUTIONS AND NUMBER THEORY. The approach using a
micro yielded the patterns arising out of this problem, and so we consider Types 1
and 2 (defined in (3.3)) separately, and look for values of a which give integer
solutions. It is clear from (5.3) that if $B = b^2 + b + 1$, such solutions will exist
whenever $a^3 \mid B$. The following sequence of lemmas leads to a proof of Theorem 1.2
above.


Lemma 7.1. For Type 1 equations there are integer solutions iff |a| = 1.

Proof: If p and c are integers, a divides the numerators of both p and c in (6.1)
and (6.2). But if the integer a divides both $1 + 3(b + b^2)$ and $1 + 2(b + b^2)$, then
it divides $3(1 + 2(b + b^2)) - 2(1 + 3(b + b^2)) = 1$, so a = ±1. ∎

Moving to Type 2 integer solutions, we quote Corollary 25.1 of Grosswald,
Chapter 7 [4]. Here, the discriminant of a quadratic polynomial $f(x) = ax^2 + bx + c$
is $b^2 - 4ac$, and we shall take f(x) to be $B(x) = x^2 + x + 1$ with discriminant -3.


Lemma 7.2. If D is the discriminant of the polynomial f(x), and if P is a prime such
that $P \nmid D$, then all congruences

$$f(x) \equiv 0 \pmod{P^r} \qquad (r \in \mathbb{N})$$

have the same number of solutions, which is in particular the number of solutions of
f(x) ≡ 0 (mod P). ∎

The prime 3 has an anomalous role in the sequel precisely because 3|D in our
case.

Now recall the set W in (1.6) and its multiplicative closure $\overline{W}$.


Lemma 7.3. For any P ∈ W, and r ∈ N, there exist Type 2 integer solutions for any
a of the form $P^r$.


Proof: Since B = B(b) in (5.3) is odd, it cannot have 2 as a factor. Let P ∈ W.
Then, working mod P, we have, since P is odd,

$$1 + b + b^2 \equiv 0 \iff 4 + 4b + 4b^2 \equiv 0 \iff (1 + 2b)^2 \equiv -3.$$

This means that

$$P \mid B \iff -3 \text{ is a square (mod } P), \tag{7.1}$$

which happens whenever P is of the form 6n + 1 (by Hardy and Wright [5],
Th. 96), that is whenever P ∈ W. Thus ∀P ∈ W, ∃b ∈ Z such that P|B(b). But
the quadratic form B(b) has discriminant -3, to which P is prime. Therefore, by
Lemma 7.2, ∀r ∈ N, ∃b′ ∈ Z such that $P^{3r} \mid B(b')$. Hence (5.3) gives integer
solutions when (a, b) = ($P^r$, b′). ∎
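
Lemma 7.2 asserts that a root of B modulo P persists modulo every power of P. One standard way to compute such a root is Hensel lifting by Newton steps; this is our illustration, not a method described by the authors, and the function names are ours. The sketch assumes P is a prime of the form 6n + 1 and needs Python 3.8 or later for the modular inverse `pow(x, -1, m)`.

```python
def B(b):
    return b * b + b + 1

def lift_root(P, exponent):
    """Return b with P**exponent dividing B(b), for a prime P = 6n + 1."""
    b = next(x for x in range(P) if B(x) % P == 0)   # smallest root modulo P
    modulus = P
    for _ in range(exponent - 1):
        modulus *= P
        # Newton step. B'(b) = 2b + 1 is invertible mod P because
        # (2b + 1)^2 = -3 (mod P) and P does not divide 3.
        b = (b - B(b) * pow(2 * b + 1, -1, modulus)) % modulus
    return b
```

Taking `exponent = 3` gives a b with $P^3 \mid B(b)$, that is, a Type 2 integer solution with a = P. Since B(-1 - b) = B(b), the second root modulo $P^3$ is $P^3 - 1 - b$.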


We leave the short proofs of the next two lemmas as an exercise for the reader. 


Lemma 7.4. Let B = B(b) and let y be an odd integer prime to 3. If $y^2 \mid B$ and
$y^3 \mid (2b + 1)B$ then $y^3 \mid B$. ∎


Lemma 7.5. $3 \mid B(b) \iff b = 3k + 1$ for some k ∈ Z, in which case $9 \nmid B(b)$. ∎


Lemma 7.6. If we have a known solution of either type for a given a, then we can 
find an infinite sequence of integer solutions for this a. 


Proof: Already in (5.3) we have an infinite family of solutions for a = ±1. Hence
the Type 1 case is clear from Lemma 7.1. For Type 2 solutions, recalling (5.3) we
wish to know for which other values of a and b we have c, p and q in Z. This will
be true if $a^3 \mid B(b)$, so we first look at the divisors of B(b). If |a| = 3 we first note
that there is an integer solution with b = 4. For then

$$B(4) = 1 + 4 + 4^2 = 21 \quad\text{and}\quad 1 + 2b = 1 + 2 \cdot 4 = 9, \tag{7.2}$$

so 27 divides the numerator of q and thus c, p and q are integers in (5.3). To get
further solutions we note that $B(4 + 9x) = 21 + 81(x + x^2)$ is divisible by 3, while
1 + 2b = 9(1 + 2x) is divisible by 9, so the numerator of q is divisible by 27. Thus
if |a| = 3 there is a solution for b = 4 + 9x for any x ∈ Z, hence giving an
infinite sequence of integer solutions.

If |a| > 3, suppose we have found some integer t such that (a, t) gives a
solution (i.e., $a^3 \mid B(t)$). Then by a method similar to the above case with (a, t) =
(3, 4 + 9x), it is easy to check that $(a, t + a^3x)$ is also a solution for any x ∈ Z. ∎


Lemma 7.7. If $w \in \overline{W}$, then there are Type 2 integer solutions for a = w, and
a = 3w.


Proof: We prove this inductively. Let $w = p_1^{r_1}p_2^{r_2} \cdots p_k^{r_k}$, where the $p_i$ (i =
1, ..., k) are distinct primes. Then from the definition of $\overline{W}$, $p_i \in W$ for i =
1, ..., k.

Let $x = p_1^{r_1} \cdots p_{k-1}^{r_{k-1}}$, $y = p_k^{r_k}$. Then since the $p_i$ are distinct, hcf(x, y) = 1, and
hence $\mathrm{hcf}(x^3, y^3) = 1$. Suppose there are solutions for a = x and a = y. Then
$\exists b_1, b_2 \in \mathbb{Z}$ such that $x^3 \mid B(b_1)$ and $y^3 \mid B(b_2)$. Then by the method of Lemma 7.6,
it follows that

$$\forall u, v \in \mathbb{Z}, \quad x^3 \mid B(b_1 + ux^3), \quad\text{and}\quad y^3 \mid B(b_2 + vy^3).$$

So if we can find u, v such that

$$b_1 + ux^3 = b_2 + vy^3 = b^*, \text{ say}, \tag{7.3}$$

then, since $\mathrm{hcf}(x^3, y^3) = 1$, $x^3y^3 \mid B(b^*)$, thus giving an integer solution for a = xy.
Now there exist integers m and n such that $mx^3 + ny^3 = 1$ so by choosing
$u = (b_2 - b_1)m$ and $v = (b_1 - b_2)n$ it can be easily verified that equation (7.3) is
satisfied, thus ensuring that there is an integer solution of Type 2 for a = xy.

We know by Lemma 7.3 that when k = 1, w yields integer solutions so, by
induction, there is a Type 2 integer solution for any $w \in \overline{W}$.

Now we consider the case when a = 3w, $w \in \overline{W}$. We know from above that
there exists a b such that ∀h ∈ Z, $(w, b + hw^3)$ is a solution; and that ∀t ∈ Z,
(3, 4 + 9t) is a solution, so we need to find integers u, v such that $4 + 9u = b + vw^3$.

Since hcf(9, w) = 1, the same method used for x, y above will show that such
u, v do indeed exist. Hence, for any $w \in \overline{W}$ we can find an integer solution of
Type 2 when a = 3w. ∎
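
The gluing step (7.3) is an instance of the Chinese Remainder Theorem, and can be sketched in a few lines of Python. The function name `combine` is ours; the modular inverse `pow(X, -1, Y)` (Python 3.8 or later) supplies the integer m with $mx^3 + ny^3 = 1$ for some n, and the answer is reduced modulo $(xy)^3$.

```python
def B(b):
    return b * b + b + 1

def combine(b1, x, b2, y):
    """Given x^3 | B(b1), y^3 | B(b2) and hcf(x, y) = 1, return b* as in (7.3).

    The result lies in [0, (xy)^3) and satisfies (xy)^3 | B(b*).
    """
    X, Y = x**3, y**3
    m = pow(X, -1, Y)            # m*X = 1 (mod Y), i.e. m*X + n*Y = 1 for some n
    u = (b2 - b1) * m            # the choice of u made in the proof
    return (b1 + u * X) % (X * Y)
```

For example, the roots 324 (for a = 7) and 1160 (for a = 13) combine to b* = 69267, a solution for a = 91.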

We are now ready for the final step which proves Theorem 1.2. 


Theorem 7.8. All Type 2 solutions (p, q, a, b, c) have either $|a| \in \overline{W}$, or $|a|/3 \in \overline{W}$,
or |a| = 1 or |a| = 3. Moreover, if a is any of these forms, there is a Type 2
solution.


Proof: We prove only the first sentence, as the second is essentially Lemma 7.7.
Thus we shall deduce the form of a, which gives integer solutions for:

$$p = \frac{3B}{a^2}, \qquad q = \frac{B(2b + 1)}{a^3}, \qquad c = \frac{-2B}{a}, \qquad \text{where } B = b^2 + b + 1. \tag{7.4}$$

For brevity we say “by p” instead of “by the formula for p”. Then by p, a is
odd because B is always odd. We may clearly assume |a| > 3.

First suppose $3 \nmid a$. Then, since a is odd, a|B by c; also by p, $a^2 \mid B$. Hence by
Lemma 7.4 we may assert: $a^3 \mid B$. Further, $a^3 \mid 4B = (2b + 1)^2 + 3$, so -3 is a
square (mod $a^3$), and hence, by Lemma 7.2, (mod $P^k$) for every prime power $P^k$
dividing a. Therefore by Hardy and Wright [5] Theorem 96, P ∈ W. Hence since
|a| > 1, $|a| \in \overline{W}$ as required.

Next suppose 3|a, so a = 3α say, with α odd and > 1 since |a| is odd and
> 3. Then $3 \nmid \alpha$, since by p this would imply that 9|B which we know from
Lemma 7.5 to be false. So, repeating the above argument with α replacing a, we
have $\alpha^3 \mid B$ and hence $|\alpha| = |a|/3 \in \overline{W}$, as required. ∎


Finally, we describe a method for finding a suitable b in (7.3), given a in
$\overline{W} \cup 3\overline{W}$. This depends upon the crucial observation (7.1). Choose a, and let x and
y be integers such that

$$x = (a + 1)/2, \qquad y^2 \equiv -3 \pmod a;$$

then if t = x(-1 + y), a direct computation gives

$$B(t) \equiv a\left[-\frac{(a + 3)}{2} - \frac{(a + 1)}{2}\,y\right] \equiv 0 \pmod a,$$

since both (a + 1) and (a + 3) are even, so the interior of the square bracket is an
integer.

Having thus found a t such that a|B(t), we now need a b such that $a^3 \mid B(b)$. It
can be shown that for some $n < a^2$, b = t + na will do. So, for each a, the
computer has only a finite number of integers to run through to find one which
satisfies the required condition. This method also works with composite values of
a, provided the factors of a are all in W. A computer search using this method
yielded the following solutions:

a      3      7      13      19      21      31    [illegible]      91
b      4     18    1036    2819    1390    6287    [missing]     69267


However, it can be seen that the numbers b soon become very large. Now,
$B(b)/a^3 - \lfloor B(b)/a^3 \rfloor$ may be very small in comparison with B(b), so an ordinary
microcomputer will soon give unreliable answers. Thus, although a microcomputer
is an extremely useful tool for generating patterns and prompting ideas, the theory
to back up these ideas is still as necessary as ever.
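
A search of this kind is easy to repeat today. The Python sketch below is a plain brute-force scan, not the authors' program: it tests the integrality of p, q and c in (5.3) directly, and the function names and the `limit` parameter are ours. Python integers have arbitrary precision, so the divisibility tests are exact and the rounding problem described above does not arise.

```python
def B(b):
    return b * b + b + 1

def is_type2_solution(a, b):
    """True when p = 3B/a^2, q = (2b+1)B/a^3 and c = -2B/a of (5.3) are all integers."""
    val = B(b)
    return ((3 * val) % a**2 == 0
            and ((2 * b + 1) * val) % a**3 == 0
            and (2 * val) % a == 0)

def smallest_b(a, limit=10**6):
    """Smallest b >= 0 giving a Type 2 integer solution for this a, or None below limit."""
    for b in range(limit):
        if is_type2_solution(a, b):
            return b
    return None
```

For a = 3, 7, 13, 19, 21 and 31 the smallest such b agrees with the legible entries of the table, and b = 69267 does give a solution for a = 91.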
NOTES


Edited by: John Duncan 


On the Identity of Polyhedra 


Hellmuth Stachel 


Dedicated to Prof. O. Giering, Munich, on the occasion of his 60th birthday 


Let S and S’ be two polyhedral solids in the Euclidean 3-space $E^3$. In this note
it is proved that S and S’ need not coincide though they share all vertices and all
planes through their faces.

Let $V = \{v_1, \dots, v_m\}$ and $V' = \{v'_1, \dots, v'_{m'}\}$ denote the sets of vertices of S and
S’, respectively. $P = \{p_1, \dots, p_n\}$ and $P' = \{p'_1, \dots, p'_{n'}\}$ are supposed to be the
sets of planes containing the faces of S and S’. Figure 1 shows two different
polyhedra S and S’ with equal vertex sets V = V’. This is caused by different 
triangulations of the skew quadrangle v,v0,u0,U;. With the example in FiGuRE 2 we 
obtain that also P = P’ does not imply S = S’. Here S (left) is the intersection and 
S’ (right) the union of two congruent “houses”. The following example (FIGURE 3)
reveals that the two polyhedra can even be different though both the vertex sets 
and the plane sets are equal: 

We start with a regular dodecahedron D. There are five inscribed cubes
$C_1, \dots, C_5$. And each cube $C_i$ contains a left tetrahedron $L_i$ and a right one $R_i$.
The edges of the left tetrahedra $L_1, \dots, L_5$ of D can be defined in the following
way: Start at any vertex $v_0$ of D along any edge $e_1$. At the endpoint $v_1$ take the
left¹ edge $e_2$ and at the next vertex $v_2$ take the right edge $e_3$. Then $v_0$ and the
endpoint $v_3$ of $e_3$ determine an edge of a left tetrahedron. Commutation of
left and right gives the right tetrahedra edges. Let S denote the union of all left
tetrahedra of D (FIGURE 3, left) and S’ be the union of all right tetrahedra


Figure 1.


¹ If seen from outside (compare also [2]).


Figure 2.


Figure 3.


(FIGURE 3, right). Then S and S’ are well known stellated icosahedra (see [1], p. 5,
Ef, and Ef,; plate XVI shows the latter; the former is its reflected image). 

Each two tetrahedra L, and R,, i # j, share exactly one vertex of D. We denote 
this vertex by u;;. This notation has the following advantage: In order to get any 
transformation of the group ¥ of direct symmetries of D take any even permuta- 
tion 7 of elements {1, 2,3, 4,5}. Then the corresponding isometry maps the vertex 
V;; tO Uaciyecjy Lhe points v;; and uv, are opposite vertices of D. 

The faces of L; and R, opposite to v,, belong to the same plane p;,. The face of 
L, in plane p,, is a regular triangle A, with vertices v3, V14, Vj; (FIGURE 4). It can 
be carried into the face P, of R, with vertices v<,, V3, U4. by a rotation about the 
common center. The rotation angle 


$$\varphi = \arccos\frac{3\sqrt{5} - 1}{8} = 44.4775\ldots^\circ$$


is deduced from the fact that the smallest distance between vertices of A, and P, 
equals the edge length of D. 

The tetrahedra L, and R, form a stella octangula. Therefore, L, intersects 
plane p,, along the triangle A, of midpoints of R,. A, is bounded by the traces of 
planes P53, P54, Po. The interior of A, is dotted in Ficure 4. 
Figure 4.


L, intersects p,, in a triangle A, (hatched in Ficure 4). The sides of A, are 
located on the traces of planes p3,, P32, P3;. The plane p,, contains the vertices 
U4, and v,, of P, and therefore also a vertex of A,. The plane p,, passes through 
U3, and v,,. The plane p,, belongs to R,; therefore its trace in p,, connects the 
midpoints of two sides of L,. v3, must be a vertex of A3. 

We get the intersections A,, A, of L,,L, in p,, by rotating A, about 120° and 
240°. The union A := A, U A, U A, is a superset of A,. This is a consequence of 
the fact that all planes p,;,; bound a regular icosahedron polar to D. Whenever five 
vertices belong to the same face of D then there is a subgroup of ¥ of order 5 
preserving this face. Then the corresponding planes meet at a common point on 
the axis of rotation. E.g. the vertices V5, U5, Us4, V43, U3 Permute under the group 
given by the cycle (12543). Hence the planes p,;, ps4, P43, D3, have concurrent 
traces in plane p,,. 

The faces of S=L, UL, UL, UL, U Lg are located in the planes p,,. E.g. 
the face of S in p,, is given by A, \ A (FicureE 5, left). It consists of three 
pentagons (compare [1], Fig. 3., where the regions 5,6,7,9, 10 and 5, 6,7, 9, 10 
give FiGuRE 5, upside down). The plane p,, contains also a face of the union S’ of 
all right tetrahedra (FiGurE 5, right). A reflection that commutes A, and P, in p,, 
interchanges also the faces of S and S’. This reveals that these faces share all those 
vertices that are not vertices of D. On the other hand all vertices of D are vertices
of S and S’ too. This proves P = P’ and V = V’ for S and S’. We summarize as 
follows: 


Lemma. For the identity of two polyhedra in $E^3$ it is not sufficient to require equal
sets of vertices and equal sets of bounding planes.


More on the Pompeiu Problem


David C. Ullrich 


1. INTRODUCTION. A recent Monthly article by H. Lacquer [HL] begins with 
the following question: 


Question. Suppose that f is a continuous function on S and the integral of f over
every α-sector vanishes, for some α ∈ (0, π). Must f be identically zero?


(Here $S \subset \mathbb{R}^3$ is the two-sphere and an α-sector is the region on S bounded by
two great circles meeting at an angle α.)

Theorem 1 of [HL] states that the answer is “yes” for continuously differentiable 
functions f, but the general case is left open; we shall see that the general case 
actually follows from the theorem in [HL], by a “regularization” argument. (In fact 
the hypotheses on f can be weakened still further; see the note at the end of 
section 3.) 

One may regard the result as a sort of “tauberian theorem”. Lacking space to
explain precisely what is meant by that, let us just point out that the answer to the
question changes if other regions are substituted for α-sectors. For example, if we
consider the same question with spherical caps instead of α-sectors then “Ungar’s
Freak Theorem” states that the answer is sometimes yes and sometimes no,
depending on the “radius” of the cap; see [HL] for more discussion. Some insight
into what’s really going on here may be obtained by considering an analogous 
lower-dimensional question:

Easier Question. Suppose f is a continuous 2π-periodic function on the line,
0 < α < 2π, and

$$\int_x^{x+\alpha} f(t)\,dt = 0$$

for all x ∈ R. Does this imply that f is identically zero?

Easy Answer. “Yes” if α/π is irrational, “no” otherwise.


Sketch of Easy Proof. The hypothesis says precisely that the convolution f * χ is
identically zero, where χ is the 2π-periodic function taking the value 1 on the
interval [0, α) and 0 on [α, 2π). Thus the Fourier coefficients $(f * \chi)^\wedge(n)$ must
vanish for all n ∈ Z.

But $(f * \chi)^\wedge(n) = \hat{f}(n)\hat{\chi}(n)$, and one easily calculates that $\hat{\chi}(0) = \alpha/2\pi$ and
$\hat{\chi}(n) = (2\pi i n)^{-1}(1 - e^{-in\alpha})$ for n ≠ 0. Now if α/π is irrational this shows that
$\hat{f}(n) = 0$ for all n ∈ Z, which implies that f vanishes identically. On the other
hand, if α = 2aπ/n (with a and n integers) then $f(t) = e^{int}$ (or f(t) = cos(nt)) gives the required
counterexample.
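
Both halves of the Easy Answer can be seen numerically. The Python sketch below is ours and only illustrative: `chi_hat` evaluates the coefficient formula quoted in the proof, and `window_integral` is a simple midpoint rule standing in for the integral over [x, x + α]. The sample values of α, n and x are arbitrary choices.

```python
import cmath
import math

def chi_hat(n, alpha):
    """Fourier coefficient of the indicator of [0, alpha), as quoted in the proof."""
    if n == 0:
        return alpha / (2 * math.pi)
    return (1 - cmath.exp(-1j * n * alpha)) / (2j * math.pi * n)

def window_integral(f, x, alpha, steps=20000):
    """Midpoint-rule approximation to the integral of f over [x, x + alpha]."""
    h = alpha / steps
    return h * sum(f(x + (k + 0.5) * h) for k in range(steps))

# alpha = 2*a*pi/n with a = 2, n = 5: the coefficient vanishes and cos(5t) is a counterexample.
alpha_rational = 2 * math.pi * 2 / 5
bad_coefficient = abs(chi_hat(5, alpha_rational))
bad_integral = window_integral(lambda t: math.cos(5 * t), 0.3, alpha_rational)

# alpha = 1 (so alpha/pi is irrational): no coefficient with n != 0 vanishes.
smallest = min(abs(chi_hat(n, 1.0)) for n in range(1, 51))
```

Here `bad_coefficient` and `bad_integral` are zero up to rounding, while `smallest` stays well away from zero, which is what forces $\hat{f}(n) = 0$ in the irrational case.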

(One can give a more elementary proof of the above result, without involving
any Fourier analysis; the proof above is precisely analogous to the proof of the
“main computational lemma” in [HL], with the circle $\mathbb{R}/2\pi\mathbb{Z}$ in place of the group
SO(3). It seems possible that [HL, Theorem 1] itself could be proved directly by a
similar argument, but (at least) some details would need to be changed, because
α-sectors are not hemispheres; in particular, there is no natural way to associate a
unique α-sector to each point of the sphere.)


2. INTEGRATION ON COMPACT GROUPS. It seems like a good idea to say just 
a little bit about integration on compact groups before proceeding. The reader 
might find a proof of everything in this section in [FO], Chapters 1, 2, 7, and 10.2. 

If K is a compact topological space the notation C(K) will refer to the space of
all (real-valued) continuous functions on K. Suppose that G is a compact group;
that is, G is simultaneously a group and a compact topological space, in such a way
that the group operations are continuous. A linear functional on C(G) is a linear
map Λ: C(G) → R; we say that Λ is positive if Λf ≥ 0 whenever f ≥ 0 on G. It
turns out that there exists a unique positive linear functional Λ on C(G) satisfying
the following two conditions:

$$\Lambda 1 = 1, \tag{2.1}$$

$$\Lambda g = \Lambda f \quad \text{if } g(x) = f(x_0 x) \text{ for some } x_0 \in G. \tag{2.2}$$

(Here “1” denotes the element of C(G) with 1(x) = 1 for all x ∈ G.) The
existence of a functional Λ satisfying all these properties is not obvious or trivial,
but assuming that such a thing exists the reader may easily derive the following two
inequalities:

$$\text{If } f, g \in C(G) \text{ and } f \le g \text{ then } \Lambda f \le \Lambda g. \tag{2.3}$$

$$\text{If } g = |f| \text{ then } |\Lambda f| \le \Lambda g. \tag{2.4}$$

The value of Λf is usually denoted “$\int_G f(x)\,dx$” and is known as the “integral”
of f with respect to “Haar measure”.

Of course given a compact group G one can often find explicit formulas for
$\int_G f(x)\,dx$. The case of interest below will be G = SO(3), the group of (real) 3 × 3
orthogonal matrices with determinant one. Perhaps the most transparent way to give a
formula for the invariant integral in this case is as follows: For ζ ∈ S and t ∈ R let
R(ζ, t) denote a counter-clockwise rotation through an angle t about the axis ζ. It
is not hard to see that SO(3) = {R(ζ, t): ζ ∈ S, t ∈ [0, 2π)}; in fact if T ∈ SO(3),
T ≠ I, then T = R(ζ, t) (0 < t < 2π) in precisely two ways, and
$$\int_{SO(3)} F(T)\,dT = \frac{1}{4\pi}\int_S \frac{1}{2\pi}\int_0^{2\pi} F(R(\zeta, t))\,dt\,d\sigma(\zeta).$$
(Here “dσ” means “with respect to surface area”.) One may also “represent” the
elements of SO(3) in terms of quaternions, leading to another formula for the
integral; see [ZH] pp. 33-335.


3. THE POMPEIU PROBLEM. We now suppose that f ∈ C(S), α ∈ (0, π), and
the integral of f over any α-sector vanishes. The plan is to find a sequence of
continuously differentiable functions $f_1, f_2, \dots$ with the same properties, and which
also converge uniformly to f. Theorem 1 of [HL] will then show that $f_n = 0$ for
each n, so that f = 0.

To begin, suppose that h = f ∘ T for some T ∈ SO(3). It follows that the
integral of h over any α-sector is zero, because surface area on S is rotation-invariant
and T takes α-sectors to α-sectors: if V is an α-sector then

$$\int_V h\,d\sigma = \int_V f \circ T\,d\sigma = \int_{T(V)} f\,d\sigma = 0. \tag{3.1}$$

(Note that T preserves distances; hence T maps geodesics to geodesics, which is to
say that it maps great circles to great circles. Since T also preserves angles it
follows that T(V) must be an α-sector.)

Now given a function g ∈ C(SO(3)) we define f * g ∈ C(S) by the formula

$$f * g(\zeta) = \int_{SO(3)} f(T^{-1}\zeta)\,g(T)\,dT \qquad (\zeta \in S), \tag{3.2}$$

or simply

$$f * g = \int_{SO(3)} (f \circ T^{-1})\,g(T)\,dT. \tag{3.3}$$

It follows that the integral of f * g over any α-sector is zero. Indeed, if V is an
α-sector then

$$\int_V f * g(\zeta)\,d\sigma(\zeta) = \int_V \left[\int_{SO(3)} f(T^{-1}\zeta)\,g(T)\,dT\right] d\sigma(\zeta) = \int_{SO(3)} g(T)\left[\int_V f(T^{-1}\zeta)\,d\sigma(\zeta)\right] dT = \int_{SO(3)} g(T) \cdot 0\,dT = 0, \tag{3.4}$$

by (3.1). (The technical conditions needed to justify the exchange of the order of
integration are easily verified here; see [FO], Chapter 2.5.)

Now define a metric on SO(3) by setting $d(T_1, T_2) = \sup_{\zeta \in S}\|T_1(\zeta) - T_2(\zeta)\|$
and choose a sequence of continuously differentiable functions $g_n \in C(SO(3))$
(n = 1, 2, ...) such that

$$g_n(T) \ge 0 \qquad (T \in SO(3)), \tag{3.5}$$

$$\int_{SO(3)} g_n(T)\,dT = 1, \tag{3.6}$$

and

$$g_n(T) = 0 \qquad (T \in SO(3),\ d(T, I) \ge 1/n). \tag{3.7}$$

Set $f_n = f * g_n$.

The fact that $g_n$ is continuously differentiable implies that $f_n$ is continuously
differentiable. Formula (3.4) shows that the integral of $f_n$ over any α-sector is
zero, and hence Theorem 1 of [HL] shows that $f_n = 0$; we need only show that
$f_n \to f$ as n → ∞.

So we suppose ε > 0, and we choose N so that |f(ζ) - f(ξ)| < ε whenever
‖ζ - ξ‖ < 1/N (ζ, ξ ∈ S). A few standard tricks show that $|f(\zeta) - f_n(\zeta)| \le \varepsilon$ for
all ζ ∈ S if n > N: First, note that

$$f(\zeta) - f_n(\zeta) = f(\zeta) - \int_{SO(3)} f(T^{-1}\zeta)\,g_n(T)\,dT = \int_{SO(3)} \left(f(\zeta) - f(T^{-1}\zeta)\right) g_n(T)\,dT;$$

here we have used nothing but (3.6) and linearity. It follows from (2.4) that

$$|f(\zeta) - f_n(\zeta)| \le \int_{SO(3)} |f(\zeta) - f(T^{-1}\zeta)|\,g_n(T)\,dT.$$

Now if n > N then $|f(\zeta) - f(T^{-1}\zeta)|\,g_n(T) \le \varepsilon g_n(T)$ for all ζ ∈ S and T ∈ SO(3)
(consider separately the cases d(T, I) < 1/n and d(T, I) ≥ 1/n), so that

$$|f(\zeta) - f_n(\zeta)| \le \varepsilon\int_{SO(3)} g_n(T)\,dT = \varepsilon \qquad (\zeta \in S,\ n > N),$$

by (2.3). Hence $f_n \to f$ uniformly on S, as claimed.


Note. If the reader knows some real analysis (s)he may use essentially the same
argument to show that if f is a Lebesgue-integrable function on the sphere and
the integral of f over every α-sector vanishes then f = 0 almost everywhere: The
fact that C(S) is dense in $L^1(S)$ shows that $f_n \to f$ in $L^1(S)$. (The novice should
perhaps be warned that it is somewhat more difficult to show that $f_n \to f$ almost
everywhere; in fact this is false unless one is a bit more careful in the choice of $g_n$.)


UNSOLVED PROBLEMS
Edited by: Richard Guy 


In this department the MONTHLY presents easily stated unsolved problems dealing 
with notions ordinarily encountered in undergraduate mathematics. Each problem 
should be accompanied by relevant references (if any are known to the author) and by 


a brief description of known partial or related results. Typescripts should be sent to 
Richard Guy, Department of Mathematics & Statistics, The University of Calgary, 
Alberta, Canada T2N 1N4. 


Every Number is Expressible as the Sum 
of How Many Polygonal Numbers? 


Richard K. Guy 


Sums of squares have been extensively studied, and the following three theorems, 
due essentially to Lagrange, Legendre and Fermat, are among the most famous in 
number theory. The first was stated in 1621 by Bachet. 

Every number is expressible as the sum of four squares. 

The numbers expressible as the sum of three squares are just those not of the
form $4^k(8l + 7)$, k, l ≥ 0.

The numbers expressible as the sum of two squares are just those whose “bad” 
prime factors (those of shape 4k — 1) occur to an even power. 

Fermat is perhaps most famous for the fact that the margin of his copy of 
Diophantus wasn’t wide enough to hold a certain proof, but another claim of his, 
that every number is expressible as the sum of k k-gonal numbers, is worth noting. 
He wrote to Pascal on 1654-09-25: 


“What you will find most important there concerns the proposition that every number is composed
of one, two or three triangles; of one, two, three or four squares; of one, two, three, four or five
pentagons; of one, two, three, four, five or six hexagons, and so on to infinity.


To arrive at this, one must prove that every prime number which exceeds a multiple of 4 by unity
is composed of two squares, such as 5, 13, 17, 29, 37, etc.”


He promised to devote an entire book to this subject, but it didn’t appear, and we
had to wait for Cauchy [2] to give the first proof. Meanwhile, for triangular
numbers, there was Gauss’s famous 1796-07-10 diary entry (see [1]):


ΕΥΡΗΚΑ! num = Δ + Δ + Δ,


i.e., every number is expressible as the sum of three triangular numbers.

Squares and triangles are the most familiar members of the family of polygonal
numbers. The r-th k-gonal number is

$$\tfrac{1}{2}r((k - 2)r - (k - 4)).$$

Squares and triangles have the property that we obtain no new numbers when we
throw in those of negative rank, r < 0.

$$k = 4:\ (-r)^2 = r^2 \qquad k = 3:\ \tfrac{1}{2}(-r)(-r + 1) = \tfrac{1}{2}(r - 1)((r - 1) + 1).$$

But when we come to the pentagonal and other k-gonal numbers for k ≥ 5 (and
k ≤ -1) we get a different set if we allow numbers of negative rank.


TABLE OF K-GONAL NUMBERS


  -90  -65  -44  -27  -14   -5    0    1   -2   -9  -20  -35  -54  -77 -104 -135 -170
  -69  -50  -34  -21  -11   -4    0    1   -1   -6  -14  -25  -39  -56  -76  -99 -125
  -48  -35  -24  -15   -8   -3    0    1    0   -3   -8  -15  -24  -35  -48  -63  -80
  -27  -20  -14   -9   -5   -2    0    1    1    0   -2   -5   -9  -14  -20  -27  -35
   -6   -5   -4   -3   -2   -1    0    1    2    3    4    5    6    7    8    9   10
   15   10    6    3    1    0    0    1    3    6   10   15   21   28   36   45   55
   36   25   16    9    4    1    0    1    4    9   16   25   36   49   64   81  100
   57   40   26   15    7    2    0    1    5   12   22   35   51   70   92  117  145
   78   55   36   21   10    3    0    1    6   15   28   45   66   91  120  153  190
   99   70   46   27   13    4    0    1    7   18   34   55   81  112  148  189  235
  120   85   56   33   16    5    0    1    8   21   40   65   96  133  176  225  280


In the table, the values of r and k are the bold row and kolumn (the row of 2-gonal
numbers, whose entries equal r, and the kolumn r = 2, whose entries equal k). The (-k)-gonal
numbers are 1 minus the (4 + k)-gonal numbers: more precisely, the r-th (-k)-gonal
number is 1 minus the (1 - r)-th (4 + k)-gonal number.
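
The table and the reflection rule can be regenerated from the formula for the r-th k-gonal number. The Python sketch below is ours; the ranges r = -6, ..., 10 and k = -2, ..., 8 are read off from the table's entries rather than stated in the text.

```python
def polygonal(k, r):
    """The r-th k-gonal number r((k - 2)r - (k - 4))/2, for any integers k and r."""
    # The product is always even: it equals (k - 2)r(r - 1) + 2r.
    return r * ((k - 2) * r - (k - 4)) // 2

ranks = range(-6, 11)
table = [[polygonal(k, r) for r in ranks] for k in range(-2, 9)]

for row in table:
    print("".join(f"{value:5d}" for value in row))
```

The self-labelling property is the pair of identities `polygonal(2, r) == r` and `polygonal(k, 2) == k`, and the rule for negative k reads `polygonal(-k, r) == 1 - polygonal(4 + k, 1 - r)`.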

Let us jump to the hexagonal numbers. It is well known that every hexagonal
number is triangular:

$$r(2r - 1) = \tfrac{1}{2}(2r - 1)((2r - 1) + 1)$$

but not so well known that the converse is also true:

$$\tfrac{1}{2}r(r + 1) = \left(\frac{r + 1}{2}\right)\left(2\left(\frac{r + 1}{2}\right) - 1\right) \quad\text{or}\quad \left(-\frac{r}{2}\right)\left(2\left(-\frac{r}{2}\right) - 1\right)$$

according as r is odd or even. With this interpretation, it follows from Gauss’s
result that


Every number is the sum of three hexagonal numbers.

But if we restrict ourselves to hexagonal numbers of positive rank and put

$$n = r_1(2r_1 - 1) + r_2(2r_2 - 1) + r_3(2r_3 - 1),$$
$$8n + 3 = (4r_1 - 1)^2 + (4r_2 - 1)^2 + (4r_3 - 1)^2$$

then, although every number 8n + 3 is expressible as the sum of three odd
squares, they will not necessarily be squares of numbers of shape 4r - 1 with
r > 0. Many numbers (what fraction of the whole?) require four hexagonal
numbers of positive rank; several, e.g.,


5, 10, 20, 25, 38, 39, 54, 65, 70, 114, 130, ...,


require five, and 11 and 26 require six. Which numbers require five? Are there any
others, or is every sufficiently large number the sum of four hexagonal numbers of
nonnegative rank? Perhaps every sufficiently large number is the sum of only three
such numbers?
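
Counts of this kind are easy to reproduce for small n by dynamic programming. The Python sketch below is our own illustration, not the method used to obtain the figures quoted; the function name and the bound 200 are arbitrary. It computes, for each n up to a limit, the least number of k-gonal numbers of positive rank with sum n.

```python
def min_summands(limit, k=6):
    """need[n] = least number of k-gonal numbers of positive rank summing to n."""
    pieces = []
    r = 1
    while r * ((k - 2) * r - (k - 4)) // 2 <= limit:
        pieces.append(r * ((k - 2) * r - (k - 4)) // 2)
        r += 1
    need = [0] + [None] * limit
    for n in range(1, limit + 1):
        # 1 is always available (rank 1), so the minimum is over a nonempty set.
        need[n] = 1 + min(need[n - g] for g in pieces if g <= n)
    return need

need = min_summands(200)
five = [n for n in range(1, 201) if need[n] == 5]
six = [n for n in range(1, 201) if need[n] == 6]
```

Up to 200 this reproduces the list above for five summands and the two values 11 and 26 for six; it says nothing, of course, about the open questions for large n.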
For pentagonal numbers, if
$$n = \tfrac{1}{2}r_1(3r_1 - 1) + \tfrac{1}{2}r_2(3r_2 - 1) + \tfrac{1}{2}r_3(3r_3 - 1),$$
$$24n + 3 = (6r_1 - 1)^2 + (6r_2 - 1)^2 + (6r_3 - 1)^2$$


and every such number is expressible as the sum of three odd squares. Moreover, 
either none or all of these squares are multiples of 3. If the latter, then n must 
have been of shape 3m + 1 and, after removal of a factor 9, 8m + 3 is the sum of 
three odd squares. Repeating the process if necessary, we may assume that the 
squares are of shape 6r ± 1. So, if negative ranks are allowed, every positive
integer can be expressed as the sum of three pentagonal numbers. But, restricting 
themselves to pentagonal numbers of positive rank, Richard Blecksmith & John 
Selfridge found six numbers among the first million, namely 


9, 21, 31, 43, 55 and 89, 


which require five summands, and two hundred and four others, the largest of 
which is 33066, which require four. 

They believe that they have found them all. 

Can all sufficiently large numbers be expressed as the sum of three pentagonal 
numbers of nonnegative rank? Equivalently, is every sufficiently large number of 
shape 24n + 3 expressible as the sum of three squares of numbers of shape 
6r — 1? 

For heptagonal numbers, $\tfrac{1}{2}r(5r - 3)$, even if we allow those of negative rank,
the situation is less clear. If

$$n = \tfrac{1}{2}r_1(5r_1 - 3) + \tfrac{1}{2}r_2(5r_2 - 3) + \tfrac{1}{2}r_3(5r_3 - 3),$$

then

$$40n + 27 = (10r_1 - 3)^2 + (10r_2 - 3)^2 + (10r_3 - 3)^2$$

and although every number 40n + 27 is expressible as the sum of three squares,
these will not necessarily be of shape $(10r \pm 3)^2$. For example, 427 is uniquely
expressible as $9^2 + 11^2 + 15^2$ and 667 only as $1^2 + 15^2 + 21^2$ or $9^2 + 15^2 + 19^2$,
so 10 and 16 are not the sum of three heptagons. Nor is 76; are there others?
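
A short search makes the heptagonal exceptions concrete. This is a sketch under the assumption that ranks of either sign, and rank 0, are allowed, as in the discussion above; the helper names and the bound 100 are illustrative.

```python
def generalized_heptagonal_up_to(limit):
    """Heptagonal numbers r(5r - 3)/2 of every integer rank r, not exceeding limit."""
    values = {0}
    r = 1
    while r * (5 * r - 3) // 2 <= limit:
        values.add(r * (5 * r - 3) // 2)        # rank r
        if r * (5 * r + 3) // 2 <= limit:
            values.add(r * (5 * r + 3) // 2)    # rank -r
        r += 1
    return sorted(values)


def not_sum_of_three_heptagonal(limit):
    """Numbers n <= limit that are not a sum of three heptagonal numbers of any rank."""
    hept = generalized_heptagonal_up_to(limit)
    reachable = {a + b + c for a in hept for b in hept for c in hept}
    return [n for n in range(limit + 1) if n not in reachable]


if __name__ == "__main__":
    print(generalized_heptagonal_up_to(100))
    print(not_sum_of_three_heptagonal(100))
```

Increasing the bound gives a quick way to look for further exceptions beyond 76.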

If n is the sum of three octagonal numbers, 


$$n = r_1(3r_1 - 2) + r_2(3r_2 - 2) + r_3(3r_3 - 2),$$
$$3n + 3 = (3r_1 - 1)^2 + (3r_2 - 1)^2 + (3r_3 - 1)^2,$$

there will be no such representation if $3n + 3 = 4^k(8l + 7)$, i.e., if
$n = 4^k(8l + 5) - 1$, e.g., n = 8l + 4, 32l + 19, 128l + 79, ... but otherwise it
seems that there is always a representation, provided we allow numbers of both
positive and negative rank. If so, this implies that all multiples of 3, other than
those of shape $4^k(24l + 15)$, are representable as the sum of three squares, none of
them being multiples of 3.

It is clear that, for k > 8, k — 4 requires k — 4 ones to represent it as the sum 
of k-gonal numbers, while 2k — 1 and 5k — 4 require k k-gonal numbers of 
positive rank to represent them (k and k — 1 ones; four ks and k — 4 ones). But, 
for n sufficiently large, presumably a much smaller number suffices. 

What theorems are there, stating that all numbers of a suitable shape are 
expressible as the sum of three (say) squares of numbers of a given shape? 

We can also ask for the number of representations of n as the sum of polygonal 
numbers. Jacobi (see [3, 5, 7, 8]) gave the answers for two or four squares: 

The number of representations of n as the sum of two squares is $4\{d_1(n) - d_3(n)\}$,
where $d_i(n)$ is the number of divisors of n which are ≡ i mod 4.

The number of representations of n as the sum of four squares is
$8(2 + (-1)^n)\sigma_o(n)$, where $\sigma_o(n)$ is the sum of the odd divisors of n.

This last number can also be read as “8 times the sum of those divisors of n
which are not multiples of 4.” Notice that these results count representations as
different if they differ only in order, or in sign. For example
$4 = (\pm 2)^2 + 0^2 = 0^2 + (\pm 2)^2$ has 2 + 2 = 4 representations and
$4 = (\pm 1)^2 + (\pm 1)^2 + (\pm 1)^2 + (\pm 1)^2 = (\pm 2)^2 + 0^2 + 0^2 + 0^2$ has $2^4 + 4 \cdot 2 = 24$.

For the corresponding result for two triangular numbers, see [5 & 6]; the
number of representations of n as the sum of two triangular numbers is
$d_1(4n + 1) - d_3(4n + 1)$.

Can we find corresponding results for any of the other polygonal numbers? 
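
The counting conventions above (order and sign both matter for squares) are easy to get wrong, so a brute-force comparison may help. The sketch below checks the two Jacobi formulas and the triangular formula for small n. It assumes triangular numbers of nonnegative rank, counted as ordered pairs; all function names are illustrative.

```python
from math import isqrt


def square_representations(n, k):
    """Number of ordered k-tuples of integers (signs count) whose squares sum to n."""
    if k == 0:
        return 1 if n == 0 else 0
    bound = isqrt(n)
    return sum(square_representations(n - a * a, k - 1) for a in range(-bound, bound + 1))


def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]


def d_difference(n):
    """d_1(n) - d_3(n): divisors congruent to 1 mod 4 minus those congruent to 3 mod 4."""
    ds = divisors(n)
    return sum(d % 4 == 1 for d in ds) - sum(d % 4 == 3 for d in ds)


def jacobi_two_squares(n):
    return 4 * d_difference(n)


def jacobi_four_squares(n):
    odd_divisor_sum = sum(d for d in divisors(n) if d % 2 == 1)
    return 8 * (2 + (-1) ** n) * odd_divisor_sum


def triangular_pair_count(n):
    """Ordered pairs of triangular numbers of nonnegative rank summing to n."""
    tri = []
    r = 0
    while r * (r + 1) // 2 <= n:
        tri.append(r * (r + 1) // 2)
        r += 1
    return sum(1 for a in tri for b in tri if a + b == n)


if __name__ == "__main__":
    for n in range(1, 13):
        print(n, square_representations(n, 2), jacobi_two_squares(n),
              square_representations(n, 4), jacobi_four_squares(n),
              triangular_pair_count(n), d_difference(4 * n + 1))
```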
Excerpt from The Long Comeback of Henry Roth: Call It Miraculous by
Leonard Michaels, The New York Times Book Review, August 15,
1993.


In one of Walton’s letters, written to a friend in 1960, when Mr. Roth was 
in Maine, she says, “He cannot write, and calms himself and his depressions, 
which seem many, by studying not Greek, as with me, but math.” 

Mr. Roth says he used the textbook “Calculus and Analytic Geometry,” by
George Thomas of M.I.T. He even sent Professor Thomas a letter “very
respectfully” correcting some of the solutions (“Who else would be crazy
enough to do every problem in the book?” he says in his self-deprecating
way.) To think of the self-taught Mr. Roth doing calculus problems reveals
much about his exertion, in the freezing and scary darkness of Maine, not to
write fiction.

[Author’s answer to his question: Donald Knuth (see his interview in
Mathematical People, pp. 185-186).]

Contributed by Stephen B. Maurer 

Department of Mathematics 

Swarthmore College 

Swarthmore, PA 19081-1397 
Letter to the Editor


Recently Norbert Hegyvari [1] elegantly subsumed proofs of the irrationality 
of the fraction consisting of the decimal representations of consecutive 
primes by showing the following: 


Let $a_1, a_2, \dots$ be a sequence of distinct positive integers with decimal
representations $(a_1), (a_2), \dots$. Suppose the fraction $\alpha = 0.(a_1)(a_2)\dots$
is rational. Then $\sum 1/a_i$ converges.


The proof, however, can be considerably simplified. Omitting a finite number
of terms of the sequence if necessary, we may assume α has a periodic
decimal representation. Call the period s. For a fixed k consider the terms $a_i$
for which $10^{k-1} \le a_i < 10^k$, i.e. those with exactly k digits. As a term is
uniquely defined by its number of digits and its starting location within α
(taken modulo s), there can be no more than s terms of k digits. Hence,
$\sum 1/a_i \le \sum_k s/10^{k-1}$.
The latter series converges and the result follows.


[1] N. Hegyvari, On some irrational decimal fractions, Amer. Math. Monthly 
100 (1993), 779-780. 
PROBLEMS AND SOLUTIONS


Edited by: 
Richard T. Bumby, Fred Kochman and Douglas B. West 


Proposed problems should be sent to the MONTHLY PROBLEMS address given on 
the inside front cover. Please include solutions, relevant references, etc. Three copies 
are requested. 


Solutions of published problems should arrive before July 31, 1994 at the MONTHLY 
PROBLEMS address given on the inside front cover. Solutions should be typed with
double spacing, including the problem number and the solver’s name and mailing 
address. Two copies suffice. A self-addressed postcard or label should be included if 
an acknowledgment is desired. 


An asterisk (*) after the number of a problem, or part of a problem, indicates that 
no solution is currently available. Partial solutions will be useful in such cases. 
Otherwise, the published solution is likely to be based on a solution which is complete 
and correct. Of course, an elegant partial solution or a method leading to a more 
general result is always useful and welcome. In addition, references to other 
appearances of MONTHLY problems or to solutions of these problems in the 
literature are also solicited. 


PROBLEMS 


10361. Proposed by Emil A. Cornea, University of Bucharest, Bucharest, Romania, 
and Florin N. Diacu, University of Victoria, Victoria, B.C., Canada. 


Do there exist nonlinear $C^1$ functions $f\colon \mathbb{R} \to \mathbb{R}$ such that for any rational x,
f(x) is also rational and for any irrational x, f(x) is also irrational? 
10362. Proposed by Hans Liebeck and Anthony Osborne, University of Keele, Eng- 
land. 


Let A be a real orthogonal matrix without eigenvalue 1. Let B be obtained 
from A by replacing one of its rows or one of its columns by its negative. Show 
that B has 1 as an eigenvalue. 


10363. Proposed by Joseph M. Santmyer, California University of Pennsylvania, 
California, PA. 


If m, n are integers satisfying $1 \le m \le n - 1$, prove that


[promt -("Z)-ry k+j\{2n-m—2k—-j—3 
2n —2m—1 m }~ teal x 


2(n —-m—k — 1) 
10364. Proposed by Frank Schmidt, Arlington, VA.


Let $S_{2n}$ denote the symmetric group of degree 2n. Let $E_{2n}$ (respectively $O_{2n}$)
be the set of those permutations in $S_{2n}$ all of whose cycle lengths are even
(respectively odd). Show that $E_{2n}$ and $O_{2n}$ are equinumerous by finding an explicit
bijection between them.
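
Before looking for a bijection it can be reassuring to see the two counts agree in small cases. The following is a brute-force sketch only (it enumerates all (2n)! permutations, so it is practical for n ≤ 4 or so); the function names are illustrative and it does not construct the requested bijection.

```python
from itertools import permutations


def cycle_lengths(perm):
    """Cycle lengths of a permutation given as a tuple mapping i -> perm[i]."""
    seen = [False] * len(perm)
    lengths = []
    for start in range(len(perm)):
        if not seen[start]:
            length = 0
            j = start
            while not seen[j]:
                seen[j] = True
                j = perm[j]
                length += 1
            lengths.append(length)
    return lengths


def count_even_and_odd(n):
    """Return (#permutations of 2n points with all cycles even, # with all cycles odd)."""
    even = odd = 0
    for perm in permutations(range(2 * n)):
        lengths = cycle_lengths(perm)
        if all(length % 2 == 0 for length in lengths):
            even += 1
        elif all(length % 2 == 1 for length in lengths):
            odd += 1
    return even, odd


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, count_even_and_odd(n))
```

For n = 1, 2, 3 both counts equal the square of (2n - 1)(2n - 3)···3·1.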


10365. Proposed by Daniel Tisdale (student), University of Maryland, Baltimore, 
MD. 


Consider the square $\{(x, y): 0 \le x, y \le 1\}$ divided into $n^2$ equal small squares
by the lines x = i/n and y = j/n. For $1 \le i \le n$, let $x_i = i/n$ and

$$d_i = \min_{0 \le j \le n} \Bigl|\sqrt{1 - x_i^2} - \frac{j}{n}\Bigr|.$$

Determine

$$\lim_{n \to \infty} \sum_{i=1}^{n} d_i,$$

or show that it does not exist.
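
Readers who wish to experiment before attacking the limit can tabulate the sum directly. This is only a numerical sketch in floating point, and it proves nothing about existence of the limit; `vertical_gap_sum` is a name chosen here.

```python
from math import sqrt


def vertical_gap_sum(n):
    """Sum over i = 1..n of d_i, the distance from sqrt(1 - (i/n)^2) to the nearest j/n."""
    total = 0.0
    for i in range(1, n + 1):
        height = sqrt(n * n - i * i)      # circle height on x = i/n, in grid units
        nearest = round(height)           # nearest grid line index j, with 0 <= j <= n
        total += abs(height - nearest) / n
    return total


if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        print(n, vertical_gap_sum(n))
```

Each d_i is at most 1/(2n), so every value printed lies between 0 and 1/2.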


10366. Proposed by Raphael M. Robinson, University of California, Berkeley, CA. 


Let C denote the Cantor set. If s is a real number, by C + s is meant the set of 
all sums c + s where c ∈ C. Find a value of s such that C + s contains no rational
number. This value of s must be given explicitly; an existence proof does not 
suffice. 


10367. Proposed by Donald R. Chalice, Western Washington University, Bellingham, 
WA. 


Let C be the Cantor set in [0,1], and let E be the set of endpoints of the
removed intervals (together with 0 and 1). Let F = C − E and let p be the point
(1/2,1/2). For any c ∈ C, let $L_c$ be the line segment from p to c, let $Q_c$ be the
points on $L_c$ with rational ordinates and $I_c$ the points of $L_c$ with irrational
ordinates. The set

$$T = \Bigl(\bigcup_{e \in E} Q_e\Bigr) \cup \Bigl(\bigcup_{f \in F} I_f\Bigr)$$

has the property that T is connected, but T − p is totally disconnected. Consider
instead

$$T_1 = \Bigl(\bigcup_{e \in E} I_e\Bigr) \cup \Bigl(\bigcup_{f \in F} Q_f\Bigr)$$

obtained by interchanging the roles of points with rational and irrational ordinates.
Is $T_1$ connected?
NOTES


Notes: (10363) No limits of summation are included in the statement since the
sums may be extended over all integers with the convention that $\binom{n}{k} = 0$ unless
$0 \le k \le n$. For the convenience of readers, we note that, in this case, the nonzero
terms occur for $0 \le k \le n - m - 1$ and $0 \le j \le m - 1$. (10364) It is known that
$\#(E_{2n}) = \#(O_{2n}) = ((2n - 1)(2n - 3) \cdots 3 \cdot 1)^2$, but proofs of this are based on
separate counting of the two sets. (10365) This is a representative of a large family
of questions. The $d_i$ are the shortest distances on the vertical lines between grid
points and the graph of the unit circle in the first quadrant; and one could also
consider the behavior of the signed distance from the circle to the closest grid
point. The sum of absolute values is only one way to describe the n-tuple of
distances. The pattern of signed distances for n = 200 or so illustrates the
difficulty of representing smooth curves by grid points. (10367) The set T was
constructed by B. Knaster and K. Kuratowski in the paper “Sur les ensembles
connexes”, Fund. Math. 2 (1921), 206-255 (this can also be found in L. A. Steen
& J. A. Seebach, Jr., Counterexamples in Topology, p. 145). The set $T_1$ appears in
B. Gelbaum and J. Olmsted, Counterexamples in Analysis, p. 146 in place of the
original example of Knaster and Kuratowski. Thus we are asking whether this
example has the advertised property.


SOLUTIONS 


A Maximal Average-Excluding Set 


E 3474 [1991, 956]. Proposed by Andrew Lenard, Indiana University, Bloomington, 
IN. 


Let S be the set of nonnegative real numbers whose expansion to the base 4 
involves only the digits 0 and 1. 

(a) Prove that if x ∈ S, y ∈ S, and x ≠ y, then (x + y)/2 ∉ S.

(b) Suppose T is a set of nonnegative real numbers properly containing S. Prove
that there exist x ∈ T, y ∈ T, x ≠ y such that (x + y)/2 ∈ T.


Composite solution of part (a) by many solvers. If (x + y)/2 ∈ S, then the base
4 expansion of x + y must involve only the digits 0 and 2. But since x ≠ y, the
base 4 expansion of x + y must in fact include at least one 1. So (x + y)/2 ∉ S
(since for elements of S the expansion is unambiguously determined).


Solution of part (b) by NSA Problems Group, Fort Meade, MD. It suffices to
show that for any nonnegative x ∉ S, there are y, z ∈ S with x + y = 2z. Suppose
we write $x = \sum_{i \le N} x_i 4^i$, $x_i$ = 0, 1, 2, or 3. Let $t = \sum_{i \le N+1} 4^i$ and define
integers $u_i$ by the base 4 expansion of $u = t + x = \sum_{i \le N+1} u_i 4^i$. Next, for each
integer i, define integers $y_i$ and $z_i$ with values given by the following table:

  u_i   y_i   z_i
   0     1     0
   1     0     0
   2     1     1
   3     0     1

Then define y, z ∈ S by

$$y = \sum_{i \le N+1} y_i 4^i$$

and

$$z = \sum_{i \le N+1} z_i 4^i.$$

Since each row of the table satisfies $2z_i + 1 = u_i + y_i$, we find that

$$2z - y = \sum_{i \le N+1} (2z_i - y_i)4^i = \sum_{i \le N+1} (u_i - 1)4^i = u - t = x$$

as desired. Note that if the construction is applied to x ∈ S, then since each $u_i$ is 1
or 2 we end up with y = z and hence x = y, in conformity with part (a).
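
The digit table translates directly into a few lines of code. The sketch below is restricted to nonnegative integers x, so that only digit positions 0, ..., N + 1 occur; that restriction, and the names `in_S` and `average_witness`, are assumptions made here for illustration and are not part of the published solution.

```python
def in_S(n):
    """True when the base 4 expansion of the nonnegative integer n uses only digits 0 and 1."""
    while n:
        if n % 4 > 1:
            return False
        n //= 4
    return True


def average_witness(x):
    """Return (y, z), both with base 4 digits in {0, 1}, such that x + y == 2 * z."""
    width = 1
    while 4 ** width <= x:           # x occupies digit positions 0 .. width - 1
        width += 1
    width += 1                       # one extra position, playing the role of N + 1
    t = (4 ** width - 1) // 3        # 11...1 in base 4
    u = t + x
    y = z = 0
    for i in range(width):
        digit = (u // 4 ** i) % 4
        y += (1 - digit % 2) * 4 ** i    # table column y_i: 1, 0, 1, 0
        z += (digit // 2) * 4 ** i       # table column z_i: 0, 0, 1, 1
    return y, z


if __name__ == "__main__":
    for x in (2, 3, 7, 11, 100):
        y, z = average_witness(x)
        print(x, y, z, x + y == 2 * z)
```

Adding t before reading off digits is what removes the need to track carries by hand.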


Editorial comment. For part (b), most respondents took an approach requiring a
slightly awkward analysis of the propagation of carries in the sum of base 4
numbers x + y for y ∈ S, x ∉ S. In about half the responses the presentation was
significantly less clear than the selected solution. The device adopted above of
introducing the number t allowed the carries to take care of themselves. Jiro
Fukuta obtained a similar effect by using a base 4 expansion in which the digits are
−1, 0, 1, and 2.

Albert Nijenhuis employed Cantor set constructions. The set $A_n$ consists of the
numbers in the interval [0, 1] having a base 4 expansion whose first n digits are 0 or
1, and $B_n$ are those whose first n digits are 0 or 2. Thus $A_1$ = [0, 1/2] and
$B_1$ = [0, 1/4] ∪ [1/2, 3/4]; while the intersection of the $A_n$ is S ∩ [0, 1) and the
intersection of the $B_n$ gives the set of doubles of those numbers. Given x ∈ [0, 1/4],
he studies the set of $y_n \in A_n$ for which $x + y_n \in B_n$. The digits of such $y_n$ are
constructed as in other solutions, but only n digits are required. The limit process
is postponed and aided by the compactness of the sets $A_n$ and $B_n$.

Raphael Robinson gave an argument that the number y constructed in part (b) 
is unique, unless the expansion of x terminates in an infinite sequence of 2’s. For 
instance, if x = .222... then the construction above yields y = 1.11111..., while 
in fact y = 0 would also work. 

The proposer noted that the problem could be rephrased to assert that S is a 
maximal average-excluding subset of R. Robert High made a similar observation, 
and went on to ask if there are other closed maximal average-excluding subsets of 
R. There is no difficulty in obtaining maximal average-excluding sets by Zorn’s 
lemma, but a construction of a different closed set with this property, or a proof 
that this property uniquely characterizes S up to affine equivalence would be of 
interest. 


Solved also by K. F. Andersen (Canada), C. D. Ashbacher, J. W. Benham, D. M. Bloom, D. Callan,
R. J. Chapman (U.K.), M. Dindos (Slovakia), J. Fukuta (Japan), R. High, K. S. Kedlaya (student),
N. Komanda, A. F. Martin, A. Nijenhuis, R. E. Prather, A. Riese, R. M. Robinson, R. L. Schilling
(Germany), A. Stein, Anchorage Math Solutions Group, Western Maryland College Problems group,
and the proposer. One solution of part (a) only and three incorrect solutions were received.


The Smallest Factorial That Is a Multiple of n


6674 [1991, 965]. Proposed by Paul Erdős, Hungarian Academy of Sciences, Budapest, Hungary.


If n is an integer greater than 1, let P(n) denote the largest prime factor of n.
Prove that n | P(n)! for almost all n, i.e., prove that if
$$S(x) = \{n \le x : n \nmid P(n)!\},$$
then
$$\lim_{x \to \infty} |S(x)|/x = 0.$$

Solution by Ilias Kastanas, California State University, Los Angeles, CA. Let
$S'(x) = \{n \le x : n \mid P(n)!\}$; we will show that $d = \lim_{x \to \infty} |S'(x)|/x$ is equal to 1.
Let Q denote the set of square free natural numbers, and introduce

$$Q(x) = |Q \cap \{1, 2, \dots, \lfloor x \rfloor\}|,$$
$$Q_a = |\{n : n \in Q \text{ and all prime factors of } n < a\}|.$$

For any integer k > 0 let f(k) be the least prime greater than 3k. If q ∈ Q and q
is divisible by some prime ≥ f(k), then $k^2 q \mid P(q)!$, since 2k and 3k are factors of
P(q)!/q. So, there are at least $Q(x/k^2) - Q_{f(k)}$ integers of the form $k^2 q$ in S'(x),
and

$$\lim_{x \to \infty} \frac{Q(x/k^2) - Q_{f(k)}}{x} = \lim_{x \to \infty} \frac{Q(x/k^2)}{x} = \frac{1}{\zeta(2)} \cdot \frac{1}{k^2}.$$

For the last equation, see G. H. Hardy and E. M. Wright, An Introduction to the
Theory of Numbers, Oxford, 1980, Theorem 333 in §18.5. Since the sets $k^2 Q$ are
disjoint for distinct k, we have

$$1 \ge d \ge \frac{1}{\zeta(2)} \sum_{k=1}^{n} \frac{1}{k^2}$$

for any n > 0. Therefore,

$$d \ge \frac{1}{\zeta(2)} \cdot \zeta(2) = 1.$$


Editorial comment. In preparing the problem for publication, the editors ob- 
tained a solution from Kevin Ford based on a division into cases depending on the 
number of prime factors of n and the size of P(n). He was able to show that 
|S(x)| = O(x/log x) by this method, and indicated that stronger results could be 
obtained using sieve theoretic estimates. 
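
The exceptional set S(x) thins out slowly, which a small computation makes visible. The sketch below decides whether n divides P(n)! by comparing prime exponents (Legendre's formula) rather than by forming the factorial; the function names are illustrative, and the direct factorial test is included only as a cross-check for small n.

```python
from math import factorial


def factorize(n):
    """Prime factorization of n >= 2 as a dict {prime: exponent}, by trial division."""
    factors = {}
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def legendre_exponent(p, m):
    """Exponent of the prime p in m!, i.e. sum of floor(m / p^j) over j >= 1."""
    exponent = 0
    power = p
    while power <= m:
        exponent += m // power
        power *= p
    return exponent


def divides_largest_prime_factorial(n):
    """True when n divides P(n)!, where P(n) is the largest prime factor of n."""
    factors = factorize(n)
    largest = max(factors)
    return all(legendre_exponent(p, largest) >= a for p, a in factors.items())


def divides_by_direct_factorial(n):
    """Same test done naively; only sensible for small n."""
    return factorial(max(factorize(n))) % n == 0


def exceptional_fraction(x):
    """|S(x)| / x, where S(x) is the set of 2 <= n <= x with n not dividing P(n)!."""
    bad = sum(1 for n in range(2, x + 1) if not divides_largest_prime_factorial(n))
    return bad / x


if __name__ == "__main__":
    for x in (100, 1000, 10000):
        print(x, exceptional_fraction(x))
```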


No other solutions were received. 
A Characterization of Special Quadratic Irrationals 


10218 [1992, 362]. Proposed by David Dwyer, University of Evansville, Evansville, 
IN. 
For positive real numbers r and positive integers n, put

$$\varphi(n, r) = \langle nr \rangle + \langle \lfloor nr \rfloor r \rangle,$$
where $\langle x \rangle = x - \lfloor x \rfloor$ denotes the “fractional part” of x.
Find $\{r \in \mathbb{R}^+ : \varphi(n, r) > 1 \text{ for all } n \in \mathbb{Z}^+\}$.


Solution by Richard Stong, Rice University, Houston, TX. The numbers r > 0 for
which $\varphi(n, r) > 1$ for all $n \in \mathbb{Z}^+$ are exactly the numbers

$$r = (k + \sqrt{k^2 + 4m})/2$$

for integers $k \ge m \ge 1$, or equivalently the positive roots of the equations
$x^2 - kx - m = 0$.

Suppose first that r is such a number. Then it is easy to verify that r is
irrational, with k < r < k + 1, so that $\lfloor r \rfloor = k$. Since r is irrational, for any
positive integer n we may write nr = p + θ for some positive integer p and
0 < θ < 1. Then (writing $\langle \cdot \rangle$ for fractional part), $\langle nr \rangle = \theta$ and
$\lfloor nr \rfloor r = pr = (nr - \theta)r = nr^2 - \theta r = n(kr + m) - \theta r = kp + k\theta + nm - \theta r = kp + nm + (k - r)\theta$.
Since 0 > k − r > −1 we see that

$$\langle \lfloor nr \rfloor r \rangle = 1 - (r - k)\theta,$$
$$\varphi(n, r) = 1 + (k + 1 - r)\theta > 1.$$

Thus, all numbers r of the stated form work.
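
A numerical spot check of this first half of the argument is straightforward. The sketch below uses ordinary floating point, which is adequate for the small n tried here but is an assumption rather than a proof; `frac`, `phi` and `special_root` are names chosen for illustration.

```python
from math import floor, sqrt


def frac(x):
    """Fractional part x - floor(x)."""
    return x - floor(x)


def phi(n, r):
    """The quantity <nr> + <floor(nr) r> from the problem statement."""
    return frac(n * r) + frac(floor(n * r) * r)


def special_root(k, m):
    """Positive root of x^2 - kx - m = 0."""
    return (k + sqrt(k * k + 4 * m)) / 2


if __name__ == "__main__":
    golden = special_root(1, 1)
    print(min(phi(n, golden) for n in range(1, 500)))   # stays above 1
    print(phi(1, sqrt(2)))                              # below 1: sqrt(2) is not of the stated form
```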

Suppose r is not of the stated form. If r is a rational number p/q then one
easily checks that $\varphi(q, r) < 1$. Thus we may assume r is irrational. Let $k = \lfloor r \rfloor$ and
set $s = r^2 - kr = r(r - k)$. Since k < r < k + 1 we necessarily have 0 < s < k + 1.
Therefore, since r is not of the specified form, $s = r^2 - kr$ cannot be an
integer.


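Claim. There is a positive integer n with
$$1 - \langle nr \rangle > \langle ns \rangle > \langle nr \rangle.$$
Granting the claim and using the specified n, write
$$nr = p + \theta \quad\text{and}\quad ns = q + \sigma$$

for non-negative integers p, q, and with 0 < θ < σ < 1 − θ. Then $\langle nr \rangle = \theta$ and
$\lfloor nr \rfloor r = pr = nr^2 - r\theta = nkr + ns - r\theta = pk + q + \sigma + (k - r)\theta$. Since
0 > k − r > −1 and σ > θ we see that $\langle \lfloor nr \rfloor r \rangle = \sigma - (r - k)\theta$, and from σ < 1 − θ we
find $\varphi(n, r) = \sigma + (k + 1 - r)\theta < 1$, as required.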

To establish the claim, consider the set $A = \{(nr, ns) \mid n \in \mathbb{Z}^+\}$ as elements of
the torus $T = \mathbb{R}^2/\mathbb{Z}^2$. The closure $\bar{A} \subset T$ is a closed subgroup of T and must
thereby be a closed submanifold, by the closed subgroup lemma for Lie groups.
Dimension 0 is ruled out since by Kronecker’s theorem the first coordinate nr
projects to a dense set in $\mathbb{R}/\mathbb{Z}$. If $\bar{A}$ has dimension 2, then A clearly has points in
the triangle Δ defined by $1 - \langle nr \rangle > \langle ns \rangle > \langle nr \rangle$. If $\bar{A}$ has dimension 1 then it
is a finite union of lines of rational slope, including one through (0, 0). The slope
can be neither 0 nor ∞, since neither s nor r is an integer. It is then elementary to
see that A must intersect Δ.


Editorial comment. The other two solvers effectively gave self-contained ad hoc
proofs of the above claim, using Kronecker lemma arguments. One would-be
solver showed that for every rational r there is an interval $I = [r, r + \varepsilon_r)$, such
that for any $r_0 \in I$, $\varphi(n, r_0)$ is not always > 1, but then jumped to the fallacious
conclusion that these intervals must cover all of R, owing to the density of the
rationals.


Solved also by R. J. Chapman (U.K.) and the proposer. Two incorrect solutions were received. 
Nonnegative Curve Fitting 


10219 [1992, 362]. Proposed by Alan Horwitz, Penn State University, Media, PA.


(a) Suppose that the function f is positive on R and that $f''(x)$ exists for all
x ∈ R. Prove that there exists $x_0$ ∈ R such that the second order Taylor polynomial
of f centered at $x_0$ is also positive on R.

(b)* Let n be an arbitrary even integer, and suppose that f is positive on R and
that $f^{(n)}(x)$ exists for all x ∈ R. Does there exist $x_0$ ∈ R such that the n-th order
Taylor polynomial of f centered at $x_0$ is also positive on R?


Solution of (a) by David Callan, University of Wisconsin, Madison, WI. By
completing the square, it is easy to verify that $x_0$ has the desired property
iff $f'(x_0) = f''(x_0) = 0$ or $f(x_0) f''(x_0) > f'(x_0)^2/2$. Let $g(x) = \sqrt{f(x)}$, thus
$g'(x) = f'(x)/(2g(x))$ and $g''(x) = (f(x) f''(x) - f'(x)^2/2)/(2g(x)^3)$. If there
exists $x_0$ with $g''(x_0) > 0$, clearly this $x_0$ will do. If not, $g''(x) \le 0$ for all x and
the graph of g lies entirely on or below each of its tangent lines. Hence since g is
positive everywhere, these tangents must all be horizontal, so g and f are
constants, and any $x_0$ will do.
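
The positivity criterion obtained by completing the square can be tried on a concrete function. The sketch below uses f(x) = 1/(1 + x^2), an example chosen here and not taken from the solution, with its first two derivatives written out by hand.

```python
def taylor2_is_positive(f0, f1, f2):
    """Criterion for f0 + f1*h + f2*h^2/2 > 0 for all real h, assuming f0 > 0."""
    return (f1 == 0 and f2 == 0) or f0 * f2 > f1 * f1 / 2


def derivatives(x):
    """(f, f', f'') at x for the illustrative choice f(x) = 1 / (1 + x^2)."""
    q = 1 + x * x
    return 1 / q, -2 * x / q ** 2, (6 * x * x - 2) / q ** 3


def taylor2_value(x0, h):
    """Value at x0 + h of the second order Taylor polynomial of f centered at x0."""
    f0, f1, f2 = derivatives(x0)
    return f0 + f1 * h + f2 * h * h / 2


if __name__ == "__main__":
    print(taylor2_is_positive(*derivatives(0.0)))   # centre 0 fails: the polynomial is 1 - h^2
    print(taylor2_is_positive(*derivatives(2.0)))   # centre 2 satisfies the criterion
```

At the centre 0 the function is concave, so one has to move out to where the square root of f is convex, exactly as the solution suggests.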


Editorial comment. All solvers confined attention to part (a). In the original 
proposal, the proposer included a reference to J. Briggs & L. Rubel, “Interpolation
by non-negative polynomials”, J. Approx. Theory 30 (1980), 160-168 for a version 
of part (b) on a compact interval. He also mentioned A. Horwitz, “Interpolation by 
polynomials non-negative on the real line”, preprint as treating the analogous 
problem in which Taylor polynomials are replaced by interpolating polynomials at 
distinct points. 


Solved also by K. F. Andersen (Canada), D. Caccia, R. J. Chapman (U.K.), R. Cooke, E. A. 
Herman, O. P. Lossers (The Netherlands), M. Mécsy (Hungary), A. Nijenhuis, K. Schilling, Anchorage 
Math Solutions Group, and the proposer. 


A Little Measure Theory 


10225 [1992, 462]. Proposed by Paul R. Chernoff, University of California, Berkeley, 
CA. 


Suppose that $\phi\colon [0, \infty) \to [0, \infty)$ is a strictly increasing, strictly concave function
with $\phi(0) = 0$. Let m* be Lebesgue outer measure on the unit interval I = [0, 1].
For E ⊂ I, define $n^*(E) = \phi(m^*(E))$. Show that n* is an outer measure and
determine the n*-measurable sets.


Solution by Michael B. Gregory, University of North Dakota, Grand Forks, ND.
The monotonicity of n* follows from that of m* and the fact that φ is increasing;
n*(∅) = 0 is a consequence of m*(∅) = 0 and φ(0) = 0. To verify that n* is
countably subadditive, we will first show φ(t)/t is strictly decreasing on (0, ∞): let
0 < a < b be given. Because φ is strictly concave, with α = a/b, we have

$$\phi(a) = \phi((1 - \alpha)0 + \alpha b) > (1 - \alpha)\phi(0) + \alpha\phi(b) = \frac{a}{b}\phi(b).$$

Thus, φ(a)/a > φ(b)/b, as claimed. In turn, this implies that φ is strictly
subadditive: if a, b > 0 then φ(a + b) < φ(a) + φ(b). To see this, write

$$\phi(a + b) = \frac{a}{a + b}\phi(a + b) + \frac{b}{a + b}\phi(a + b) < a\,\frac{\phi(a)}{a} + b\,\frac{\phi(b)}{b} = \phi(a) + \phi(b).$$

Now one can see, by induction, that if $x_1, x_2, \dots, x_n > 0$ then
$\phi(x_1 + \cdots + x_n) < \phi(x_1) + \cdots + \phi(x_n)$. It follows that if $(x_k)$ is any sequence of nonnegative
numbers such that the sequence $(\phi(x_k))$ is summable then the sequence $(x_k)$ is also
summable and

$$\phi\Bigl(\sum_{k=1}^{\infty} x_k\Bigr) \le \sum_{k=1}^{\infty} \phi(x_k),$$

because φ is continuous at each positive number. (Concave functions are continuous
at the interior points of their intervals of concavity.) Therefore, since φ is
increasing, if $(E_k)$ is any sequence of subsets of I, then (we may assume that the
sequence $(\phi(m^*(E_k)))$ is summable)

$$n^*\Bigl(\bigcup_{k=1}^{\infty} E_k\Bigr) \le \phi\Bigl(\sum_{k=1}^{\infty} m^*(E_k)\Bigr) \le \sum_{k=1}^{\infty} \phi(m^*(E_k)) = \sum_{k=1}^{\infty} n^*(E_k),$$

and consequently n* is an outer measure on I.

We now show that the n*-measurable subsets of I are exactly the m*-measurable
sets E ⊂ I with m*(E) = 0 or m*(E) = 1. Because the n*-measure of such a
set or its complement (in I) is zero, it follows from the basic theory of outer
measures that every such set is n*-measurable.

To prove the converse, we first verify, by contrapositive, that any n*-measurable
set must be m*-measurable. Suppose that E ⊂ I is not m*-measurable. Then there
is a test set A ⊂ I such that (with $E^c$ denoting the complement of E in I)

$$m^*(A) < m^*(A \cap E) + m^*(A \cap E^c).$$

Because φ is strictly increasing, it follows that

$$\phi(m^*(A)) < \phi(m^*(A \cap E) + m^*(A \cap E^c)) \le \phi(m^*(A \cap E)) + \phi(m^*(A \cap E^c));$$

the last inequality follows from the subadditivity of φ. This means that

$$n^*(A) < n^*(A \cap E) + n^*(A \cap E^c)$$

and hence E is not n*-measurable. It follows that all the n*-measurable sets must
be Lebesgue measurable.

Now assume E ⊂ I is a Lebesgue measurable set such that 0 < m*(E) < 1.
Taking the “test set” A = I, we see that because both $m^*(A \cap E)$ and $m^*(A \cap E^c)$
are positive,

$$n^*(A) = \phi(m^*(A)) = \phi(m^*(A \cap E) + m^*(A \cap E^c)) < \phi(m^*(A \cap E)) + \phi(m^*(A \cap E^c)) = n^*(A \cap E) + n^*(A \cap E^c).$$

Therefore, E is not n*-measurable. Thus, a subset E ⊂ I is n*-measurable if and
only if E is Lebesgue measurable and either m*(E) = 0 or m*(E) = 1. This means
that the family of n*-measurable sets is the σ-algebra in I generated by the subsets
(of I) that have Lebesgue measure zero.


Solved also by R. J. Chapman (U.K.), N. Komanda, and the proposer. Two incomplete solutions 
were also received. 


Characterizing The Ball 


10228 [1992, 463]. Proposed by Ernesto Bruno Cossi and Marcos Antonio Sebastiani,
Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil.


(a) Let X be a Banach space, and let B be a bounded, nonempty subset of X
such that, for any pair of points x and y in B, there is an open ball U such that
U ⊂ B, x ∈ U and y ∈ U. Show that B is an open ball.

(b) Show that the result of part (a) does not generalize to the case in which X
is only assumed to be a complete metric space.


Solution by Robert B. Israel, University of British Columbia, Vancouver, B.C.,
Canada. Part (a): Let R = sup{r: B contains an open ball of radius r}. This is
finite since B is bounded. Let $(U_n)$ be a sequence of open balls contained in B,
with radii $r_n \to R$ as $n \to \infty$. Let $x_n$ be the centre of $U_n$. Note that

$$\operatorname{diam}(U_i \cup U_j) = r_i + r_j + \|x_i - x_j\|. \tag{1}$$

Thus there exist open balls contained in B (containing pairs of points in $U_i \cup U_j$)
with diameters arbitrarily close to $r_i + r_j + \|x_i - x_j\|$. Since the diameter of
a ball is twice its radius, we must have $r_i + r_j + \|x_i - x_j\| \le 2R$, i.e.,
$\|x_i - x_j\| \le 2R - r_i - r_j$. Therefore $(x_n)$ is a Cauchy sequence, converging to some point x. I
claim that B is the open ball V of radius R centered at x.

Now for any r < R, the open ball of radius r centered at x is contained in $U_n$,
and therefore in B, if $\|x_n - x\| + r < r_n$, and this is true for n sufficiently large.
Since V is the union of these balls, V ⊂ B. On the other hand, if $\|z - x\| > R$
there is a point u ∈ V with $\|u - z\| > 2R$. Thus u ∈ B, but any open ball
containing u and z must have radius greater than R, so z ∉ B. This means that B
is contained in the closure of V. But clearly B is open, so B = V.

Part (b): Consider the complete metric space X consisting of five points a, b, c, 
d and e with d(a, b) = d(b, c) = d(c, d) = d(a, d) = 2, d(a,c) = d(b, d) = 4, and 
d(x, e) = 3 for all x # e. Let B = {a, b, c, d}. Note that an open ball contained in 
B consists of either one point or any three points of B. Thus B has the required 
property, but is not an open ball. 
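
Because the space in part (b) is finite, every claim about it can be verified by enumeration. The following sketch lists all open balls of the five-point space and checks the stated properties; the encoding of the metric and the choice of sample radii (one from each interval between consecutive distances 0, 2, 3, 4) are made here for illustration.

```python
from itertools import combinations

POINTS = "abcde"


def dist(x, y):
    """The metric of part (b): a, b, c, d form a 4-cycle with side 2 and diagonal 4; e is at distance 3."""
    if x == y:
        return 0
    if "e" in (x, y):
        return 3
    gap = abs("abcd".index(x) - "abcd".index(y))
    return 4 if gap == 2 else 2


def open_balls():
    """All distinct open balls {p : dist(c, p) < r}, as frozensets."""
    balls = set()
    for centre in POINTS:
        for radius in (1, 2.5, 3.5, 5):
            balls.add(frozenset(p for p in POINTS if dist(centre, p) < radius))
    return balls


B = frozenset("abcd")
balls_inside_B = [ball for ball in open_balls() if ball <= B]
pairs_covered = all(any({x, y} <= ball for ball in balls_inside_B)
                    for x, y in combinations(sorted(B), 2))
B_is_ball = B in open_balls()

if __name__ == "__main__":
    print(sorted("".join(sorted(ball)) for ball in balls_inside_B))
    print(pairs_covered, B_is_ball)
```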


Editorial comment. G. Muraz & P. Szeptycki remarked that the solution to (a)
remains valid in any complete metric space satisfying (1). Nasha Komanda, Reiner
Martin, and Kenneth Schilling also provided solutions to (b) in which X has five
points and B has four points. Another popular collection of examples used subsets
of the plane. G. Muraz & P. Szeptycki took X = {(s,0): −1 < s < 1} ∪ {0, D}
and B = {(s,0): −1 < s < 1}. The example given by Jean-Pierre Grivaux and
Dave Trautman took X = R with the metric given by d(x, y) = |x − y|/(1 +
|x − y|), with B being the set of positive real numbers.


Solved also by J. Alexopoulos, S.-K. Chung, M. Dindos (Slovakia), J.-P. Grivaux (France), R. 
Holzsager, E. G. Katsoulis, K. S. Kedlaya (student), N. Komanda, R. Martin (student), M. Mdédcsy 
(Hungary), G. Muraz & P. Szeptycki (France), K. Schilling, D. Trautman, and the University of 
Wyoming Problem Circle. A solution by Marcos Antonio Sebastiani accompanied the original proposal. 


Some Inverse Hilbert Polynomials 


10240 [1992, 674]. Proposed by Michael Golomb, Purdue University, West Lafayette, 
IN. 


Fix an integer n. For each integer m with $0 \le m \le n$, let $p_m$ be a polynomial of
degree n for which $\int_0^1 p_m(x)x^l\,dx = 0$ for $0 \le l \le n$ with $l \ne m$, while
$\int_0^1 p_m(x)x^m\,dx = 1$.

(a) Determine the value of $\int_0^1 p_m^2(x)\,dx$.

(b) Find an explicit expression for $p_m$ and prove that the coefficient of $x^l$ in $p_m$
is the same as the coefficient of $x^m$ in $p_l$ for $0 \le l \le m \le n$.


Solution by Robin J. Chapman, University of Exeter, Exeter, U.K. Let

$$c_m = \frac{(-1)^m (n + m + 1)!}{m!^2 (n - m)!} \quad\text{and}\quad c_{l,m} = \frac{c_l c_m}{l + m + 1}.$$

(a) We have

$$\int_0^1 p_m(x)^2\,dx = c_{m,m} = \frac{(n + m + 1)!^2}{m!^4 (n - m)!^2 (2m + 1)}.$$

(b) Also

$$p_m(x) = \sum_{l=0}^{n} c_{l,m} x^l.$$

Proof: Represent a polynomial f of degree ≤ n by a row vector
$v_f = (a_0, a_1, \dots, a_n)$, where $f = \sum_{k=0}^{n} a_k x^k$. Then $\int_0^1 f(x)g(x)\,dx = v_f H v_g^T$ where
$H = (1/(l + m + 1))_{l,m=0}^{n}$ is the n + 1 by n + 1 Hilbert matrix. If $C = (c_{l,m})_{l,m=0}^{n} = H^{-1}$
then putting $p_m(x) = \sum_{l=0}^{n} c_{l,m} x^l$ we immediately get $\int_0^1 p_m(x)x^l\,dx = \delta_{l,m}$.
As H is symmetric so is C, and so the latter assertion of (b) is immediate. Also

$$\int_0^1 p_m(x)^2\,dx = \sum_{l=0}^{n} c_{l,m} \int_0^1 p_m(x)x^l\,dx = c_{m,m}.$$

The formula for $c_{l,m}$ is classical.
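
The classical formula can be confirmed in exact arithmetic for small n. The sketch below builds the Hilbert matrix and the claimed inverse with entries c_l c_m/(l + m + 1), where c_m = (-1)^m (n + m + 1)!/(m!^2 (n - m)!), and multiplies them; the function names are illustrative.

```python
from fractions import Fraction
from math import factorial


def c(n, m):
    """c_m = (-1)^m (n + m + 1)! / (m!^2 (n - m)!)."""
    return Fraction((-1) ** m * factorial(n + m + 1), factorial(m) ** 2 * factorial(n - m))


def hilbert(n):
    """The (n + 1) x (n + 1) Hilbert matrix with entries 1 / (l + m + 1)."""
    return [[Fraction(1, l + m + 1) for m in range(n + 1)] for l in range(n + 1)]


def claimed_inverse(n):
    """Matrix with entries c_{l,m} = c_l c_m / (l + m + 1)."""
    return [[c(n, l) * c(n, m) / (l + m + 1) for m in range(n + 1)] for l in range(n + 1)]


def matmul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)] for i in range(size)]


def identity(size):
    return [[Fraction(int(i == j)) for j in range(size)] for i in range(size)]


if __name__ == "__main__":
    n = 3
    C = claimed_inverse(n)
    print(matmul(C, hilbert(n)) == identity(n + 1))
    print([C[m][m] for m in range(n + 1)])   # the values of the integral in part (a)
```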
Editorial comment. A reference for the entries of H~! supplied by readers was 
D. E. Knuth, The Art of Computer Programming, Vol. I, Addison-Wesley, 1973, ex. 
41, p. 36, which deals with the more general Cauchy matrix, and ex. 45, p. 37. 


David and Peter Borwein observed that the result may be generalized to Müntz
polynomials $\sum_{k=0}^{n} a_k x^{\lambda_k}$ where the exponents $\lambda_k$ are real numbers satisfying
$-1/2 < \lambda_0 < \lambda_1 < \cdots < \lambda_n$. The quantities

$$c_m = \frac{\prod\{\lambda_k + \lambda_m + 1 : 0 \le k \le n\}}{\prod\{\lambda_m - \lambda_k : 0 \le k \le n;\ k \ne m\}}$$

generalize the corresponding expressions in the classical case, and the coefficient
$c_{l,m} = c_l c_m/(\lambda_l + \lambda_m + 1)$. The expression for the inner products of Müntz
polynomials in terms of their coefficients is given by a Cauchy matrix, so the approach
used above applies in this generality. Another approach to part (a), suggested by
the proposer, is to calculate the coefficients of the $p_m(x)$ in terms of the
orthogonal basis of Legendre polynomials.


Solved also by S. Ali, R. Bagby, S. F. Barger, D. J. Barrett, K. L. Bernstein, D. Borwein & P. 
Borwein (Canada), D. Callan, R. Cerf (France), D. A. Darling, K. Diethelm (Germany), I. Dimitrić,
P. G. Engstrom, J. Fukuta (Japan), J. A. Gomez Ortega (Mexico), E. A. Herman, R. Holzsager, W. 
Hong, F.-A. Izadi (Iran), I. Kastanas, N. Komanda, H. K. Krishnapriyan, C. Libis, O. P. Lossers (The 
Netherlands), S. Matz, A. Nijenhuis, A. Pedersen (Denmark), F. C. Rembis, H. J. Seiffert (Germany), 
D. Spellman, M. Stamp, W. F. Trench, D. B. Tyler, R. L. Van de Wetering, M. Vowe (Switzerland), 
National Security Agency Problems Group, Skidmore College Problem Group, Western Maryland 
College Problems group, University of Wyoming Problem Circle, and the proposer. 


Collaborating editors: David F. Appleyard, Paul T. Bateman, Duane M. Broline, 
Barry W. Brunson, Frank S. Cater, Gulbank D. Chakerian, Underwood Dudley, 
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. 
Griggs, Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, 
Daniel J. Kleitman, Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer, 
Stephen L. Portnoy, J. O. Shallit, John Henry Steelman, Kenneth B. Stolarsky, 
David E. Tepper, Douglas B. Tyler, Daniel Ullman, and William E. Watkins. 


From the New Yorker, July 26, p. 62: 


This was a world bounded by a 
diminishing set of coordinates. There 
were from the beginning a finite num- 
ber of employers who needed what 
these people knew how to deliver, and 
what these people knew how to deliver
was only one kind of product. “Our
industry’s record at defense conversion
is unblemished by success”...


Contributed by Emma Lehmer 
1180 Miller Avenue 
Berkeley, CA 94708 
REVIEWS


Edited by Darrell Haile 
Indiana University, Bloomington, IN 47405 


Reality Rules: I. The Fundamentals; II. The Frontier. By John L. Casti. John
Wiley & Sons, New York, 1992, $72.00 (set).


Reviewed by Rutherford Aris 


Par ma foi! il y a plus de quarante ans que je dis de la prose sans que j’en 
susse rien. 


Many a mathematical practitioner must have reacted—albeit with a whimsical 
smile—to the current popularity of “mathematical modeling” as M. Jourdain does 
when told that his orders to Nicole are indeed prose. Books with these words in 
their titles or subtitles have been sprouting plentifully in the last decade and more. 
They range from elementary introductions and collections of case histories through 
general discussions of modeling methodology to the metaphysics and philosophy of 
the subject. Such topics as chaos, fractals and complexity have been treated at 
book-length at a level almost accessible to the notorious ‘man in the street’. With 
this popularization, often very well done, there has come a healthy awareness of 
the more concrete aspects of mathematics, even though fraught with the danger of 
misunderstanding which either invests the model with a spurious authority or else 
dismisses it as something less than real. (Some recent observations by Davis and 
Booss-Bavnbek in SIAM News, 6 May 1993, pp. 6-7, on ‘Mathematics and the
Media’ are very much to the point.) Practitioners of the art and craft of modeling 
(I use ‘and,’ for the old disdain of the artist for the craftsman is as sterile as that of 
the ‘pure’ mathematician for the applied which sometimes used to fester among 
the second-rate. I do not regard Hardy’s famous Apology as expressing disdain, 
even though there is clearly a lack of sympathy.) are often too much wrapped up in 
the detail and the peculiar beauties of their model to be easily able to step aside 
and take a larger view. There is therefore a genuine need for expositions of the 
general principles and overarching considerations of modelling that will help us to 
get a sound perspective on the whole enterprise. It is to this need that Casti’s
two-volume extension of his 1989 Alternate Realities is addressed.

Reality Rules can be used independently of its precursor and it is very much 
more than a mere revision or updating of it. Its title is deliberately ambivalent, its 
scope far-reaching and its style somewhat reminiscent of the late Dick Bellman to 
whose memory it is dedicated. I mean this in the best sense, not that it is imitative. 
But there is the same variousness of example, introduced seriously and thought- 
fully enough, but leaving much unsaid and needing to be supplemented from the 
references, given with a useful word or two of comment at the chapter’s end. 
Mention of some of the examples will indicate the range of contacts between the 
external reality and mathematical model: Keynesian dynamics and equivalent 
economies, forest insect pest control, the van der Pol equation, racial integration 
in urban housing, rodents and cellular automata, DNA sequences, chaos in
monetary aggregates, the dynamics of beer distribution, the art of Escher and 
Mondrian, industrial manufacturing processes, the discrete, controlled pendulum, 
cognitive theory, sea bass spawning and the flowering times of competing plants. It 
is an abundantly furnished table to which we are led. A very strong point of the 
book is the generous supply of exercises, discussion questions and problems which 
make the book attractive for classroom or seminar use. There is to be a solutions
manual, obtainable from the publishers. 

Because the individual models in Reality Rules are introduced to illuminate 
some particular point or other, only a few are developed far enough that one gets a 
sense of the shaped and polished model. Of course there is a certain danger in 
even wanting a model to be a complete thing. No model stands alone and there are 
always plenty of possible developments. But provided the sin of idolatry is avoided, 
a good model has a life of its own and may be enjoyed for its own sake. Take, for 
instance, the Lorenz equations, the logistic map or some of Otto Rössler’s systems.
They have all the ‘juice and joy’ of old, and modern, masters and one may well 
seek, with Hopkins, to know whence it comes. But perhaps this must await a 
mathematical editor of the temperament of a classical scholar who will bring out a 
Corpus Systemorum Dynamicorum to rival the greatness of the Corpus Vasorum or 
Lowe’s Codices Latini Antiquiores.

Though the details of encoding—i.e. the task of setting up a model with its 
choice of variables and parameters, their manipulation and the preliminary shap- 
ing of the equations—are not the subject of Reality Rules, the modeling relation is 
clearly explained and the system scientist installed as the ferryman across the river 
that divides the mathematical and the ‘real’ worlds, encoding the natural system in 
one direction and decoding the formal system in the other. Three major types of 
model are considered in the “Fundamentals” volume and they are related to major 
classes of problems. Thus the singularities of resource systems are amenable to 
analysis from the viewpoint of catastrophe theory; pattern formation and the 
emergence of living forms link with discrete dynamical systems and cellular 
automata; turbulence and population dynamics invite chaos theory. Similarly, in the
second volume, “The Frontier” (the dividing line between the volumes is clearly not
to be taken as hard and fast), games and the theory of evolution occupy chapter
five, while chapter six is a system-theoretic view of minds and mechanisms and
seven deals with control: optimal, adaptive and anticipatory. Not surprisingly,
almost all the examples up to this point have been dynamical systems, but in 
chapter eight he introduces the geometrical concept of a simplicial complex as a 
tool in teasing out the inner structure of a natural system. The homology of such 
complexes leads to a natural definition of connectivity and this is illustrated by an 
analysis of the Middle East situation, Escher’s Sky and Water and a work of Piet 
Mondrian. There is a nice tie back into dynamical systems in a discussion of the 
structure of reachability and a range of discussion and other problems touching on 
everything from Death of a Salesman to elementary homology theory and
from the Betti numbers of the Middle East conflict to the food web of Nepenthes 
albomarginata. This chapter illustrates the strength and weakness of the work. One 
marvels at and is stimulated by Casti’s Bellmanesque fecundity, but the exposition 
of the underlying theory is necessarily so brief that even the copious references 
leave one wondering just where to start digging in the mathematical soil that holds 
promise of so interesting a harvest. 

In the final chapters of this long and challenging work, Casti tackles the truly 
philosophical question of how we know. First he raises the question of whether 
there are natural limits to reason, by looking at the properties of the Turing
machine and the import of Gödel’s Incompleteness Theorem, and finishes with a
chapter on the role of myths, models and paradigms. These are deep waters where 
few of us would claim to have any final insight, and where indeed such a claim 
would be viewed with suspicion. Casti does a good job of laying out several of the 
recent approaches and ends with the wise remark that there are no complete 
answers, adding the admonition that “the only rule in the Reality Game is to avoid
falling into that most common of all human delusions, the delusion of a single 
reality—our own!”

In summary, these volumes provide a timely and insightful overview of mathematical
modeling, rich in example and lively in presentation. They should go far to
ensure that the growing self-consciousness of the mathematical modeler is the 
eager self-edification of the honest artisan, rather than the empty self-congratula- 
tion of the bourgeois gentilhomme. 


Chemical Engineering and Mathematical Sciences 
University of Minnesota 
Minneapolis, MN 55455-0373 


Geometry of Surfaces. By John Stillwell. Springer-Verlag, New York, 1992. 


Reviewed by David L. Webb 


It is pretty generally recognized that modern Riemannian geometry is a subject 
which has come of age during this century, following the development of founda- 
tional ideas able to bear the weight of the intuitive geometric arguments pioneered 
by Riemann and others. It is also a subject whose influence is felt nearly 
everywhere, not merely in mathematics but also in physics. Many important 
physical theories are most naturally expressed in geometric terms: general relativ- 
ity relies upon Lorentzian geometry, classical mechanics is naturally formulated in 
terms of symplectic geometry, and modern gauge field theories make heavy use of 
the theory of connections on fibre bundles. The interactions with other parts of 
mathematics are equally striking: one well-known example is the classical Uniform- 
ization Theorem for Riemann surfaces and its 3-dimensional counterpart, 
Thurston’s visionary Geometrization program for understanding the topology of 
3-manifolds. 

However, despite its pivotal role in many areas of mathematics and physics, 
modern geometry is still not a standard part of the undergraduate curriculum at 
most institutions. Indeed, geometry tends to fare rather poorly at the undergradu- 
ate level: in many colleges and universities, if there is a regular geometry course, it 
is likely to be one intended for future secondary school teachers - a course chiefly 
concerned with questions of axiomatics, which treats non-Euclidean geometry from 
a “synthetic” viewpoint and perhaps introduces the Poincaré plane, but does little
to lay the groundwork for an understanding of the spectacular successes of modern 
Riemannian geometry. This is unfortunate, as there are a number of really 
excellent texts now available. Three particularly nice examples are Differential 
Geometry of Curves and Surfaces by Manfredo do Carmo (Prentice-Hall, 1976), 
Elementary Differential Geometry by Barrett O’Neill (Academic Press, 1966), and
Riemannian Geometry: A Beginner’s Guide by Frank Morgan (Jones and Bartlett,
1992). 

These observations raise two natural questions: Why does geometry tend to get 
such short shrift in the traditional undergraduate curriculum? And why is a new 
book needed? 

There are many answers to the first question. One obvious answer is that there 
is an intrinsic obstacle to doing geometry which always seems to rear its head: 
although the ideas are often clear, beautiful, and compelling, making them precise 
(and avoiding the mistakes which can easily arise from a free-wheeling use of 
“geometric intuition”) requires some machinery which seems far too sophisticated 
given the simplicity of the underlying ideas; this arises chiefly because one must 
introduce coordinates and hence must keep track of what happens when one 
changes the coordinates. (Anyone who has taught linear algebra frequently is 
aware of the difficulties students have in mastering the yoga of basis change and 
similarity, even in the rather temperate climate of a linear space; in the more 
forbidding climes of a nonlinear object such as a manifold, the pedagogical 
difficulties are that much worse.) A related but less obvious difficulty arises from 
the nature of the underlying objects: one is usually interested in intrinsic geometry, 
the study of spaces whose geometry arises from an intrinsic notion of distance 
rather than being “inherited” from the geometry of an ambient Euclidean space. 
In undertaking such a study, it is of course natural to try to set up the foundations 
in such a way that one can do calculus in a nonlinear space without relying upon 
an ambient linear space...but this takes some work: it leads one to introduce a 
panoply of objects (including abstract manifolds and vector bundles) which tend to 
bewilder the neophyte and to lend credence to the jocular pronouncement that 
differential geometry is the study of those properties which are invariant under 
change of notation. An alternative course is to begin with manifolds embedded in 
Euclidean space, then gradually phase the ambient space out of the picture 
(thereby, as it were, removing the Cheshire Cat and leaving only the grin remain- 
ing). This approach is perhaps sounder pedagogically (and is the one adopted in 
the three books cited above); however, it tends to distract one from the intrinsic 
character of the study. 

Stillwell’s book circumvents these difficulties by restricting its focus to a special 
case, but one which is quite interesting and leads to much beautiful classical 
geometry: the study of surfaces of constant curvature. (This is much easier, just as 
it is much easier to define what is meant by “Lebesgue measure zero” than to
define what is meant by “Lebesgue measure”.) The point of view is that of Klein’s
program of studying geometry via the study of groups of symmetries. The basic 
objects of study, rather than being surfaces lying in some Euclidean space from 
which they derive their geometry, are now quotients of the Euclidean plane, of the 
hyperbolic plane, or of the round 2-sphere by discrete groups of isometries acting 
without fixed points. In this approach, one is doing “intrinsic” geometry right at 
the outset, since these are naturally quotients rather than subobjects of familiar 
geometric spaces. Restricting to the case of constant curvature relieves one of the 
necessity of rounding up the usual suspects (vector fields, forms, frame fields, 
shape operators, connections, etc.) before embarking; one can easily formulate 
basic notions such as “geodesic” and “area” locally by lifting them to the familiar
geometry of the covering surface. The main role is thus played not by the 
machinery of calculus, but by that of group theory (note that the word “differen- 
tial” does not appear in the title). 




The book features leisurely and informative treatments of the three kinds of 
geometry considered: Euclidean (curvature = 0), spherical (curvature = 1), and 
hyperbolic (curvature = -1). In each case, one begins with a study of the isometries
of the simply connected space having the desired constant curvature and then
proceeds with the study of general surfaces by means of a study of the discrete 
subgroups of the group of isometries. Of the three, hyperbolic geometry is perhaps 
the most interesting, since it is the geometry of “most” Riemann surfaces, and the
connections with complex analysis are especially appealing; appropriately enough,
it gets the lion’s share of attention. The sections labeled “Discussion” are especially
noteworthy: they furnish brief heuristic introductions to some fascinating
connections with other areas, including elliptic functions, modular forms, and 
Riemann surfaces. Later on, the book includes some basic material about the 
topological classification of surfaces and the fundamental group. When pursuing 
the “Riemannian covering” approach to geometry adopted here, it is quite natural 
to inquire what happens if one discards the requirement that the groups of 
isometries act without fixed points. This leads to the idea of a Riemannian orbifold 
of constant curvature, a space which locally looks geometrically like the quotient of 
a simply connected space of constant curvature by a finite group of isometries; this
idea is introduced in the final chapter. It should also be noted that the book reads 
quite smoothly and is liberally furnished with helpful pictures. 

This summary suggests an answer to the second question: Why is a new book 
needed? One answer is that the approach taken here is significantly different from 
that of most available texts written at a comparable level, and it furnishes 
comparatively easy access to some very beautiful mathematics. One of the frustrat- 
ing things about teaching a one-quarter differential geometry course from one of 
the standard texts is that one scarcely has time to introduce the basic machinery 
before the term is over, and hence one never really gets very far into the geometric 
heart of the subject. For such a course, this book fills the bill admirably. Of course, 
there are inevitable drawbacks as well: the book neatly sidesteps the difficulties of 
teaching differential geometry by the rather brutal expedient of excising the 
“differential” to concentrate on the “geometry”, so that a student wishing to go
further (say, to study traditional Riemannian geometry or general relativity) will be 
ill-equipped to do so; such a student would be well-advised to peruse do Carmo, 
O’Neill, Morgan, or even one of the many more advanced treatments of the 
subject to gain familiarity with the requisite machinery. Thus Stillwell’s book 
should be viewed as complementary to (rather than as a substitute for) the 
standard treatments, and in this role it is a welcome contribution. Most important, 
however, this book is tremendously valuable as a reminder of why geometry is so 
captivating in the first place. 


Department of Mathematics 
Dartmouth College 
Hanover, NH 03755 


Answer to Picture Puzzle 


(p. 139) 
Alfred Haar. 
TELEGRAPHIC REVIEWS


Edited by Arnold Ostebee and Paul Zorn 


with the assistance of the Mathematics Departments of 
Carleton, Macalester, and St. Olaf Colleges 


Telegraphic Reviews are designed to alert readers in a timely manner to new books 
and computer software appropriate to mathematics teaching and research. Special 
codes classify reviews by subject area and appropriate use: 

T : Textbook
P : Professional Reading
S : Supplementary Reading
C : Computer Software
L : Undergraduate Library
13-18 : Grade Level
1-4 : Semester
** : Special Emphasis
?? : Questionable

Readers are advised that price information is subject to change. Selected books and
software packages receive a second, more extensive review in the Monthly.


Books and software submitted for review should be sent to Book Reviews Editor,
American Mathematical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield,
MN 55057-1098.


General, T(15-16: 1), S, P, L. Laws of the 
Game: How the Principles of Nature Gov- 
ern Chance. Manfred Eigen, Ruthild Winkler. 
Transl: Robert & Rita Kimber. Princeton Univ 
Pr, 1993, xv + 347 pp, $16.95 (P). [ISBN 
0-691-02566-5] From the Foreword: “Every- 
thing that happens in our world resembles a vast 
game in which nothing is determined in advance 
but the rules, and only the rules are open to ob- 
jective understanding.” BC 


Logic, P. Gödel’s Theorems: A Workbook on
Formalization. Verena Huber-Dyson. Teubner-
Texte zur Mathematik, B. 122. BG Teubner
Stuttgart, 1991, 292 pp, (P). [ISBN 3-8154-
2023-7] Detailed explication of Gödel’s Theorems
for mathematicians and philosophers.
Includes introduction to intuitionistic reasoning
and topos theory, examples from combinato- 
rial group theory. Exercises and essay sugges- 
tions. KES 

Logic, T(13-14: 1). Clear Thinking: An Invi- 
tation to Logic. Gary Jason. Jones & Bartlett, 
1992, x + 518 pp, $30 (P). [ISBN 0-86720- 
182-7] Text for introductory logic course (not 
mathematical logic). KES 


Foundations, T*(14: 1), S, L. Sets, Functions
and Logic: An Introduction to Abstract Mathe- 
matics, Second Edition. Keith Devlin. Chap- 
man & Hall, 1992, xi + 147 pp, $22.50 (P). 
[ISBN 0-412-45980-9] Readable, concise in- 
troduction to logic, sets, functions, relations, 
complex numbers. Good exercises; no so- 
lutions in book. (First Edition, TR, March 
1982.) KES 


Combinatorics, T(15-16: 2), L. Aspects of 
Combinatorics: A Wide-Ranging Introduction.
Victor Bryant. Cambridge Univ Pr, 1993, viii +
266 pp, $64.95. [ISBN 0-521-41974-3] Emphasizes
graph and transversal theory. Chapters
treat Latin squares, the marriage theorem,
rook polynomials, planar graphs, Ramsey the- 
ory; generating functions don’t appear. Exer- 
cises, hints, and solutions. LIC 


Discrete Mathematics, T*(13-14: 1). Dis- 
crete Mathematics, Second Edition. John A. 
Dossey, et al. HarperCollins College, 1993, 
xv + 555 pp, $40. [ISBN 0-673-46287-0] Revised
chapters on graphs, trees, recurrence; new
material on generating functions; supplemen- 
tary exercises and computer projects for each 
chapter. (First Edition, TR, August-September 
1987; Extended Review, February 1988.) KES 


Discrete Mathematics, T(13-14: 1), L. Discrete
Mathematics for Computing. John E.
Munro. Chapman & Hall, 1992, x + 306 pp, 
(P). [ISBN 0-412-45650-8] Concise text cov- 
ers integers, number representation, sets, func- 
tions, relations, logic, proof, recursion, analysis 
and correctness of algorithms, graphs and trees, 
counting, algebraic structures. Introduction to 
each chapter explains relevance of topic to com- 
puter science. Algorithms in English. KES 


Number Theory, T**(16-17: 2), L.  Intro- 
duction to Elliptic Curves and Modular Forms, 
Second Edition. Neal Koblitz. Grad. Texts 
in Math., V. 97. Springer-Verlag, 1993, x + 
248 pp, $49. [ISBN 0-387-97966-2] New 
edition of an exceptional introduction to elliptic
curves. Changes focus on new developments
concerning the Birch and Swinnerton-Dyer
conjectures. (First Edition, TR, April
1985.) Highly recommended. MPR 


Linear Algebra, T*(14: 1), L. Elementary 
Linear Algebra, Second Edition. Roland E. 
Larson, Bruce H. Edwards. DC Heath, 1991, 
xiv + 670 pp, $36 net. [ISBN 0-669-24592-5] 
Excellent applications, many exercises (some 
theoretical), historical notes. New chapters on 
complex spaces, linear programming. (First 
Edition, TR, December 1988.) CEC 


Algebra, T(17-18: 1, 2), L. Elementary Ab- 
stract Algebra. Lawrence E. Spence, Charles 
Vanden Eynden. HarperCollins College, 1993, 
xviii + 456 pp, $42. [ISBN 0-673-38583-3]
Starts with ring theory (elementary properties, 
subrings, direct sums, homomorphisms and iso- 
morphisms), then fields, integral domains, ex- 
tension fields; all before group theory. Worth a 
look. LC 


Calculus, T(13), L. Calculus. Deborah 
Hughes-Hallett, Andrew M. Gleason. Wi- 
ley, 1994, xviii + 685 pp, (P). [ISBN 0-471- 
31055-7] The “Harvard Consortium materi- 
als.” Now in attractive soft-cover form with 
the customary inexplicable photograph on the 
cover (a sky-diver, in this case). Only slightly 
less lean than in its preliminary edition (partial 
fractions and a glancing contact with the Mean 
Value Theorem have been added). Not isomor- 
phic to the usual calculus book: no epsilons and 
deltas, few proofs. Many standard topics (e.g., 
trig substitution) are missing, but the problems 
and the text give students much more support 
than usual in attaining an intuitive understand- 
ing of the ideas of calculus. Get the book, and 
think about the trade-offs yourself. JO 


Calculus, T(13-14: 3), C, L. Calculus Us- 
ing Mathematica. K.D. Stroyan. Academic 
Pr, 1993, xxv + 532 pp, $50 with disk, [ISBN 
0-12-672972-7]; Scientific Projects and Math- 
ematical Background, xi + 353 pp, (P). [ISBN 
0-12-672975-1] In four parts: core text, sci- 
ence projects, mathematical projects, and Math- 
ematica NoteBooks. Introduces calculus ideas 
through intriguing economics, science, and en- 
gineering applications. Use of Mathematica 
and student projects are essential for any course 
based on the text. Very interesting, albeit some- 
what nontraditional. AO 


Complex Analysis, T(17-18: 1, 2), P*. Nor- 
mal Families. Joel L. Schiff. Universi- 
text. Springer-Verlag, 1993, xii + 236 pp, 
$39 (P). [ISBN 0-387-97967-0] First book- 
length study of normal families of analytic func- 
tions since introduction in 1920’s by Paul Mon- 
tel. Relatively self-contained: covers basics, 
analytic theory, meromorphic theory, Bloch 
principle, current applications. Scope ranges
from early graduate through research levels. 
Many references; no exercises. PZ 


Differential Equations, S(14-15), C*. MacMath
9.2: A Dynamical Systems Software Package
for the Macintosh. John H. Hubbard, Beverly
H. West. Springer-Verlag, 1993, vii +
162 pp, $49.95 (P), with disk. [ISBN 0-387- 
94135-5] Software updated for System 7 op- 
erating system. (MacMath 9.0 owners may up- 
date to Version 9.2 by contacting authors). AO 


Differential Equations, P. Nonlinear Stokes 
Phenomena. Ed: Yu. S. Il’yashenko. Adv. in 
Soviet Math., V. 14. AMS, 1993, xiv + 287 pp, 
$116. [ISBN 0-8218-4112-2] 5 papers survey 
recent developments. 


Dynamical Systems, P. Encounter with 
Chaos: Self-Organized Hierarchical Complex- 
ity in Semiconductor Experiments. J. Peinke, 
et al. Springer-Verlag, 1992, x + 289 pp, $59. 
[ISBN 0-387-55647-8] Nonlinear behavior of 
low-temperature impact ionization breakdown 
in intrinsic germanium serves as experimental 
background for the development of chaotic dy- 
namics. Good for experimenters who want to 
learn more about mathematics of nonlinear dy- 
namics, and for mathematicians who want to 
learn about applications of chaos theory. SP 


Analysis, T(18: 1, 2), P. Difference Equa- 
tions and Inequalities: Theory, Methods, and 
Applications. Ravi P. Agarwal. Pure & Appl. 
Math., V. 155. Marcel Dekker, 1992, xiii + 
777 pp, $150. [ISBN 0-8247-8676-9] Com- 
prehensive treatment develops discrete versions 
of Rolle’s, mean value, Kneser’s theorems; Taylor’s
formula; l’Hôpital’s rule. SP


Analysis, T*(17: 2). Real and Functional 
Analysis, Third Edition. Serge Lang. Grad. 
Texts in Math., V. 142. Springer-Verlag, 1993, 
xiv + 580 pp, $49.50. [ISBN 0-387-94001- 
4] Successful text has been reorganized; inte- 
gration now precedes functional analysis. Be- 
gins with topics from point set topology, covers 
the usual topics in a solid real analysis course 
followed by a rigorous development of key 
ideas in functional analysis. Well-conceived, 
clearly presented. Nice problem sets through- 
out. (1983 edition published as Real Analysis, 
TR, November 1983.) TAV 


Algebraic Geometry, T(16-18: 1, 2), S, P, L.
Gröbner Bases: A Computational Approach to
Commutative Algebra. Thomas Becker, Volker
Weispfenning, Heinz Kredel. Grad. Texts in
Math., V. 141. Springer-Verlag, 1993, xxii +
574 pp, $49. [ISBN 0-387-97971-9] The theory
of Gröbner bases is an analogue, for multivariate
polynomials, of the Euclidean algorithm
for computing gcd’s. Advances in symbolic
computation have rendered eminently practical 
what used to be primarily of theoretic inter- 
est. BC 

Differential Geometry, T(17-18: 1), P. Ge- 
ometry and Spectra of Compact Riemann Sur- 
faces. Peter Buser. Progress in Math., V. 106. 
Birkhäuser, 1992, xiv + 454 pp, $69.50. [ISBN
0-8176-3406-1] First half introduces geom- 
etry of compact Riemann surfaces of genus 
greater than one based on hyperbolic geometry, 
cutting and pasting. Includes Fenchel-Nielsen, 
trigonometry, Bers’ partition theorem, and Teichmüller
space. Second half introduces the
Laplacian’s spectrum on such surfaces, and how 
the spectrum reflects their geometry. SP 


Geometry, T*(15-16), L. The Poincaré Half-
Plane: A Gateway to Modern Geometry. Saul
Stahl. Jones & Bartlett, 1993, xii + 298 pp, 
$41.75. [ISBN 0-86720-298-X] An excellent 
text on modern geometry. Mostly on hyperbolic
geometry in the upper half-plane, but also nicely 
overviews contrast between euclidean and non- 
euclidean geometry. Many exercises. MPR 


Optimization, T(16-17), L. Introduction to
the Calculus of Variations. Hans Sagan. Dover, 
1992, xvi + 449 pp, $12.95 (P). [ISBN 0-486-
67366-9] A re-issue of the 1969 McGraw-Hill
text (TR, August-September 1970; Extended 
Review, February 1971). Mathematically rig- 
orous introduction. First and second variations 
are thoroughly examined. The many and useful 
exercises treat both theoretical considerations 
and computationally feasible examples. SM 


Programming, T(13-14: 1). Programming
with Standard ML. Colin Myers, Chris Clack, 
Ellen Poon. Prentice Hall, 1993, x + 301 pp. 
[ISBN 0-13-722075-8] A practical introduc- 
tion to structured programming using a func- 
tional subset of Standard ML. AO 

Computer Systems, P. Proceedings: 7th An- 
nual X Technical Conference. Ed: Adrian Nye. 
X Resource, Issue 5. O’Reilly & Assoc, 1993, 
272 pp, $22.50 (P). [ISBN 1-56592-020-1] 


Artificial Intelligence, T(15-16: 2), C. Practical
Neural Network Recipes in C++. Timothy
Masters. Academic Pr, 1993, xviii + 493 pp, 
$44.95 (P). [ISBN 0-12-479040-2] A “cook- 
book” introduction to neural networks; assumes 
no special background. Stresses three-layer 
feed forward model. Local minimization of 
the error function is addressed via simulated 
annealing and genetic optimization. MPR 


Computer Science, T(14: 2), C. Going
from C to C++. Robert J. Traister. Academic
Pr, 1993, xiii + 188 pp, $34.95 (P).
[ISBN 0-12-697412-8] A nuts-and-bolts introduction
to C++ moves the reader briskly
to the point of writing object-oriented code. 
Appropriate for anyone already familiar with 
object-oriented programming, although con- 
ceptual differences between procedural and 
object-oriented languages are given little atten- 
tion. With disk. MPR 


Applications (Engineering), T(15-18: 4). Ad- 
vanced Engineering Mathematics. Dennis G. 
Zill, Michael R. Cullen. PWS-Kent, 1992, xv 
+ 1200 pp. [ISBN 0-534-92800-5] Six mod- 
ules: a standard ODE text; linear algebra and 
vector calculus; systems of differential equa- 
tions; Fourier series and boundary-value prob- 
lems; numerical analysis; complex analysis. 
Answers to odd-numbered problems. SP 


Applications (Fluid Dynamics), P. Annual 
Review of Fluid Mechanics, Volume 25, 1993. 
Eds: John L. Lumley, Milton Van Dyke, Helen 
L. Reed. Annual Reviews, 1993, x + 641 pp, 
$44, [ISBN 0-8243-0725-9] 


Applications (Physics), P, L. Wulff Construc- 
tion: A Global Shape from Local Interaction. R. 
Dobrushin, R. Kotecky, S. Shlosman. Transl. 
of Math. Mono., V. 104. AMS, 1992, ix + 
204 pp, $130. [ISBN 0-8218-4563-2] A sim- 
ple question with a complicated answer: What 
is the shape of a droplet? Authors use Wulff’s 
construction in a long proof of one theorem 
on the asymptotic shape of a two-dimensional 
droplet. MPR 


Applications (Physics), T(16-18: 1-3), S, P, 
L. Principles of Physical Cosmology. P.J.E. 
Peebles. Ser. in Physics. Princeton Univ Pr, 
1993, xviii + 718 pp, $29.95 (P). [ISBN 
0-691-01933-9] A comprehensive overview 
and a genuine tour-de-force. First section, 
the Development of Physical Cosmology, cov- 
ers basic topics: the expanding universe, back- 
ground radiation, steady state cosmology, etc. 
Next section, General Relativity and Cosmology,
is more technical but still readable. Final
section, Topics in Modern Cosmology, covers
several more esoteric areas. Chapter introductions
make the topics accessible to anyone
with a good background in undergraduate
physics. MU
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terson, Carleton; MPR: Matthew P. Richey, St. Olaf; KES: 
Kay E. Smith, St. Olaf; MU: Milton Ulmer, Carleton; TAV: 
Theodore A. Vessey, St. Olaf; PZ: Paul Zorn, St. Olaf. 
EXCURSIONS IN CALCULUS:
An Interplay of the Continuous
and the Discrete


Robert M. Young 


An excellent source of projects for well motivated 
students. This list of 463 references is a valuable 
aid for those who wish to dig deeper. _ CHOICE 


The presentation is clear and the topics very 
interesting...fully accessible to students for whom 
the book is intended. The book will be influential in 
awakening students' awareness for good classi- 
cal mathematics. —Paulo Ribenboim 


Printed with eight full-color plates.


The purpose of this book is to explore, within the 
context of elementary calculus, the rich and el- 
egant interplay that exists between the two main 
currents of mathematics, the continuous and the 
discrete. Such fundamental notions in discrete 
mathematics as induction, recursion, combinato- 
rics, number theory, discrete probability, and the 
algorithmic point of view as a unifying principle are 
continually explored as they interact with tradi- 
tional calculus. The interaction enriches both. 


The book is addressed primarily to well-trained 
calculus students and their teachers, but it can 
serve as a supplement in a traditional calculus 
course for anyone who wants to see more. 
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CONTENTS: 


• Infinite Ascent, Infinite Descent: The Principle
  of Mathematical Induction

• Patterns, Polynomials, and Primes: Three
  Applications of the Binomial Theorem

• Fibonacci Numbers: Function and Form

• On the Average

• Approximation: from Pi to the Prime Number
  Theorem

• Infinite Sums: A Potpourri


The problems, taken for the most part from 
probability, analysis and number theory, are an 
integral part of the text. Many point the reader 
toward further excursions. There are over 400 
problems presented in this book. 


408 pp., 1992, Paperbound 

ISBN 0-88385-317-5 

List: $39.00 MAA Member: $31.00 
Catalog Number DOL-13 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-800-331-1622 Fax (202) 265-2384 


TWO OF THE GREATEST


FOUR MORE GREATS


Calculus with Analytic 
Geometry, 5/e 
0-03-096800-3 1994 


Robert Ellis and Denny Gulick both of 
the University of Maryland, College Park 


This bestseller now combines early 
coverage of logarithmic and exponential 
functions with trigonometric functions. Its 
reputation for precise, error-free mathe- 
matics is enhanced by the addition of 
graphing calculator exercises and Topics 
for Discussion which develop students’ 
critical thinking/writing and group skills.


Calculus from Graphical, 
Numerical, and Symbolic Points 
of View 


Vol. I (1994 Prelim. Ed.) 0-03-098731-8 
Vol. II (1994 Prelim. Ed.) 0-03-098732-6 


Arnold Ostebee and Paul Zorn both of 
St. Olaf College 


Students are challenged to compare 
graphical, numerical and symbolic view- 
points of calculus. The text uses an early 
transcendental approach and provides 
more extensive coverage of differential 
equations. Appropriate use of technology 
— graphing calculators or computers — 
is stressed. 


Elementary Linear Algebra, 5/e
0-03-097354-6 1994 


Stanley I. Grossman, University of 
Montana and University College, London 


This new edition achieves a unique 
balance between theory and technique, 
and features Matlab® applications, and 
an emphasis on geometric interpretation 
to make linear algebra easier for students 
to grasp. 


Numerical Analysis 
0-03-098330-4 1994 


Vithal A. Patel, Humboldt State 
University 


Patel introduces methods and algorithms 
required to succeed in scientific comput- 
ing. This new text provides a solid 
theoretical foundation in numerical 
analysis enabling students to solve 
problems successfully and stimulate 
their interest in developing new numerical 
methods. Carefully developed computer 
exercises help students make the 
transition to writing their own computer 
problems. 

To order examination copies of 

these texts, contact your 

Saunders Representative or 

call 1-708-647-8822. 


Absolute Values in Math 


Saunders College Publishing / a division of Harcourt Brace College Publishers 
Public Ledger Building / 620 Chestnut Street, Suite 560 / Philadelphia, PA 19106-3477 


OLD AND NEW UNSOLVED 


PROBLEMS IN PLANE 
GEOMETRY AND 
NUMBER THEORY 


Victor Klee and Stan Wagon 


Many facts and problems to fascinate both on 
familiar and unfamiliar topics. It is compulsive 
reading and will fill your mind with problems that 
will come back to haunt you again and again 
during idle moments. 

—Mathematical Intelligencer 


The book will serve well as a point of entry for 

students who want to know more about cel- 

ebrated questions, or simply take in the vistas. 
—CHOICE 


This is a book that not only belongs in every 

university, college and high school library, it very 

definitely belongs in every public library. 
—Mathematical Reviews 


Part of the broad appeal of mathematics is that 
there are simply stated questions that have not 
yet been answered. These questions are plenti- 
ful in the areas of plane geometry and number 
theory, and the purpose of this book is to discuss 
some unsolved problems in these fields. Be- 
cause the central concepts of geometry and 
number theory are understood by everyone, many 
of the questions can be understood by readers 
with extremely little mathematical background. 


The authors place each problem in its historical 
and mathematical context. Each problem section 
is presented in two parts: The first gives an 
elementary overview discussing the history and 
both solved and unsolved variants of the prob- 
lem. Part Two contains more details, including a 
few proofs of related results, a wider and deeper 
survey of what is known about the problem and 
its relatives, and a large collection of references.
Both parts contain exercises, and solutions to 
the exercises are included. 


The book is aimed at both teachers and stu- 
dents of undergraduate mathematics, and at 
beginning graduate students. It could be used 
as a text in a course about unsolved problems, 
and also in courses in geometry or number 
theory. High school teachers interested in learn- 
ing about developments in modern mathemat- 
ics will find much of interest here. 


352 pp., Paperbound, 1991 

ISBN 0-88385-315-9

List: $26.00 MAA Member: $19.00 
Catalog Number DOL-11 


ORDER FROM: 


Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, DC. 20036 


NEW FROM McGRAW-HILL FOR 1994


A MATHEMATICAL JOURNEY, Second Edition
Stanley Gudder, University of Denver

ELEMENTARY ALGEBRA: STRUCTURE AND USE, Sixth Edition
INTERMEDIATE ALGEBRA: STRUCTURE AND USE, Fifth Edition
both by Raymond Barnett, Merritt College
Thomas Kearns, Northern Kentucky University

PREALGEBRA, First Edition
ESSENTIAL MATHEMATICS WITH APPLICATIONS, Second Edition
both by Lawrence A. Trivieri, DeKalb College

FINITE MATHEMATICS AND ITS APPLICATIONS, Second Edition
Stanley J. Farlow, University of Maine

DISCOVERING CALCULUS: A PRELIMINARY VERSION, Volumes I & II
Alan L. Levine & George M. Rosenstein, Jr.,
both of Franklin and Marshall College

FINITE MATHEMATICS WITH COMPUTER-SUPPORTED APPLICATIONS
James Calvert & William Voxman,
both of The University of Idaho

CONTEMPORARY STATISTICS: A COMPUTER APPROACH
Sheldon P. Gordon, Suffolk Community College
Florence S. Gordon, New York Institute of Technology

AN INTRODUCTION TO DIFFERENTIAL EQUATIONS AND THEIR APPLICATIONS
Stanley J. Farlow, University of Maine

ENGINEERING MATHEMATICS WITH MATHEMATICA
John S. Robertson, U.S. Military Academy


Also Available

CALCULUS AND ANALYTIC GEOMETRY, 5/e
Sherman K. Stein, University of California, Davis
Anthony Barcellos, American River College

CALCULUS LABORATORIES WITH MATHEMATICA, VOLS. I, II, III
Michael Kerckhove and Van Nall,
both of University of Richmond

DISCOVERING CALCULUS WITH THE TI-81 AND THE TI-85
DISCOVERING CALCULUS WITH THE CASIO fx-7700 AND fx-8700
DISCOVERING CALCULUS WITH THE HP-28 AND THE HP-48
Robert T. Smith, Millersville University of Pennsylvania
Roland B. Minton, Roanoke College


For more information, please contact your local McGraw-Hill representative or write: McGraw-Hill 
College Division, Comp Processing & Control, P.O. Box 448, Hightstown, New Jersey 08520-0448. 


LURE OF THE INTEGERS 


Joe Roberts 


A joy to read and ponder, this book is a welcome 
addition to the body of mathematical literature. It 
belongs in every mathematical library. 

—Journal of Recreational Mathematics 


Will enrich library collections serving curricula with 
theory of numbers courses. —Choice 


In some small way, this book is an introduction to 
a mythical book which might go under the name of 
The Book of Integers. This mythical book has on 
page n all of the interesting properties of the 
integer n. This introduction stems from many 
years’ casual accumulation of numerical facts. 
Most of the material presented belongs to elemen- 
tary mathematics in the sense that no deep or 
profound mathematical background is required in 
order to understand what is said. Much of the 
material is drawn from the theory of numbers. 


Many of the topics touch on contemporary re- 
search and most of the results are stated without 
proof. As a general rule, one cannot tell from the 
statements of the results whether or not their 
proofs will be elementary. Indeed, this is a hall- 
mark of mathematics and is one of the things that 
gives the subject a special flavor and interest. 
Until one knows that expert practitioners have 
been unable to solve a problem, one does not 
know that the problem is difficult. Even then it may 
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some of the material will be familiar to people 
having only a small acquaintance with mathemat- 
ics. Even in those cases, the author provides 
something new. On the other hand, much of the 
material is sufficiently out of the main stream of 
concern that even professional mathematicians 
may be unfamiliar with the results. The many 
references to the literature will almost always 
enable a reader to track down further information. 
In Lure of the Integers the author has presented 
a body of material which will prove interesting to 
the enlightened layman as well as to the profes- 
sional. 


300 pp., Paperbound, 1992 
ISBN 0-88385-502-X

List: $28.50 MAA Member: $19.50 
Catalog Number LURE 
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The Mathematical Association of America 
1529 Eighteenth Street, NW 
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Foreign Orders Please add $3.00 per item ordered 
to cover postage and handling fees. The order will be
sent via surface mail. If you want your order sent by 
air, we will be happy to send you a proforma invoice 
for your order. 
Symbolic Computation in
Undergraduate Mathematics 


Education 


Zaven Karian, Editor 


If you are considering putting a symbolic comput- 
ing system into your curriculum, this is one publi- 
cation you should have. 

—Mathematics Teacher 


This well-written book should be helpful to anyone 
using symbolic computation as an aid in teaching 
undergraduates—The book provides a number of 
examples for presenting probability and statistics 
in a way that removes the tedium and emphasizes
the underlying ideas. 

—AAAS, Science Book and Films 


If you have any plans to integrate symbolic computing
into your program, read and study this book
first. Your students will thank you for it.
—AMATYC Review 


This volume brings together many of the facets 
associated with the pedagogic uses of symbolic 
computation. 


Part I consists of articles that deal with general
issues of learning mathematics and the role of 
symbolic computation in that process. The articles 
in Part II describe the use of symbolic computation 
in teaching calculus. Some of the areas covered 
are the use of symbolic computation in a labora- 
tory calculus course, the uses of Derive in the 
instruction of calculus, antidifferentiation and the 
definite integral, and the experiences and reflections
of teachers who have used symbolic computation
in calculus instruction.


Part III consists of papers on sophomore-level
courses on linear algebra and differential equations.
The articles in Part IV describe what can be
done in using symbolic computation in teaching
combinatorics, probability and statistics courses.
The articles and references in Part V will help you
get started in using some of these ideas at your
own institution.


200 pp., 1992, Paperbound 
ISBN 0-88385-082-6 

List: $22.00 

Catalog Number NTE-24 


ORDER FROM: 


The Mathematical Association of America 
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Mathematical Cranks 


Underwood Dudley 


A delightful collection...It is hard to put down and
provides topics for an unending series of interesting
discussions. The organization and breadth of
the book are impressive, supported by a helpful 
index and a list of resources that encourage 
further explorations. A classic. —CHOICE 


A jewel... The most interesting book that I have
read this year. 
—Journal of Recreational Mathematics 


It's a gem...Dudley hasn't tossed out crank sub- 
missions over the years; he's saved them and 
collected samples from other mathematicians. 
And what wonderful samples they are. 
—Sunday Telegraph, Nashua, New Hampshire 


Mathematical Cranks is about people who think 
that they have done something impossible, like 
trisecting the angle, squaring the circle, duplicat- 
ing the cube, or proving Euclid’s parallel postulate; 
people who think they have done something that 
they have not, like proving Fermat’s Last Theo- 
rem, verifying Goldbach’s Conjecture, or finding a 
simple proof of the Four Color Theorem; people 
who have eccentric views, from mild (thinking we 
should count by 12s instead of 10s) to crazy 
(thinking that second-order differential equations 
will solve all problems of economics, politics, and 
philosophy); people who pray in matrices; people 
who find the American Revolution ruled by the 
number 57; people who have in common some- 
thing to do with mathematics and something odd, 
peculiar, or bizarre. 
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Cranks and their ideas come in great variety. The 
book is a collection of examples, designed to give 
readers an idea of what cranks do and how they do
it. Contemplating the odd, peculiar, or bizarre can 
be entertaining or enlightening. There can be no 
solution to the problem of mathematical cranks— 
obsessive people we will always have with us, and 
some will become obsessed with mathematics— 
but perhaps viewing the futility of their efforts will 
turn some prospective cranks toward more fruitful 
endeavors. 


This is a truly unique book, written with wit and 
style. 
COMMENTS


"In view of the speculation on the status of my work on the Taniyama-Shimura 
conjecture and Fermat's Last Theorem I will give a brief account of the situation. During 
the review process a number of problems emerged, most of which have been resolved, 
but one in particular I have not yet settled. The key reduction of (most cases of) the
Taniyama-Shimura conjecture to the calculation of the Selmer group is correct. However 
the final calculation of a precise upper bound for the Selmer group in the semistable case 
(of the symmetric square representation associated to a modular form) is not yet 
complete as it stands. I believe that I will be able to finish this in the near future using the 
ideas explained in my Cambridge lectures. 

"The fact that a lot of work remains to be done on the manuscript makes it still
unsuitable for release as a preprint. In my course in Princeton beginning in February I will
give a full account of this work."
Andrew Wiles (December, 1993). 


Well, this is a fine mess. When word flashed around the world last summer 
that Andrew Wiles had proved the Fermat Conjecture, mathematicians quickly 
took advantage of the spotlight. Articles about mathematics not only appeared 
in Science and Science News, they also appeared in Time magazine and the 
Bloomington Herald Times. Eric Zorn of the Chicago Tribune wrote a 
humorous column about the solution. Marilyn vos Savant wrote an article in
Parade Magazine. (The proof used "hyperbolic geometry", she said, so it was
questionable.) Important people (mainly in Washington) pointed out that this was 
a triumph of "useless" mathematics, which was supported by public funds for over 
40 years (and we need more). The rest of us, concerned with funding on a smaller 
scale, sold T-shirts with Fermat on the front. 

The excitement had a downside for editors. The usual trickle of Fermat 
proofs became a torrent. Covering letters scoffed at a proof in 200 pages; here's 
one in four! And the excitement encouraged those who trisect angles and square 
circles and prove well known results on the back of a postcard. Rejection letters 
for such manuscripts are polite but firm.

Now what do I tell those people? If Andrew Wiles made a mistake, why 
not the editor as well? What do we tell the readers of Time who thought 
mathematics dealt in absolute truths? What do we tell the congressman, who was 
ready to double funding for mathematics? (Well, maybe just for Number Theory.) 

We were carried away by the excitement. Andrew Wiles warned us --- the 
proof wasn't written down yet. Heady with public attention, we didn't listen. 

The January issue of the Monthly contained an article by David Cox on 
the Fermat Conjecture and its history. In this issue we publish another article on 
the mathematics behind the recent work of Wiles. Why another? Fernando 
Gouvêa wrote this article soon after the announcement by Wiles last summer, well
before anyone could check the details. Its purpose is to explain some of the lovely 
mathematics that underlies Wiles's attempted proof. In the coming months, as 
mathematicians debate the proof and its status, this mathematics becomes even 
more important. Learning some of the terms and ideas will make us better able 
to follow that debate . . . and perhaps more cautious about sensational 
pronouncements in the future. 

John Ewing 
“A Marvelous Proof”


Fernando Q. Gouvêa


No one really knows when it was that the story of what came to be known as 
“Fermat’s Last Theorem” really started. Presumably it was sometime in the late 
1630s that Pierre de Fermat made that famous inscription in the margin of 
Diophantus’ Arithmetic claiming to have found “a marvelous proof”. It seems now,
however, that the story may be coming close to an end. In June, 1993, Andrew
Wiles announced that he could prove Fermat’s assertion. Since then, difficulties
seem to have arisen, but Wiles’ strategy is fundamentally sound and may yet
succeed. 

The argument sketched by Wiles is an artful blend of various topics that have 
been, for years now, the focus of intensive research in number theory: elliptic 
curves, modular forms, and Galois representations. The goal of this article is to 
give mathematicians who are not specialists in the subject access to a general 
outline of the strategy proposed by Wiles. Of necessity, we concentrate largely on 
background material, first giving a brief description of the relevant topics and only
afterwards describing how they come together and relate to Fermat’s assertion.
Readers who are mainly interested in the structure of the argument and who do 
not need or want too many details about the background concepts may want to 
skim through Section 2, then concentrate on Section 3. Our discussion includes a 
few historical remarks, but history is not our main intention, and therefore we only 
touch on a few highlights that are relevant to our goal of describing the main ideas 
in Wiles’ attack on the problem. 

Thanks are due to Barry Mazur, Kenneth Ribet, Serge Lang, Noriko Yui, 
George Elliot, Keith Devlin, and Lynette Millett for their help and comments. 


1 PRELIMINARIES. We all know the basic statement that Fermat wrote in his 
margin. The claim is that for any exponent $n \geq 3$ there are no non-trivial integer
solutions of the equation $x^n + y^n = z^n$. (Here, “non-trivial” will just mean that
none of the integers x, y, and z is to be equal to zero.) Fermat claims, in his 
marginal note, to have found “a marvelous proof” of this fact, which unfortunately 
would not fit in the margin. 

This statement became known as “Fermat’s Last Theorem,” not, apparently,
due to any belief that the “theorem” was the last one found by Fermat, but rather 
due to the fact that by the 1800s all of the other assertions made by Fermat had 
been either proved or refuted. This one was the last one left open, whence the 
name. In what follows, we will adopt the abbreviation FLT for Fermat’s statement, 
and we will refer to $x^n + y^n = z^n$ as the “Fermat equation.”

The first important results relating to FLT were theorems that showed that 
Fermat’s claim was true for specific values of n. The first of these is due to Fermat 
himself: very few of his proofs were ever made public, but in one that was he shows 
that the equation

    $x^4 + y^4 = z^2$

has no non-trivial integer solutions. Since any solution of the Fermat equation with
exponent 4 gives a solution of this equation also, it follows that Fermat’s claim is
true for n = 4. 

Once that is done, it is easy to see that we can restrict our attention to the case 
in which n is a prime number. To see this, notice that any number greater than 2 is 
either divisible by 4 or by an odd prime, and then notice that we can rewrite an 
equation 


    $x^{mk} + y^{mk} = z^{mk}$
as
    $(x^m)^k + (y^m)^k = (z^m)^k$,
so that any solution for n = mk yields at once a solution for n = k. If n is not 
prime, we can always choose k to be either 4 or an odd prime, so that the problem 
reduces to these two cases. 
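
The reduction is easy to check mechanically. The following Python sketch is only an illustration and not part of the original argument; the names `is_prime` and `reduced_exponent` are hypothetical. Given an exponent n > 2, it returns a divisor k of n that is either 4 or an odd prime, so that a solution (x, y, z) for n = mk would yield the solution (x^m, y^m, z^m) for exponent k.

```python
def is_prime(m: int) -> bool:
    """Trial-division primality test, adequate for small illustrative inputs."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True


def reduced_exponent(n: int) -> int:
    """Return a divisor k of n with k == 4 or k an odd prime (requires n > 2)."""
    if n <= 2:
        raise ValueError("the reduction applies only to exponents n > 2")
    # Look for an odd prime divisor first.
    for k in range(3, n + 1, 2):
        if n % k == 0 and is_prime(k):
            return k
    # No odd prime divides n, so n is a power of 2; since n > 2, 4 divides n.
    return 4


if __name__ == "__main__":
    for n in (4, 8, 12, 15, 49):
        k = reduced_exponent(n)
        print(f"n = {n}: it suffices to treat exponent k = {k} (m = {n // k})")
```

The search prefers an odd prime when one exists, but any admissible divisor would serve equally well for the argument.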

In the 1750s, Euler became interested in Fermat’s work on number theory, and 
began a systematic investigation of the subject. In particular, he considered the 
Fermat equation for n = 3 and n = 4, and once again proved that there were no 
solutions. (Euler’s proof for n = 3 depends on studying the “numbers” one gets by
adjoining $\sqrt{-3}$ to the rationals, one of the first instances where one meets
“algebraic numbers.”) A good historical account of Euler’s work is to be found in
[Wei83]. In the following years, several other mathematicians extended this step by
step to n = 5, 7, .... A general account of the fortunes of FLT during this time
can be found in [Rib79]. 

Since then, ways for testing Fermat’s assertion for any specific value of n have 
been developed, and the range of exponents for which the result was known to be 
true kept getting pushed up. As of 1992, one knew that FLT was true for 
exponents up to 4,000,000 (by work of J. Buhler).

It is clear, however, that to get general results one needs a general method, i.e., 
a way to connect the Fermat equation (for any n) with some mathematical context 
which would allow for its analysis. Over the centuries, there have been many 
attempts at doing this; we mention only the two biggest successes (omitting quite a 
lot of very good work, for which see, for example, [Rib79]).

The first of these is the work of E. Kummer, who, in the mid-nineteenth 
century, established a link between FLT and the theory of cyclotomic fields. This 
link allowed Kummer to prove Fermat’s assertion when the exponent was a prime 
that had a particularly nice property (Kummer named such primes “regular’’). The 
proof is an impressive bit of work, and was the first general result about the 
Fermat equation. Unfortunately, while in numerical tests a good percentage of 
primes seem to turn out to be regular, no one has yet managed to prove even that 
there are infinitely many regular primes. (And, ironically, we do have a proof that 
there are infinitely many primes that are not regular.) A discussion of Kummer’s 
approach can be found in [Rib79]; for more detailed information on the cyclotomic 
theory, one could start with [Was82]. 

The second accomplishment we should mention is that of G. Faltings, who, in 
the early 1980s, proved Mordell’s conjecture about rational solutions to certain 
kinds of polynomial equations. Applying this to the Fermat equations, one sees 
that for any $n \geq 4$ one can have only a finite number of non-trivial solutions. Once
again, this is an impressive result, but its impact on FLT itself turns out to be
minor because we have not yet found a way to actually determine how many 
solutions should exist. For an introduction to Faltings’ work, check [CS86], which 
contains an English translation of the original paper. 

Wiles’ attack on the problem turns on another such linkage, also developed in 
the early 1980s by G. Frey, J.-P. Serre, and K. A. Ribet. This one connects FLT 
with the theory of elliptic curves, which has been much studied during all of this 
century, and thereby to all the machinery of modular forms and Galois representa- 
tions that is the central theme of Wiles’ work. The main goal of this paper is to 
describe this connection and then to explain how Wiles attempts to use it to prove 
FLT. 


Notation. We will use the usual symbols $\mathbb{Q}$ for the rational numbers and $\mathbb{Z}$ for the
integers. The integers modulo m will be written¹ as $\mathbb{Z}/m\mathbb{Z}$; we will most often
need them when m is a power of a prime number p. If p is prime, then $\mathbb{Z}/p\mathbb{Z}$ is a
field, and we commemorate that fact by using an alternative notation: $\mathbb{F}_p = \mathbb{Z}/p\mathbb{Z}$.


2 THE ACTORS. We begin by introducing the main actors in the drama. First, we 
briefly (and very informally) introduce the p-adic numbers. These are not so much 
actors in the play as they are part of the stage set: tools to allow the actors to do 
their job. Then we give brief and impressionistic outlines of the theories of Elliptic 
Curves, Modular Forms, and Galois Representations. 


2.1 p-adic Numbers. The p-adic numbers are an extension of the field of rational 
numbers which are, in many ways, analogous to the real numbers. Like the real 
numbers, they can be obtained by defining a notion of distance between rational 
numbers, and then passing to the completion with respect to that distance. For our 
purposes, we do not really need to know much about them. The crucial facts are: 


1. For each prime number p there exists a field $\mathbb{Q}_p$ which is complete with
   respect to a certain notion of distance and contains the rational numbers as
   a dense subfield.

2. Proximity in the p-adic metric is closely related to congruence properties
   modulo powers of p. For example, two integers whose difference is divisible
   by $p^n$ are “close” in the p-adic world (the bigger the n, the closer they are).

3. As a consequence, one can think of the p-adics as encoding congruence
   information: whenever one knows something modulo $p^n$ for every n, one
   can translate this into p-adic information, and vice-versa.

4. The field $\mathbb{Q}_p$ contains a subring $\mathbb{Z}_p$, which is called the ring of p-adic
   integers. (In fact, $\mathbb{Z}_p$ is the closure of $\mathbb{Z}$ in $\mathbb{Q}_p$.)
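
The second fact can be made concrete with a few lines of Python. This is a minimal sketch that assumes the standard normalization of the p-adic absolute value, $|x|_p = p^{-v_p(x)}$, where $v_p(x)$ is the exponent of p in the rational number x; the article itself does not spell out the distance, and the names `valuation` and `padic_distance` are hypothetical.

```python
from fractions import Fraction


def valuation(x, p: int) -> int:
    """Exponent of the prime p in the nonzero rational number x."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("the valuation of 0 is infinite")
    num, den = x.numerator, x.denominator
    v = 0
    while num % p == 0:   # powers of p in the numerator count positively
        num //= p
        v += 1
    while den % p == 0:   # powers of p in the denominator count negatively
        den //= p
        v -= 1
    return v


def padic_distance(a, b, p: int) -> Fraction:
    """p-adic distance |a - b|_p, assuming the normalization |x|_p = p**(-v_p(x))."""
    d = Fraction(a) - Fraction(b)
    if d == 0:
        return Fraction(0)
    return Fraction(1, p) ** valuation(d, p)


if __name__ == "__main__":
    # 1 and 126 differ by 125 = 5**3, so they are 5-adically closer
    # than 1 and 26, which differ by 25 = 5**2.
    print(padic_distance(1, 26, 5), padic_distance(1, 126, 5))
```

Exact rational arithmetic is used so that the distances come out as exact fractions rather than floating-point approximations.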


There is, of course, a lot more to say, and the reader will find it said in many
references, such as [Kob84], [Cas86], [Ami75], and even [Gou93]. The p-adic 
numbers were introduced by K. Hensel (a student of Kummer), and many of the 
basic ideas seem to appear, in veiled form, in Kummer’s work; since then, they 
have become a fundamental tool in number theory. 


¹Many elementary texts like to use $\mathbb{Z}_m$ as the notation for the integers modulo m; for us (and for
serious number theory in general), this notation is inconvenient because it collides with the notation for
the p-adic integers described below.
2.2 Elliptic Curves. Elliptic curves are a special kind² of algebraic curves which
have a very rich arithmetical structure. There are several fancy ways of defining 
them, but for our purposes we can just define them as the set of points satisfying a 
polynomial equation of a certain form. 

To be specific, consider an equation of the form 

    $y^2 + a_1 xy + a_3 y = x^3 + a_2 x^2 + a_4 x + a_6$,

where the $a_i$ are integers (there is a reason for the strange choice of indices on the
$a_i$, but we won’t go into it here). We want to consider the set of points (x, y) which
satisfy this equation. Since we are doing number theory, we don’t want to tie
ourselves down too seriously as to what sort of numbers x and y are: it makes
sense to take them in the real numbers, in the complex numbers, in the rational
numbers, and even, for any prime number p, in $\mathbb{F}_p$ (in which case we think of the
equation as a congruence modulo p). We will describe the situation by saying that
there is an underlying object which we call the curve E and, for each one of the
possible fields of definition for points (x, y), we call the set of possible solutions
the “points of E” over that field. So, if we consider all possible complex solutions,
we get the set $E(\mathbb{C})$ of the complex points of E. Similarly, we can consider the real
points $E(\mathbb{R})$, the rational points $E(\mathbb{Q})$, and even the $\mathbb{F}_p$-points $E(\mathbb{F}_p)$.
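
Over a finite field the set of solutions can simply be enumerated. The sketch below is an illustration under stated assumptions: it lists only the pairs (x, y) satisfying the congruence, as in the description above, and the function name `affine_points` and the sample curve $y^2 = x^3 - x$ are choices made here for the example.

```python
def affine_points(a1: int, a2: int, a3: int, a4: int, a6: int, p: int):
    """All pairs (x, y) in F_p x F_p satisfying
    y^2 + a1*x*y + a3*y = x^3 + a2*x^2 + a4*x + a6 (mod p)."""
    points = []
    for x in range(p):
        rhs = x**3 + a2 * x**2 + a4 * x + a6
        for y in range(p):
            lhs = y * y + a1 * x * y + a3 * y
            if (lhs - rhs) % p == 0:
                points.append((x, y))
    return points


if __name__ == "__main__":
    # Sample curve y^2 = x^3 - x, i.e. a4 = -1 and all other coefficients 0.
    for p in (3, 5, 7):
        pts = affine_points(0, 0, 0, -1, 0, p)
        print(f"p = {p}: {len(pts)} solutions: {pts}")
```

Brute force over all $p^2$ pairs is adequate only for small primes, but it makes the idea of "points over $\mathbb{F}_p$" tangible.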

We haven’t yet said when it is that such equations define elliptic curves. The 
condition is simply that the curve be smooth. If we consider the real or complex 
points, this means exactly what one would expect: the set of points contains no 
“singular” points, that is, at every point there is a well-defined tangent line. We 
know, from elementary analysis, that an equation f(x, y) = 0 defines a smooth 
curve exactly when there are no points on the curve at which both partial 
derivatives of f vanish. In other words, the curve will be smooth if there are no 
common solutions of the equations 


    $f(x, y) = 0, \quad \frac{\partial f}{\partial x}(x, y) = 0, \quad \frac{\partial f}{\partial y}(x, y) = 0.$


Notice, though, that this condition is really algebraic (the derivatives are deriva- 
tives of polynomials, and hence can be taken formally). In fact, we can boil it down 
to a (complicated) polynomial condition in the $a_i$. There is a polynomial $\Delta(E) =
\Delta(a_1, a_2, a_3, a_4, a_6)$ in the $a_i$ such that E is smooth if and only if $\Delta(E) \neq 0$. This
gives us the means to give a completely formal definition (which makes sense even
over $\mathbb{F}_p$). The number $\Delta(E)$ is called the discriminant of the curve E.


Definition 1. Let K be a field. An elliptic curve over K is an algebraic curve 
determined by an equation of the form 


$$y^2 + a_1 xy + a_3 y = x^3 + a_2 x^2 + a_4 x + a_6,$$


where each of the a_i belongs to K and such that Δ(a_1, a_2, a_3, a_4, a_6) ≠ 0. 


Specialists would want to rephrase that definition to allow other equations, 
provided that a well-chosen change of variables could transform them into equa- 
tions of this form. 


* Perhaps it’s best to dispel the obvious confusion right up front: ellipses are not elliptic curves. In 
fact, the connection between elliptic curves and ellipses is a rather subtle one. What happens is that 
elliptic curves (over the complex numbers) are the “natural habitat” of the elliptic integrals which arise, 
among other places, when one attempts to compute the arc length of an ellipse. For us, this connection 
will be of very little importance. 

It’s about time to give some examples. To make things easier, let us focus on the 
special case in which the equation is of the form y^2 = g(x), with g(x) a cubic 
polynomial (in other words, we’re assuming a_1 = a_3 = 0). In this case, it’s very 
easy to determine when there can be singular points, and even what sort of 
singular points they will be. If we put f(x, y) = y^2 − g(x), then we have 


$$\frac{\partial f}{\partial x}(x, y) = -g'(x) \quad \text{and} \quad \frac{\partial f}{\partial y}(x, y) = 2y,$$


and the condition for a point to be “bad” becomes 

$$y^2 = g(x), \qquad -g'(x) = 0, \qquad 2y = 0,$$

which boils down to y = g(x) = g′(x) = 0. In other words, a point will be bad 
exactly when its y-coordinate is zero and its x-coordinate is a double root of the 
polynomial g(x). Since g(x) is of degree 3, this gives us only three possibilities: 


• g(x) has no multiple roots, and the equation defines an elliptic curve; 
• g(x) has a double root; 
• g(x) has a triple root. 


Let’s look at one example of each case, and graph the real points of the 
corresponding curve. 

For the first case, consider the curve given by y^2 = x^3 − x. Its graph is in Figure 
1(a) (to be precise, this is the graph of its real points). A different example of the 
same case is given by y^2 = x^3 + x; see Figure 1(b). (The reason these look so 
different is that we are only looking at the real points of the curve; in fact, over the 
complex numbers these two curves are isomorphic.) 

[Figure 1. Real points of four cubic curves: (a) y^2 = x^3 − x; (b) y^2 = x^3 + x; (c) y^2 = x^3 + x^2, a node; (d) y^2 = x^3, a cusp.]

When there are “bad” points, what has happened is that either two roots of 
g(x) have “come together” or all three roots have done so. In the first case, we get 
a loop. At the crossing point, which is usually called a “node,” the curve has two 
different tangent lines. See Figure 1(c), where we have the graph of the equation 
y^2 = x^3 + x^2 (double root at zero). 

In the final case, not only have all three roots of g(x) come together, but also 
the two tangents in the node have come together to form a sort of “double 
tangent” (this can be made precise with some easy algebra of polynomials, but it’s 
more fun to think of it geometrically). The graph now looks like Figure 1(d), and 
we call this kind of singular point a “cusp.” 

How does all this relate to the discriminant Δ we mentioned above? Well, if 
r_1, r_2, and r_3 are the roots of the polynomial g(x), the discriminant for the 
equation y^2 = g(x) turns out to be 

$$\Delta = K (r_1 - r_2)^2 (r_1 - r_3)^2 (r_2 - r_3)^2,$$

where K is a constant. This does just what we want: if two of the roots are equal, it 
is zero, and if not, not. Furthermore, it is not too hard to see that Δ is actually a 
polynomial in the coefficients of g(x), which is what we claimed. In other words, 
all that the discriminant is doing for us is giving a direct algebraic procedure for 
determining whether there are singular points. 
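
The article does not write out the polynomial Δ. As a minimal sketch, the block below uses the standard textbook expression for the discriminant of a general cubic equation of the form considered here (the auxiliary names b2, b4, b6, b8 are conventional, but they are not introduced in this article) and applies it to the four curves of Figure 1.

```python
def discriminant(a1, a2, a3, a4, a6):
    """Discriminant of y^2 + a1*x*y + a3*y = x^3 + a2*x^2 + a4*x + a6."""
    # Conventional auxiliary quantities (assumed from standard references,
    # not defined in the article itself).
    b2 = a1 * a1 + 4 * a2
    b4 = 2 * a4 + a1 * a3
    b6 = a3 * a3 + 4 * a6
    b8 = a1 * a1 * a6 + 4 * a2 * a6 - a1 * a3 * a4 + a2 * a3 * a3 - a4 * a4
    return -b2 * b2 * b8 - 8 * b4 ** 3 - 27 * b6 * b6 + 9 * b2 * b4 * b6


# The four curves y^2 = g(x) of Figure 1, written as (a1, a2, a3, a4, a6).
figure_1 = {
    "y^2 = x^3 - x": (0, 0, 0, -1, 0),
    "y^2 = x^3 + x": (0, 0, 0, 1, 0),
    "y^2 = x^3 + x^2": (0, 1, 0, 0, 0),
    "y^2 = x^3": (0, 0, 0, 0, 0),
}

for name, coefficients in figure_1.items():
    delta = discriminant(*coefficients)
    kind = "smooth" if delta != 0 else "singular"
    print(f"{name}: discriminant {delta}, {kind}")
```

The two smooth curves give a nonzero value and the node and cusp both give zero, which is all the discriminant is asked to detect; it does not by itself say which kind of singular point occurs.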

While this analysis applies specifically to curves of the form y^2 = g(x), it 
actually extends to all equations of the sort we are considering: there is at most 
one singular point, and it is either a node or a cusp. 

One final geometric point: as one can see from the graphs, these curves are not 
closed. It is often convenient to “close them up.” This is done by adding one more 
point to the curve, usually referred to as “the point at infinity.” This can be done 
in a precise way by embedding the curve into the projective plane, and then taking 
the closure. For us, however, the only important thing is to remember that we 
actually have one extra point on our curves. (One should imagine it to be 
“infinitely far up the y-axis,” but keep in mind that there is only one “point at 
infinity” on the y-axis, so that it is also “infinitely far down.”) 

With some examples in hand, we can proceed to deeper waters. In order to 
understand the connection we are going to establish between elliptic curves and 
FLT, we need to review quite a large portion of what is known about the rich 
arithmetic structure of these curves. 

The first thing to note is that one can define an operation on the set of points of 
an elliptic curve that makes it, in a natural way, an abelian group. The operation is 
usually referred to as “addition.” The identity element of this group turns out to 
be the point at infinity (it would be more honest to say that we choose the point at 
infinity for this role). 

We won’t enter into the details of how one adds points on an elliptic curve. In 
fact, there are several equivalent definitions, each of which has its advantages! The 
reader should see the references for more details of how it is done (and the proof 
that one does get a group). The main thing to know about the definition, for now, 
is that it preserves the field of definition of the points: adding two rational points 
gives a rational point, and so on. 

What this means is that for every choice of a base field, we can get a group of 
points on the curve with coordinates in that field, so that in fact an elliptic curve 
gives us a whole bunch of groups, which are, of course, all related (though 
sometimes related in a mysterious way). So, given an E, we can look at the 
complex points E(C), which form a complex Lie group which is topologically a 
torus, or we can look at the real Lie group E(R), which turns out to be either 
isomorphic to the circle S^1 or to the direct product Z/2Z × S^1. (Look back at the 
examples above; can you see which is which?) 

From an arithmetical point of view, however, the most interesting of these 
groups is the group of rational points, E(Q). A point P ∈ E(Q) gives a solution in 
rational numbers of our cubic equation, and looking for such solutions is, of 
course, an example of solving a diophantine equation, a sort of problem that is 
quite important in number theory. What is especially nice about E(Q) is the fact, 
proved by L. Mordell (and extended by A. Weil) in the 1920s, that it is a finitely 
generated abelian group. What this means is just the following: there is a finite list 
of rational points on the curve (or, if one prefers, of rational solutions to the 
equation) such that every other rational solution is obtained by combining (using 
the addition law) these points with one another. These points are called the 
generators of the group E(Q), which is usually called the Mordell-Weil group of E. 

The curves we considered earlier have very simple Mordell-Weil groups. For the 
curve given by y^2 = x^3 − x (Figure 1(a)), it has four points; and for y^2 = x^3 + x 
(Figure 1(b)) it has two. It is easy, though, to give more interesting examples. 
Here is one, chosen at random from [Cre92]: if E is the curve defined by y^2 + y = 
x^3 − x^2 − 2x + 2, the Mordell-Weil group E(Q) is an infinite cyclic group, 
generated by the point (2, 1). 
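
The claims above about specific points are easy to spot-check. The following is only a sketch: the helper names are hypothetical, and a finite search over small integers can confirm that listed points lie on a curve but cannot prove that no other rational points exist.

```python
from fractions import Fraction


def on_curve(point, a1, a2, a3, a4, a6):
    """Check y^2 + a1*x*y + a3*y = x^3 + a2*x^2 + a4*x + a6 exactly."""
    x, y = (Fraction(t) for t in point)
    return y * y + a1 * x * y + a3 * y == x ** 3 + a2 * x * x + a4 * x + a6


def small_integer_points(coefficients, bound):
    """All integer points (x, y) on the curve with |x|, |y| <= bound."""
    return [
        (x, y)
        for x in range(-bound, bound + 1)
        for y in range(-bound, bound + 1)
        if on_curve((x, y), *coefficients)
    ]


# y^2 = x^3 - x: three affine points, plus the point at infinity, makes four.
print(small_integer_points((0, 0, 0, -1, 0), 20))

# y^2 = x^3 + x: one affine point, plus the point at infinity, makes two.
print(small_integer_points((0, 0, 0, 1, 0), 20))

# The generator (2, 1) quoted from [Cre92] lies on y^2 + y = x^3 - x^2 - 2x + 2.
print(on_curve((2, 1), 0, -1, 1, -2, 2))
```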

Of course, knowing that we have a finitely generated group raises the obvious 
question of estimating or computing the number of generators needed and of how 
one might go about actually finding these generating points. Both of these 
questions are still open, even though there are rather precise conjectures about 
what their answers should be. For many specific curves, both the number and the 
generators themselves have been completely worked out (see, for example, the 
tables in [Cre92]), but the general problem still seems quite difficult. 

A fundamental component of the conjectural plan for determining the 
generators is considering, for each prime number p, the reduction of our curve modulo p. 
The basic idea is quite simple: since our equation has integer coefficients, we can 
reduce it modulo p and look for solutions in the field F_p of integers modulo p. 
This should give a finite³ group E(F_p), whose structure should be easier to analyse 
than that of the big group E(Q). It’s a rather simple idea, but several 
complications⁴ arise. 

The main thing that can go wrong is that the reduction modulo p may fail to be 
an elliptic curve. That is actually very easy to see. To tell whether the curve is 
elliptic (that is, if it has no singular points), one needs to look at Δ. It is perfectly 
possible for Δ to be nonzero (so the curve over Q is elliptic) while being at the 
same time congruent to zero modulo p (so that the curve over F_p is singular). This 
phenomenon is called bad reduction, and it is easy to come up with examples. One 
might take p = 5, and look at the curve y^2 = x^3 − 5. This turns out to be an 
elliptic curve over Q, but its reduction modulo 5 is going to have a cusp. One says, 
then, that the curve has bad reduction at 5. In fact, the discriminant turns out to be 
Δ = −10800, which is clearly divisible by 2, 3, and 5, so that the curve has bad 
reduction at each of these. (In each case, it’s easy to verify that the reduced curve 
has a cusp.) 


³It is finite because, apart from the point at infinity, there are only p^2 possible points. In fact, the 
maximum possible number of points is smaller than that, but that fact takes some proving. 

⁴It may seem a bit perverse to dwell on the nature of these complications, but it will turn out that 
we need to have at least some understanding of how this goes later on. 
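
The bad primes of y^2 = x^3 − 5 can be found mechanically. The sketch below (function names are hypothetical) factors the discriminant by trial division and then searches, modulo each prime, for points where the curve equation and both partial derivatives vanish, which is the smoothness criterion stated earlier. It only detects singular points; deciding whether one is a node or a cusp needs a further check that is not attempted here.

```python
def prime_divisors(n):
    """Distinct primes dividing the nonzero integer n, by trial division."""
    n = abs(n)
    primes = []
    d = 2
    while d * d <= n:
        if n % d == 0:
            primes.append(d)
            while n % d == 0:
                n //= d
        d += 1
    if n > 1:
        primes.append(n)
    return primes


def singular_points(A, B, p):
    """Singular points of y^2 = x^3 + A*x + B over the integers modulo p."""
    found = []
    for x in range(p):
        for y in range(p):
            lies_on_curve = (y * y - (x ** 3 + A * x + B)) % p == 0
            fx_vanishes = (3 * x * x + A) % p == 0  # g'(x) = 0 modulo p
            fy_vanishes = (2 * y) % p == 0          # 2y = 0 modulo p
            if lies_on_curve and fx_vanishes and fy_vanishes:
                found.append((x, y))
    return found


print(prime_divisors(-10800))
for p in (2, 3, 5, 7, 11):
    print(p, singular_points(0, -5, p))
```

Each of 2, 3, and 5 yields exactly one singular point, while 7 and 11 yield none, in line with the discussion above.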

We want to classify the possible types of reduction, but there is one further 
glitch that we have to deal with before we can do so. To see what it is, consider the 
curve y^2 = x^3 − 625x. At first glance, it seems even worse than the first, and the 
discriminant, which turns out to be Δ = 15625000000, looks very divisible by 5. 
But look what we can do: let’s change variables by setting x = 25u = 5^2 u and 
y = 125v = 5^3 v. Then our equation becomes 


$$(5^3 v)^2 = (5^2 u)^3 - 625 (5^2 u),$$


which simplifies to 

$$5^6 v^2 = 5^6 u^3 - 5^6 u,$$

and hence to 


$$v^2 = u^3 - u,$$


which is not only a nice elliptic curve, but has good reduction at 5. In other words, 
this example shows that curves which are isomorphic over Q can have very different 
reductions modulo p. 
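
The change of variables can be checked numerically. In this small sketch (function names are hypothetical), the "residual" of each equation is its left side minus its right side; the substitution x = 25u, y = 125v should multiply the residual of the new equation by exactly 5^6, so that one residual vanishes precisely when the other does.

```python
def original_residual(x, y):
    """Left side minus right side of y^2 = x^3 - 625x."""
    return y * y - (x ** 3 - 625 * x)


def reduced_residual(u, v):
    """Left side minus right side of v^2 = u^3 - u."""
    return v * v - (u ** 3 - u)


def substitution_is_consistent(bound=10):
    """Check the scaling identity on a grid of integer values of u and v."""
    return all(
        original_residual(25 * u, 125 * v) == 5 ** 6 * reduced_residual(u, v)
        for u in range(-bound, bound + 1)
        for v in range(-bound, bound + 1)
    )


print(substitution_is_consistent())
```

A grid check of this kind is evidence rather than proof, but here both sides are polynomials of low degree in u and v, so agreement on a 21 by 21 grid already forces them to agree identically.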

It turns out that among all the possible equations for our curve, one can choose 
an equation that is minimal, in the sense that its discriminant will be divisible by 
fewer primes than the discriminant for other equations. Since the primes that 
divide the discriminant are the primes of bad reduction, a minimal equation will 
have reduction properties that are as good as possible. When studying the 
reduction properties of the curve, then, one must also pass to such a minimal 
equation (and there are algorithms to do this). 

Well, then, suppose we have done so, and have an elliptic curve E given by a 
minimal equation. Then we can classify all prime numbers into three groups: 


• Primes of good reduction: those which do not divide the discriminant of the 
minimal equation. The curve modulo p is an elliptic curve, and we have a 
group E(F_p). 

• Primes of multiplicative reduction: those for which the curve modulo p has a 
node. If the singular point is (x_0, y_0), it turns out that the set E(F_p) − 
{(x_0, y_0)} has a group structure, and is isomorphic to the multiplicative group 
F_p − {0}. 

• Primes of additive reduction: those for which the curve modulo p has a cusp. 
If the singular point is (x_0, y_0), the set E(F_p) − {(x_0, y_0)} once again has a 
group structure, and is isomorphic to the additive group F_p. 


No curve can have good reduction everywhere, so there will always be some bad 
primes, but the feeling one should get is that multiplicative reduction is somehow 
not as bad as additive reduction. There are various technical reasons for this, 
which we don’t really need to go into. Instead, we codify the information about the 
reduction types of the curve into a number, called the conductor of the curve. We 
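define the conductor to be a product N = ∏ p^{n(p)}, where 


$$n(p) = \begin{cases} 0 & \text{if } E \text{ has good reduction at } p, \\ 1 & \text{if } E \text{ has multiplicative reduction at } p, \\ \geq 2 & \text{if } E \text{ has additive reduction at } p. \end{cases}$$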


(The exact value of n(p) for the case of additive reduction depends on some rather 
subtle properties of the reduction modulo such primes; most of the time, the 
exponent is 2.) The result is that one can tell, by looking at the conductor, exactly 
what the reduction type of E at each prime is. 

The elliptic curves we will want to consider are those whose reduction proper- 
ties are as good as possible. Since good reduction at all primes is not possible, we 
opt for the next best thing: good reduction at almost all primes, multiplicative 
reduction at the others. Such curves are called semistable: 


Definition 2. An elliptic curve is called semistable if all of its reductions are either 
good or multiplicative. Equivalently, a curve is semistable if its conductor is square- 
free. 


A crucial step in the application of Wiles’ theorem to FLT will be verifying that 
a certain curve is semistable. Just to give us some reference points, let’s look at a 
few examples. 


1. Let E_1 be the curve y^2 = x^3 − 5, which we considered above. One checks 
that this equation is minimal, and that the curve has additive reduction at 
2, 3, and 5, so that it is not semistable. The conductor turns out to be equal 
to 10800 (essentially, the same as the discriminant!). 

2. Let E_2 be the curve y^2 + y = x^3 + x. This has multiplicative reduction at 7 
and 13 (checking this makes a nice exercise) and good reduction at all other 
primes. Hence, E_2 is semistable and its conductor is 91. 

3. Let E_3 be the curve y^2 = x^3 + x^2 + 2x + 2 (which is minimal). This has 
discriminant Δ = −1152 = −2^7 · 3^2, so that the bad primes are 2 and 3. It 
turns out that the reduction is multiplicative at 3 and additive at 2, and the 
conductor is 384; the curve is not semistable. 

4. The main example for the purpose at hand: Let a, b, and c be relatively 
prime integers such that a + b + c = 0. Consider the curve E_{a,b,c} whose 
equation⁵ is y^2 = x(x − a)(x + b). Depending on what a, b, and c are, this 
equation may or may not be minimal, so let’s make the additional 
assumptions that a ≡ −1 (mod 4) and that b ≡ 0 (mod 32). In this case, the 
equation is not minimal. A minimal equation for this curve turns out to be 

$$y^2 + xy = x^3 + \frac{b - a - 1}{4} x^2 - \frac{ab}{16} x,$$

which we get by the change in variables x → 4x, y → 8y + 4x. One can 
then compute that the discriminant is Δ = a^2 b^2 c^2/256 (not surprising: a 
constant times the product of the squares of the differences of the roots of 
the original cubic), and that the curve is semistable. The primes of bad 
reduction are those that divide abc (this would be easy to see directly from 
the equation, by checking when there is a multiple root modulo p), and 
therefore the conductor is equal to the product of the primes that divide 
abc: 

$$N = \prod_{p \mid abc} p$$

(this number is sometimes called the radical of abc). We will be using 
curves of the form E_{a,b,c} (for very special a, b, and c) when we make the link 
with FLT. 


⁵It may strike the reader as funny that c is absent from the equation. Keep in mind, however, that c 
is completely determined by a and b, so that it is really not as absent as all that. The crucial point is 
that the roots of the cubic on the right hand side are 0, a, and −b, so that the differences of the roots 
are (up to sign) exactly a, b, and c. 
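
To make the last example concrete, here is a small sketch that evaluates the formulas quoted above for one admissible triple. The choice a = 3, b = 32, c = −35 is purely illustrative (it is not taken from the article), the helper names are hypothetical, and the discriminant and conductor are computed from the stated formulas rather than verified independently. The same squarefree test is applied to the conductors 91, 384, and 10800 of the earlier examples.

```python
from math import gcd


def prime_factorization(n):
    """Map each prime dividing the nonzero integer n to its exponent."""
    n = abs(n)
    factors = {}
    d = 2
    while d * d <= n:
        while n % d == 0:
            factors[d] = factors.get(d, 0) + 1
            n //= d
        d += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def radical(n):
    """Product of the distinct primes dividing n."""
    result = 1
    for p in prime_factorization(n):
        result *= p
    return result


def is_squarefree(n):
    """True when no prime divides n more than once."""
    return all(exponent == 1 for exponent in prime_factorization(n).values())


# Illustrative triple (an assumption, not from the article).
a, b = 3, 32
c = -(a + b)
assert a % 4 == 3 and b % 32 == 0          # a = -1 (mod 4), b = 0 (mod 32)
assert gcd(a, b) == 1 and gcd(a, c) == 1 and gcd(b, c) == 1

minimal_discriminant = (a * b * c) ** 2 // 256   # formula quoted in the text
conductor = radical(a * b * c)                   # formula quoted in the text

print(minimal_discriminant, conductor, is_squarefree(conductor))
for N in (91, 384, 10800):
    print(N, "squarefree" if is_squarefree(N) else "not squarefree")
```

Since a radical is squarefree by construction, any curve whose conductor is given by this formula passes the semistability test of Definition 2 automatically.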


We need a final bit of elliptic curve theory. It is interesting to look at the 
number of points in the groups E(F_p) as p ranges through the primes of good 
reduction for E. Part of the motivation for this is the reasoning that if the group 
E(Q) is large (i.e. there are many rational solutions), one would expect that for 
many choices of the prime p many of the points in E(Q) would survive reduction 
modulo p, so that the group E(F_p) would be large. Therefore, one would like to 
make some sort of conjecture that said that if the E(F_p) are very large for many 
primes p, then the group E(Q) will be large. 

Elaborating and refining this idea leads to the conjecture of Birch and Swinner- 
ton-Dyer, which we won’t get into here. But even this coarse version suggests that 
the variation of the size of E(F_p) as p runs through the primes should tell us 
something about the arithmetic on the curve. To “encode” this variation, we start 
by observing that the (projective) line over F_p has exactly p + 1 points (the p 
elements of F_p, plus the point at infinity). We take this as the “standard” number 
of points for a curve over F_p, and, when we look at E(F_p), record how far from the 
standard we are. To be precise, given an elliptic curve E and a prime number p at 
which E has good reduction, we define a number a_p by the equation 


$$\#E(\mathbf{F}_p) = p + 1 - a_p.$$


For primes of bad reduction, we extend the definition in a convenient way; it turns 
out that we get a_p = ±1 when the reduction is multiplicative (with a precise rule 
to decide which) and a_p = 0 when it is additive. 
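
For small primes of good reduction, the numbers a_p can be computed straight from the definition by counting solutions. The sketch below does this by brute force (function names are hypothetical; real software uses far faster methods), for the curve y^2 = x^3 − x of Figure 1(a) and for the curve y^2 + y = x^3 + x of the earlier examples.

```python
def count_points(a1, a2, a3, a4, a6, p):
    """Number of points on the curve modulo p, including the point at infinity."""
    count = 1  # the point at infinity
    for x in range(p):
        for y in range(p):
            lhs = y * y + a1 * x * y + a3 * y
            rhs = x ** 3 + a2 * x * x + a4 * x + a6
            if (lhs - rhs) % p == 0:
                count += 1
    return count


def a_p(a1, a2, a3, a4, a6, p):
    """The number a_p defined by #E(F_p) = p + 1 - a_p (p of good reduction)."""
    return p + 1 - count_points(a1, a2, a3, a4, a6, p)


# y^2 = x^3 - x has discriminant 64, so every odd prime is good.
for p in (3, 5, 7, 13):
    print(p, count_points(0, 0, 0, -1, 0, p), a_p(0, 0, 0, -1, 0, p))

# y^2 + y = x^3 + x has bad reduction only at 7 and 13, so 2 is a good prime.
print(2, a_p(0, 0, 1, 1, 0, 2))
```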

The usual way to “record” the sequence of the a_p is to use them to build a 
complex analytic function called the L-function of the curve E. It then is natural to 
conjecture that this L-function has properties similar to those of other L-functions 
that arise in number theory, and that one can read off properties of E from 
properties of its L-function. This is a huge story which we cannot tell in this 
article, but which is really very close to some of the issues which we do discuss 
later on. Suffice it to say, for now, that we get a function 


$$L(E, s) = \sum_{n=1}^{\infty} \frac{a_n}{n^s},$$

where the a_p are exactly the same as the ones we just introduced, the a_n are 
determined from the a_p by “Euler product” expansion for the L-function, and the 
series can be shown to converge when Re(s) > 3/2. The L-function is conjectured 
to have an analytic continuation to the whole complex plane and to satisfy a 
certain functional equation. 
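
The article does not spell out how the a_n follow from the a_p. The sketch below assumes the standard shape of the Euler product: the factor at a good prime p is 1/(1 − a_p p^(−s) + p^(1−2s)), and at a bad prime it is 1/(1 − a_p p^(−s)). Expanding these gives a_{mn} = a_m a_n for coprime m and n, a_{p^(k+1)} = a_p a_{p^k} − p a_{p^(k−1)} at good primes, and a_{p^k} = (a_p)^k at bad primes. The function names are hypothetical, and the curve y^2 = x^3 − x is used with its only bad prime, 2, assigned a_2 = 0 on the assumption that the reduction there is additive.

```python
def count_points(A, B, p):
    """Number of points of y^2 = x^3 + A*x + B modulo p, including infinity."""
    count = 1
    for x in range(p):
        rhs = (x ** 3 + A * x + B) % p
        count += sum(1 for y in range(p) if (y * y) % p == rhs)
    return count


def prime_power_coefficient(p, k, a_p, good):
    """a_{p^k} for k >= 1, from the assumed Euler factor at p."""
    previous, current = 1, a_p  # a_{p^0} and a_{p^1}
    for _ in range(k - 1):
        correction = p * previous if good else 0
        previous, current = current, a_p * current - correction
    return current


def coefficient(n, A, B, bad_a_p):
    """a_n for y^2 = x^3 + A*x + B; bad_a_p maps each bad prime to its a_p."""
    result, p = 1, 2
    while n > 1:
        if n % p == 0:
            k = 0
            while n % p == 0:
                n //= p
                k += 1
            if p in bad_a_p:
                result *= prime_power_coefficient(p, k, bad_a_p[p], good=False)
            else:
                a_p = p + 1 - count_points(A, B, p)
                result *= prime_power_coefficient(p, k, a_p, good=True)
        p += 1
    return result


# First coefficients of the L-function of y^2 = x^3 - x (assumed a_2 = 0).
print([coefficient(n, -1, 0, {2: 0}) for n in range(1, 26)])
```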

It is time to introduce the other actors in the play and to explain how they relate 
to elliptic curves. The reader who would like to delve further into this theory has a 
lot to choose from. As an informal introduction, one could look at J. Silverman’s 
article [Sil93], which relates elliptic curves to “sums of two cubes” and Ramanujan’s 
taxicab number. Various introductory texts are available, including [Cas91], [Hus87], 
[Kna92], [Sil86], and [ST92]. Each of these has particular strengths; the last is 
intended as an undergraduate text. In addition to these and other texts, the 
interested reader might enjoy looking at symbolic manipulation software that will 
handle elliptic curves well. Such capabilities are built into Gp-PARI and SIMATH, and 
can be added to Mathematica by using Silverman’s EllipticCurveCalc package 
(which is what we used for most of the computations in this paper), and to Maple 
by using Connell’s Apecs package. See [C *], [Z *], [SvM], [Con]. 


2.3 Modular Forms. Modular forms start their lives as analytic objects (or, to be 
more honest, as objects of group representation theory), but end up playing a very 
intriguing role in number theory. In this section, we will very briefly sketch out 
their definition and explain their relation to elliptic curves. 

Let 𝔥 = {x + iy | y > 0} be the complex upper half-plane. As is well known (and, 
in any case, easy to check), matrices in SL_2(Z) act on 𝔥 in the following way. If 
γ ∈ SL_2(Z) is the matrix 

$$\gamma = \begin{pmatrix} a & b \\ c & d \end{pmatrix}$$

(so that a, b, c, and d are integers and ad − bc = 1), and z ∈ 𝔥, we define 

$$\gamma \cdot z = \frac{az + b}{cz + d}.$$

It is easy to check that if z ∈ 𝔥 then γ·z ∈ 𝔥, and that γ_1·(γ_2·z) = (γ_1γ_2)·z. 
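
Both "easy to check" facts can be tried out numerically. The following sketch (helper names are hypothetical; matrices are stored as nested tuples) applies two matrices of determinant 1 to a sample point of the upper half-plane and compares the two sides of the composition rule in floating-point arithmetic.

```python
def act(gamma, z):
    """Image of z under the matrix gamma = ((a, b), (c, d))."""
    (a, b), (c, d) = gamma
    return (a * z + b) / (c * z + d)


def multiply(g1, g2):
    """Ordinary product of two 2 x 2 matrices."""
    (a1, b1), (c1, d1) = g1
    (a2, b2), (c2, d2) = g2
    return (
        (a1 * a2 + b1 * c2, a1 * b2 + b1 * d2),
        (c1 * a2 + d1 * c2, c1 * b2 + d1 * d2),
    )


translation = ((1, 1), (0, 1))   # z -> z + 1
inversion = ((0, -1), (1, 0))    # z -> -1/z
z = 0.3 + 2.0j                   # a sample point with positive imaginary part

step_by_step = act(translation, act(inversion, z))
all_at_once = act(multiply(translation, inversion), z)

print(step_by_step, all_at_once)
print(act(inversion, z).imag > 0, abs(step_by_step - all_at_once) < 1e-12)
```

Behind the first fact is the identity Im(γ·z) = Im(z)/|cz + d|^2, which holds because ad − bc = 1.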

We want to consider functions on the upper half-plane which are “as invariant 
as possible” under this action, perhaps when restricted to a smaller group. The 
subgroups we will need to consider are the “congruence subgroups” which we get 
by adding a congruence condition to the entries of the matrix. Thus, for any 
positive integer N, we want to look at the group 

$$\Gamma_0(N) = \left\{ \gamma = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in SL_2(\mathbf{Z}) \;\middle|\; c \equiv 0 \pmod{N} \right\}.$$

We are now ready to begin defining modular forms. They will be functions 
f: 𝔥 → C, holomorphic, which “transform well” under one of the subgroups Γ_0(N). 
To be specific, we require that there exist an integer k such that 

$$f\!\left(\frac{az + b}{cz + d}\right) = (cz + d)^k f(z).$$


Applying this formula to the special case in which the matrix is 

$$\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$$

shows that any such function must satisfy f(z + 1) = f(z), and hence must have a 
Fourier expansion 

$$f(z) = \sum_{n=-\infty}^{\infty} a_n q^n, \quad \text{where } q = e^{2\pi i z}.$$

We require that this expression in fact only involve non-negative powers of q (and 
in fact we extend that requirement to a finite number of other, similar, expansions, 
which the experts call the “Fourier expansions at the other cusps”). A function 
satisfying all of these constraints is called a modular form of weight k on Γ_0(N). 
The number N is usually called the level of the modular form f. 

We will need to consider one special subspace of the space of modular forms of 
a given weight and level. Rather than having a Fourier expansion with non-nega- 
tive powers only, we might require positive powers only (in the main expansion and 
in the ones “at the other cusps”). We call such modular forms cusp forms; they 
turn out to be the more interesting part of the space of modular forms. 

Finally, one must make a remark on the relation between the theory at various 
levels: if N divides M, then every form of level N (and weight k) gives rise to (a 
number of) forms of level M (and the same weight). The subspace generated by all 
forms of level M and weight k which arise in this manner (from the various 
divisors of M) is called the space of old forms of level M. With respect to a 
natural inner product structure on the space of modular forms, one can then take 
the orthogonal complement of the space of old forms. This complement is called 
the space of new forms, which are the ones we will be most interested in. 

What really makes the theory of modular forms interesting for arithmetic is the 
existence of a family of commuting operators on each space of modular forms, 
called the Hecke operators. We will not go into the definition of these operators 
(they are quite natural from the point of view of representation theory); for us the 
crucial things will be: 


• For each positive integer n relatively prime to the level N, there is a Hecke 
operator T_n acting on the space of modular forms of fixed weight and level N. 

• The Hecke operators commute with each other. 

• If m and n are relatively prime, then T_{mn} = T_m T_n. 


We will be especially interested in modular forms which are eigenvectors for the 
action of all the Hecke operators, i.e., forms for which there exist numbers λ_n such 
that T_n(f) = λ_n f for each n which is relatively prime to the level. We will call 
such forms eigenforms. 

This is all quite strange and complicated, so let’s immediately point out one 
connection between modular forms and elliptic curves. Suppose one has a modular 
form which is 


• of weight 2 and level N, 
• a cusp form, 
• new, 
• an eigenform. 


If that is the case, one can normalize the form so that its Fourier expansion looks 
like 


$$f(z) = \sum_{n=1}^{\infty} a_n q^n \quad \text{with } a_1 = 1.$$

Suppose that, once we have done the normalization, 

• all of the Fourier coefficients a_n are integers. 


Then there exists an elliptic curve whose equation has integer coefficients, whose 
conductor is N, and whose a_p are exactly the ones that appear in the Fourier 
expansion of f. In particular, the L-function of E can be expressed in terms of f 
(as a Mellin transform), and the nice analytic properties of f then allow us to 
prove that the L-function does have an analytic continuation and does satisfy a 
functional equation. 

This connection between forms and elliptic curves is so powerful that it led 
people to investigate the matter further. The first one to suggest that every elliptic 
curve should come about in this manner was Y. Taniyama, in the mid-fifties. The 
suggestion only penetrated the mathematical culture much later, largely due to the 
work of G. Shimura, and it was made more precise by A. Weil’s work pinning 
down the role of the conductor. We now call this the “Shimura-Taniyama-Weil 
Conjecture.” Here it is: 


Conjecture 1 (Shimura-Taniyama-Weil). Let E be an elliptic curve whose equation 
has integer coefficients. Let N be the conductor of E, and for each n let a_n be the 
number appearing in the L-function of E. Then there exists a modular form of weight 
2, new of level N, an eigenform under the Hecke operators, and (when normalized) 
with Fourier expansion equal to ∑ a_n q^n. 


For any specific curve, it is not too hard to check that this is true. One takes E, 
determines the conductor and the a_n for a range of n. Since the space of modular 
forms of weight 2 and level N is finite-dimensional, knowing enough of the a_n 
must determine the form, and we can go and look if it is there. (In general, given a 
list of a_n it is not at all easy to determine whether ∑ a_n q^n is the Fourier expansion 
of a modular form, so we need to do it the other way: we generate a basis of the 
space of modular forms, then try to find our putative form as a linear combination 
of the basis.) If we find a form with the right (initial chunk of) Fourier expansion, 
this gives prima facie evidence that the curve satisfies the STW conjecture. To 
clinch the matter, one can use a form of the Čebotarev density theorem to show 
that if enough (in an explicit sense) of the a_n are right, then they all are. 

This method has been used to verify the STW conjecture for any number of 
specific curves (see, for example, [Cre92]). The conjecture has a really crucial role 
in the theory of elliptic curves; in fact, curves that satisfy the conjecture are known 
as “modular elliptic curves,” and many of the fundamental new results in the 
theory have only been proved for curves that have this property. 

As our final remark on modular forms, we point out that it is possible, for any 
given N, to determine (essentially using the Riemann-Roch theorem) the exact 
dimension of the space of cusp forms of weight 2 and level N. This gives us a very 
good handle on what curves of that conductor should exist (if the STW conjecture 
is true). 

For more information on modular forms, one might look at [Lan76], [Miy89], or 
[Shi71]. There is an intriguing account of the Shimura-Taniyama-Weil conjecture, 
in a very different spirit, in Mazur’s article [Maz91], and a useful survey in [Lan91]. 


2.4 Galois Representations. The final actors in our play are Galois representa- 
tions. One starts with the Galois group of an extension of the field of rational 
numbers. To understand this Galois group, one can try to “represent” the 
elements of the group as matrices. In other words, one can try to find a vector 
space on which our Galois group acts, which gives a way to associate a matrix to 
each element of the group. This in fact gives a group homomorphism from the 
Galois group to a group of matrices (this need not be injective; when it is, one calls 
the representation “faithful”). 

Rather than work with specific finite extensions of Q, we work with the Galois 
group G = Gal(Q̄/Q) of the algebraic closure Q̄ of Q. This is a huge group (which 
one makes more manageable by giving it a topology) that hides within itself an 
enormous amount of arithmetic information. The representations we will be 
considering will be into 2 × 2 matrices over various fields and rings, and they will 
(for the most part) be obtained from elliptic curves and from modular forms. 

To see how to get Galois representations from an elliptic curve, let’s start with 
an elliptic curve E, whose equation has coefficients in Z. Choose a prime p. Since 
the (complex, say) points of E form a group, one can look in this group for points 
which are of order p (that is, for points (x, y) such that adding them to themselves 
p times gives the identity). It turns out that (over C) there are p^2 such points, and 
they form a subgroup which we denote by E[p]. In fact, this group is isomorphic to 
the product of two copies of F_p: 


$$E[p] \cong \mathbf{F}_p \times \mathbf{F}_p.$$


Now, the points in E[p] are a priori complex, but on closer look one sees that in 
fact they are all defined over some extension of Q, and in particular that 
transforming the coordinates of a point of order p by the Galois group G yields 
another point of order p. In fact, it’s even better than that: since the rule for 
adding points is defined in rational terms, the whole group structure is preserved. 
Since E[p] looks like a vector space of dimension 2 over F_p, this means that each 
element of G acts as a linear transformation on this space, and hence that we get a 
representation 


$$\bar\rho_{E,p} : G \to GL_2(\mathbf{F}_p).$$


(We use a bar to remind ourselves that this is a representation “modulo p.”) 
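
A small worked case may help here; it is a standard illustration rather than something taken from the article, and it assumes the usual addition law, under which the inverse of a point (x, y) on a curve y^2 = g(x) is (x, −y). With that assumption, a point is its own inverse exactly when y = 0, so for p = 2 the group consists of the point at infinity O together with the three points whose x-coordinates are the roots r_1, r_2, r_3 of g:

$$E[2] = \{ O, (r_1, 0), (r_2, 0), (r_3, 0) \} \cong \mathbf{F}_2 \times \mathbf{F}_2.$$

This is the count p^2 = 4 predicted above. For y^2 = x^3 − x the roots 0, 1, and −1 are all rational, so every element of G fixes each of these points and the representation modulo 2 is trivial. For y^2 = x^3 − 5 the roots are not all rational, and the representation modulo 2 records how G permutes them.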

Now, GL_2(F_p) is a finite group, and G is very infinite, so this representation, 
while it tells us a lot, can’t be the whole story. It turns out, however, that we can 
use p-adic numbers to get a whole lot more. Instead of considering only the points 
of order p, we can consider points of order p^n for each n. This gives a whole 
bunch of subgroups 


$$E[p] \subset E[p^2] \subset E[p^3] \subset \cdots$$


and a whole bunch of representations, into GL_2(F_p), then into GL_2(Z/p^2 Z), then 
into GL_2(Z/p^3 Z), and so on. Putting all of these together ends up by giving us a p-adic 
representation 


$$\rho_{E,p} : G \to GL_2(\mathbf{Q}_p)$$


which hides within itself all of the others. The representations ρ_{E,p} contain a lot of 
arithmetic information about the curve E. 

And how does it look on the modular forms side? Well, it follows from the work 
of several mathematicians (M. Eichler, G. Shimura, P. Deligne, and J.-P. Serre) 
that, whenever we have a modular form f (of any weight) which is an eigenform 
for the action of the Hecke operators and whose Fourier coefficients (after 
normalization) are integers, we can construct a representation 


$$\rho_{f,p} : G \to GL_2(\mathbf{Q}_p)$$


which is attached to f in a precise sense which is too technical to explain here. 
(The construction of the representation is quite difficult, and in fact no satisfactory 
expository account is yet available.) 

The crucial thing to know, for our purposes, is that when an elliptic curve E 
arises from a modular form f, then the representations ρ_{E,p} and ρ_{f,p} are the same. 
In fact, a converse is also true: given a curve E, if one can find a modular form f 
such that ρ_{f,p} is the same as ρ_{E,p}, then E will be modular. 


3 THE PLAY. We are now ready to take the plunge and try to see how all of this 
theory relates to Fermat’s Last Theorem. The idea is to assume that FLT is false, 
and then, using this assumption, to construct an elliptic curve that contradicts just 
about every conjecture under the sun. 


3.1 Linking FLT to Elliptic Curves. So let’s start by assuming FLT is false, i.e., 
that there exist three non-zero integers u, v, and w such that $u^p + v^p + w^p = 0$ 
(as we know, we only need to consider the case of prime exponent p, which is 
therefore odd, so that we can recast a solution in Fermat’s form to be in the form 
above). Since we know that the theorem is true for p = 3, we might as well assume 
that $p \ge 5$. We may clearly assume that u, v, and w are relatively prime, which 
means that precisely one of them must be even. Let’s say v is even. Since p is 
bigger than two, we can see, by looking at the equation modulo 4, that one of u 
and w must be congruent to −1 modulo 4, and the other must be congruent to 1. 
Let’s say $u \equiv -1 \pmod 4$. 
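
The residue bookkeeping in this paragraph can be checked by brute force. The following is only a sketch in Python (the function name is made up for illustration): with u and w odd and v even, it lists which pairs of residues of u and w modulo 4 are compatible with the equation modulo 4.

```python
def odd_residue_pairs_mod4(p):
    """Pairs (u mod 4, w mod 4), with u and w odd, for which
    u^p + v^p + w^p = 0 (mod 4) is possible for some even v."""
    pairs = set()
    for u in (1, 3):              # odd residues mod 4
        for w in (1, 3):
            for v in (0, 2):      # even residues mod 4
                if (pow(u, p, 4) + pow(v, p, 4) + pow(w, p, 4)) % 4 == 0:
                    pairs.add((u, w))
    return pairs

for p in (5, 7, 11, 13):
    print(p, sorted(odd_residue_pairs_mod4(p)))
```

For each odd exponent tried, the only surviving pairs are (1, 3) and (3, 1), that is, one of u and w is congruent to 1 and the other to −1 modulo 4.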

Let’s use this data to build an elliptic curve, following an idea due to G. Frey 
(see [Fre86], [Fre87a], [Fre87b]). We consider the curve 


$$y^2 = x(x - u^p)(x + v^p).$$


This is usually known as the Frey curve. Following our discussion, above, of the 
curves $E_{A,B}$, we already know quite a bit about the Frey curve. Here’s a summary: 


1. Since v is even and $p \ge 5$, we know that we have $v^p \equiv 0 \pmod{32}$. We also 
know that $u^p \equiv -1 \pmod 4$. This puts us in the right position to use what 
we know about curves $E_{A,B}$. 

2. The minimal discriminant of the Frey curve is 


$$\Delta = \frac{(uvw)^{2p}}{256}.$$


3. The conductor of the Frey curve is the product of all the primes dividing 
$u^p v^p w^p$, which is, of course, the same as the product of all the primes 
dividing uvw. 

4. The Frey curve is semistable. 
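
Where does the exponent 2p come from? Here is a small sketch in Python, using the standard formula for the discriminant of a curve $y^2 = x^3 + a_2 x^2 + a_4 x + a_6$ in terms of its b-invariants. The function names are mine, and since no actual solution of the Fermat equation is available, the check is run on arbitrary integers A and B in the curve $y^2 = x(x - A)(x + B)$.

```python
def discriminant(a2, a4, a6):
    """Discriminant of y^2 = x^3 + a2*x^2 + a4*x + a6 (the case a1 = a3 = 0)."""
    b2, b4, b6 = 4 * a2, 2 * a4, 4 * a6
    b8 = 4 * a2 * a6 - a4 * a4
    return -b2 * b2 * b8 - 8 * b4 ** 3 - 27 * b6 ** 2 + 9 * b2 * b4 * b6

def discriminant_of_x_xA_xB(A, B):
    # x(x - A)(x + B) = x^3 + (B - A) x^2 - A*B x
    return discriminant(B - A, -A * B, 0)

for A, B in [(3, 5), (-7, 32), (11, 64)]:
    # compare with the closed form 16 * A^2 * B^2 * (A + B)^2
    print(A, B, discriminant_of_x_xA_xB(A, B) == 16 * A**2 * B**2 * (A + B)**2)
```

With $A = u^p$ and $B = v^p$ we have $A + B = -w^p$, so the closed form becomes $16(uvw)^{2p}$. This differs from the minimal discriminant quoted above by a factor $2^{12}$, which is the kind of factor a change of variables can remove.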


Now, as Frey observed in the mid-1980s, this curve seems much too strange to 
exist. For one thing, its conductor is extremely small when compared to its 
discriminant (because of that exponent of 2p). For another, its Galois representa- 
tions are pretty weird. Very soon, people were pointing out that there were several 
conjectures that would rule out the existence of Frey’s curve, and therefore would 
prove that Fermat was correct in saying that his equation had no solutions. 


3.2 FLT follows from the Shimura-Taniyama-Weil Conjecture. It was already 
clear to Frey that it was likely that the existence of his curve would contradict the 
Shimura-Taniyama-Weil conjecture, but he was unable to give a solid proof of this. 
A few months after Frey’s work, Serre pinpointed, in a letter to J.-F. Mestre, 
exactly what one would need to prove to establish the link. In this letter (published 
as [Ser87a]), Serre describes the situation with the phrase “STW + ε implies 
Fermat.” Because of this, the missing theorem became known, for a while, as 
“conjecture epsilon.” This conjecture was proved by K. A. Ribet in [Rib90] about a 
year later, and this established the link. A survey of these results can be found in 
[Lan91]. 

What Serre noticed was that the representation modulo p 


$$\bar{\rho}_{E,p}: G \to \mathrm{GL}_2(\mathbb{F}_p)$$


obtained from the Frey curve was rather strange. It looked like the sort of 
representation one would get from a modular form of weight 2, but if one applied 
the “usual recipe” for guessing the level of that modular form, the answer came 
out to be N = 2. He also showed that the modular form must be a cusp form. The 
problem is that there are no cusp forms of weight 2 and level 2! 

So suppose there is a solution of the Fermat equation for some prime p, and 
use this solution to build a Frey curve E. Let N be the conductor of E (which we 
determined above). Suppose, also, that STW holds for E, so that there exists a 
modular form of weight 2 and level N whose Galois representation is the same as 
the one for E. Then we have the following curious situation: we have a representation 
$\rho$ which we know comes from a modular form of weight 2 and level N, but 
which looks as if it should come from a modular form of smaller level. 

Here is where Ribet’s theorem comes in: he proves that (under certain hypothe- 
ses which will hold in our case) whenever this happens the modular form of 
smaller level must actually exist! Notice that this doesn’t mean that the original 
modular form came from lower level; what it means is that there is a form of lower 
level whose representation reduces modulo p to the same representation. 

The upshot of Ribet’s theorem is the following: 


Theorem 1 (Ribet). Suppose STW holds for all semistable elliptic curves. Then FLT 
is true. 


This is true because if FLT were false, one could choose a solution of the 
Fermat equation and use it to construct a Frey curve, which would be a semistable 
elliptic curve. By STW, this curve would be attached to a modular form, so that its 
Galois representation is attached to a modular form. By Ribet’s theorem, there 
must exist a modular form of weight 2 and level 2 which gives the same representa- 
tion modulo p. Just a little more work allows one to check that this modular form 
must be a cusp form. But this is a contradiction, because there are no cusp forms 
of weight 2 and level 2. 
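
The last step, that there are no cusp forms of weight 2 and level 2, can be made concrete. Assuming that "level N" refers to the group $\Gamma_0(N)$, the dimension of the space of weight 2 cusp forms equals the genus of the modular curve $X_0(N)$, and there is a classical formula for that genus. The sketch below (Python; helper names are mine) simply evaluates the formula.

```python
from fractions import Fraction
from math import gcd

def prime_factors(n):
    ps, d = [], 2
    while d * d <= n:
        if n % d == 0:
            ps.append(d)
            while n % d == 0:
                n //= d
        d += 1
    if n > 1:
        ps.append(n)
    return ps

def euler_phi(n):
    result = n
    for p in prime_factors(n):
        result = result // p * (p - 1)
    return result

def genus_X0(N):
    """Genus of X_0(N) = dimension of the weight 2 cusp forms on Gamma_0(N)."""
    ps = prime_factors(N)
    mu = N                                   # index of Gamma_0(N)
    for p in ps:
        mu = mu // p * (p + 1)
    nu2 = 0 if N % 4 == 0 else 1             # elliptic points of order 2
    nu3 = 0 if N % 9 == 0 else 1             # elliptic points of order 3
    for p in ps:
        nu2 *= 1 + (0 if p == 2 else (1 if p % 4 == 1 else -1))
        nu3 *= 1 + (0 if p == 3 else (1 if p % 3 == 1 else -1))
    cusps = sum(euler_phi(gcd(d, N // d)) for d in range(1, N + 1) if N % d == 0)
    g = 1 + Fraction(mu, 12) - Fraction(nu2, 4) - Fraction(nu3, 3) - Fraction(cusps, 2)
    assert g.denominator == 1
    return int(g)

print([(N, genus_X0(N)) for N in (2, 11, 37)])
```

The value at N = 2 is 0, which is the emptiness used in the proof; N = 11 is the smallest level at which a weight 2 cusp form appears.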


3.3 Deforming Galois Representations. It is now that we come to Wiles’ work. His 
idea was that one can attack the problem of proving STW by using the Galois 
representations, and in particular by thinking of “deformations” of Galois 
representations. The idea is to consider not only a representation modulo p, but also all 
the possible p-adic representations attached to it (one speaks of “all the possible 
lifts” of the representation modulo p). These can be thought of as “deformations” 
because, from the p-adic point of view, they are “close” to the original representation. 

This sort of idea had been introduced by B. Mazur in [Maz89]. Mazur showed 
that one could often obtain a “universal lift,” i.e., a representation into $\mathrm{GL}_2$ of a 
big ring such that all possible lifts were “hidden” in this representation. If one 
knew that the representation modulo p were modular, then one could make 
another big ring “containing” all the lifts which are attached to modular forms. 
The abstract deformation theory then provides us with a homomorphism between 
these two rings, and one can try to prove that this is an isomorphism. If so, it 
follows that all lifts are modular. 

What Wiles proposes to do is very much in this spirit, except that he restricts 
himself to lifts that have especially nice properties. He starts with a representation 
modulo p, and supposes that it is modular and that it satisfies certain technical 
assumptions. Then he considers all possible deformations which “look like they 
could be attached to forms of weight 2,” and gets a deformation ring. Considering 
all deformations which are attached to modular forms of weight 2 gives a second 
ring (which is closely related to the algebra generated by the Hecke operators, in 
fact). Wiles then attempts to prove, using a vast array of recent results, including 
ideas of Mazur, Ribet, Faltings, V. Kolyvagin, and M. Flach, that these two rings 
are the same. 

It is not hard to see that the homomorphism between the two rings we want to 
consider is surjective. The difficulty is to prove it is also injective. Wiles reduces 
this question to bounding the size of a certain cohomology group. It is here that 
the brilliant ideas of Kolyvagin and of Flach come in. About five years ago, 
Kolyvagin came up with a very powerful method for controlling the size of certain 
cohomology groups, using what he calls “Euler systems” (see [Kol91] and the 
survey of the method in [Maz93]). This method seems to be adaptable to any 
number of situations, and has been used to prove several important recent results. 
The initial breakthrough showing how one could begin to use Kolyvagin’s method 
in our context is due to Flach (see [Fla92]), who found a way to construct 
something that can be thought of as the beginning of an Euler system applicable to 
our situation. Wiles called on all these ideas to construct a “geometric Euler 
system” which plays a central role in the argument. (It is at this point that the 
current difficulty lies.) 

From the bound on the cohomology group one will get a proof that the two 
rings are in fact isomorphic. Translated back to the language of representations, 
this means that if one starts with a representation modulo p which satisfies Wiles’ 
technical assumptions (and is modular), then any lift of the kind Wiles considers is 
also modular. 


3.4 Put it all together.... Assume, then, that one can prove that all lifts of a 
modular representation are still modular. Now suppose we have an elliptic curve E 
whose representation modulo p we can prove (by some means) to be modular. 
Suppose also that this representation satisfies Wiles’ technical assumptions. Then 
any lift of this representation is modular. But the p-adic representation $\rho_{E,p}$ 
attached to E is one such lift! It follows that this representation is modular, and 
hence that E is modular. 

All we need, now, is to prime the pump: we must find a way to decide that the 
representation modulo p is modular, and then use that to clinch the issue. What 
Wiles does is quite beautiful. 

First of all, he takes a semistable elliptic curve, and looks at the Galois 
representation modulo 3 attached to this curve. At this point, there are two 
possibilities. The representation, as we pointed out above, amounts to an action of 
the Galois group on the vector space $\mathbb{F}_3 \times \mathbb{F}_3$. Now, it may happen that there is a 
subspace of that vector space which is invariant under every element of the Galois 
group. In that case, one says that the representation is reducible. If not, it is 
irreducible. 

One has to be just a little more careful. Just as it sometimes happens that a real 
matrix has complex eigenvalues, it can happen that the invariant subspace only 
exists after we enlarge the base field. We will say a representation is absolutely 
irreducible when this does not happen: even over bigger fields, there is no invariant 
subspace. 

Well, look at $\bar{\rho}_{E,3}$. It may or may not be absolutely irreducible. If it is, Wiles 
calls upon a famous theorem of J. Tunnell, based on work of R. P. Langlands (see 
[Tun81], [Lan80]) to show that it is modular, and hence, using the deformation 
theory, that the curve is modular. 

If $\bar{\rho}_{E,3}$ is not absolutely irreducible, Wiles shows that there is another elliptic 
curve which has the same representation modulo 5 as our initial curve, but whose 
representation modulo 3 is absolutely irreducible. By the first case, it is modular. 
Hence, its representation modulo 5 is modular. But since this is the same as the 
representation modulo 5 attached to our original curve, we can apply the deforma- 
tion theory for p = 5 to conclude that our original curve is modular. 

If Wiles’ strategy is successful, we get: 


Theorem 2. The Shimura-Taniyama-Weil conjecture holds for any semistable elliptic 
curve. 


And, since the Frey curve is semistable, 


Corollary 1. For any $n \ge 3$, there are no non-zero integer solutions to the equation 


$$x^n + y^n = z^n.$$


Of course, this is just one corollary of the proof of the STW conjecture for 
semistable curves, and it is certain that there will be many others still. For 
example, as Serre pointed out in [Ser87b], one can apply Frey’s ideas to many 
other diophantine equations that are just as hard to handle as Fermat’s. These are 
equations that are closely related to the Fermat equation, of the form 


$$x^p + y^p = M z^p,$$


where p is a prime number and M is some integer. From Serre’s argument and 
Wiles’ result, one gets something like this: 


Corollary 2. Let p be a prime number, and let M be a power of one of the following 
primes: 


3, 5, 7, 11, 13, 17, 19, 23, 29, 53, 59. 


Suppose that $p \ge 11$ and that p does not divide M. Then there are no nonzero integer 
solutions of the equation $x^p + y^p = M z^p$. 


The proof is precisely parallel to what we have done before: given a solution, 
construct a Frey curve, and consider the resulting modular form. Apply Ribet’s 
theorem to lower its level, and then study the space of modular forms of that level 
to see if the form predicted by Ribet is there. If there is no such form, there can be 
no solution. 

In fact, one can even get a general result, as Mazur pointed out: 


Corollary 3. Let M be a power of a prime number $\ell$, and assume that $\ell$ is not of 
the form $2^n \pm 1$. Then there exists a constant $C_\ell$ such that the equation 
$x^p + y^p = M z^p$ has no nonzero solutions for any $p \ge C_\ell$. 


However successful they may be in the end at proving the Shimura-Taniyama-Weil 
conjecture, Wiles’ new ideas are certain to have enormous impact. 


Triangulating the Circle, at Random 


David Aldous 


1. INTRODUCTION. In a wonderful article [9] in this journal 38 years ago, 
George Pólya discussed combinatorial questions concerning triangulations of the 
n-gon. In particular, the number of triangulations of the n-gon is given by the 
(n − 1)st Catalan number $c_{n-1}$, where 


$$c_m = \frac{(2m-2)!}{(m-1)!\,m!}. \qquad (1)$$


One of the interesting aspects of Pólya’s paper is that it exposed readers to his 
newly developed theory of “figurate series”. We wish to consider the idea of 
letting $n \to \infty$ and studying triangulations of the ∞-gon, i.e. the circle. This 
question doesn’t make much sense as combinatorics, but we can shift viewpoint 
and consider random triangulations of the n-gon, in which each of the $c_{n-1}$ 
possible triangulations is equally likely. The purpose of this paper is to show that 
there exists an object “the random triangulation of a circle” which is in a natural 
sense the $n \to \infty$ limit of the random triangulation of the n-gon. As with Pólya [9], 
the exposition takes readers into some newly developed theory of the author. 
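
Formula (1) is easy to test by machine for small n. The sketch below (Python; the names `c` and `count_triangulations` are illustrative only) compares (1) with a direct recursive count obtained by fixing one side of the n-gon and choosing the apex of the triangle that sits on that side.

```python
from functools import lru_cache
from math import factorial

def c(m):
    # Catalan number in the indexing of (1)
    return factorial(2 * m - 2) // (factorial(m - 1) * factorial(m))

@lru_cache(maxsize=None)
def count_triangulations(n):
    """Number of triangulations of a convex n-gon (a 2-gon is a single edge)."""
    if n <= 3:
        return 1
    # The triangle on side (1, n) has apex k; it cuts off a k-gon on the
    # vertices 1..k and an (n - k + 1)-gon on the vertices k..n.
    return sum(count_triangulations(k) * count_triangulations(n - k + 1)
               for k in range(2, n))

for n in range(3, 13):
    print(n, count_triangulations(n), c(n - 1))
```

The two columns agree, starting 1, 2, 5, 14 for the triangle, square, pentagon and hexagon.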

Let’s start by recalling a precise definition. A triangulation of a finite set S is a 
collection of nonintersecting line segments with endpoints from S such that the 
convex hull of S is partitioned into triangular regions. We shall be concerned only 
with the cases $S_n$ consisting of the vertices of the regular n-gon inscribed in a fixed 
circle of unit circumference. In such a triangulation each point is linked to its 
neighbor on either side, and may or may not be linked to other points. For each n 
there is a finite set $A_n$ of possible triangulations of $S_n$, so it makes sense to talk 
about a (uniform) random triangulation of $S_n$, where the word uniform emphasizes 
that all possible triangulations are equally likely. Figures 1 and 2 illustrate 
random triangulations for n = 12 and n = 2,000. In Figure 2 the printer drew a 
2000-gon, but of course it looks like a circle, so it is tempting to regard Figure 2 as 
approximately an example of our desired limit “random triangulation of the 
circle”. To make sense of this object, let’s forget randomness for a while, and start 
by defining “triangulation of a circle”. As far as I know, the topic hasn’t been 
discussed before, so I get to make up my own definition. 


Figure 1 

Figure 2 


Definition 1. A triangulation of the circle is a closed subset of the closed disc whose 
complement is a disjoint union¹ of open triangles with vertices on the circumference of 
the circle. 


It is not hard to show that the triangulations of the circle defined above are 
exactly the possible limits of triangulations of n-gons. Here “limit” presupposes a 
topology, and we use the Hausdorff metric on compact sets: 

$$d(C_1, C_2) = \sup_{x \in C_1} \inf_{y \in C_2} |y - x| + \sup_{x \in C_2} \inf_{y \in C_1} |y - x|.$$

In this “limit” assertion we regard a triangulation of the n-gon as the closed set 
comprised of the 2n − 3 line segments. To illustrate, consider two sequences of 
triangulations of $S_n$. Labeling the points as 1 through n, one possible triangulation 
links 


$$(1,2)\,(1,3)\,(1,4)\,(1,5) \ldots (1,n).$$

For n = 2,000 (cf. Figure 2) the chords would look dense in the interior of the 
circle, and of course the $n \to \infty$ limit is the whole closed disc. Another possible 
triangulation, taking n to be a power of 2 for simplicity, links 


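$$(1, \tfrac{n}{2})\,(\tfrac{n}{2}, n)\,(1, \tfrac{n}{4})\,(\tfrac{n}{4}, \tfrac{n}{2})\,(\tfrac{n}{2}, \tfrac{3n}{4})\,(\tfrac{3n}{4}, n) \ldots$$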


The $n \to \infty$ limit is a triangulation by sequential bisection of the circle, with each 
chord isolated and only finitely many chords longer than a prespecified length 
L > 0. But the triangulation in Figure 2 is qualitatively different from each of the 
“extreme” possibilities above: the chords are neither dense nor isolated. It turns 
out that the limit random triangulation of the circle, formalized as a closed subset 
of the closed disc, has² Hausdorff dimension 3/2, instead of dimension 2 or 1 as in 
the deterministic examples above. This fact, whose proof is sketched in Section 6, is 
the main concrete result of the paper. I find it remarkable that such fractal 
structure arises naturally³ in random combinatorial objects. 

¹ The union may be empty, finite or countably infinite. 
² With probability 1. 

Given Definition 1, one might want immediately to pose and try to solve 
quantitative probability questions such as Question 1 below. Note first that the 
length of the longest chord in a triangulation of the circle must be at least the 
side-length $l_1$ of an inscribed equilateral triangle, and at most the diameter $l_2$ of 
the circle. 


Question 1. In a random triangulation of the circle, what is the chance that the 
longest chord has length greater than $(l_1 + l_2)/2$? 


This question is phrased to resemble the well-known Bertrand’s paradox. 


Question 2. What is the chance that a random chord in the circle has length greater 
than $l_1$? 


This is called a paradox because, as discussed by Martin Gardner ([7] Chapter 
19), three equally plausible calculations give three different answers. The concep- 
tual point is that the notion “random chord” has no canonical meaning: instead 
there are several different meanings we could ascribe to it, modeling different 
mechanisms for physically drawing a chord in some way influenced by chance. In 
mathematical terms, these lead to different probability measures on the set of 
chords. The same issues arise with triangulations of the circle: before attempting 
problems like Question 1 we need to be clear about the probability measure 
underlying the word “random.” Our resolution is to use the measure which is the 
limit of uniform random triangulations of n-gons, and so the issue changes to 
proving existence of such a limit. This is sketched in Section 5. How to solve 
quantitative problems like Question 1 is discussed in Section 7. 


2. CONTINUOUS FUNCTIONS AND TRIANGULATIONS OF THE CIRCLE. It 
turns out there is a simple way to specify a triangulation of the circle in terms of a 
more familiar object, viz. a continuous function. Let $f: [0,1] \to [0,\infty)$ be continuous 
and satisfy 


$$f(0) = f(1) = 0, \qquad f(t) > 0 \ \text{ for } 0 < t < 1. \qquad (2)$$


Suppose $t_2$ is a strict local minimum of f, that is to say $f(t) > f(t_2)$ for all $t \ne t_2$ 
in some neighborhood of $t_2$. Then by continuity there is a first time $t_3 > t_2$ at 
which $f(t_3) = f(t_2)$, and also a last time $t_1 < t_2$ at which $f(t_1) = f(t_2)$. Now regard 
[0, 1] as the circumference of the circle, and draw a triangle with vertices at $t_1$, $t_2$, 
and $t_3$. Repeat for each strict local minimum $t_2$. Note that if $f(t_2') > f(t_2)$ then $t_2'$ 
is in one of the arcs $(t_1, t_2)$ or $(t_2, t_3)$ or $(t_3, t_1)$ and the triangle formed by the $(t_i')$ 
lies inside the region between that arc and the corresponding edge of the triangle 
formed by $(t_i)$. This shows⁴ that triangles associated with different local minima 
are disjoint. So we can define a triangulation of the circle as the complement of 
the union of all the open triangles associated with all the local minima. Of course, 
if f were a polynomial we would get only a finite number of local minima and 
hence only a finite number of triangles, so the triangulation would be a closed set 
with non-zero area. But there exist functions f with the property 


$$\{t : t \text{ is a strict local minimum of } f\} \text{ is dense in } [0,1]. \qquad (3)$$


And such functions give triangulations with zero area, which seems more natural. 

³ As opposed to fractal structures arising from recursive constructions specifically designed to 
produce fractals, e.g. the Cantor set. 

⁴ More precisely, assume the values of f at different local minima are distinct, otherwise we might 
get both diagonals of a quadrilateral. 


Figure 3 
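
The recipe of this section, one triangle $(t_1, t_2, t_3)$ for each strict local minimum $t_2$, is easy to try out on a piecewise-linear function. The following is a sketch only, in Python; the function name and the choice of integer sample values (so that exact fractions can be used) are assumptions made for the illustration.

```python
from fractions import Fraction

def triangles_from_samples(values):
    """Triangles (t1, t2, t3) for the piecewise-linear f through the points
    (j/m, values[j]), j = 0..m, one per strict local minimum t2 of f.
    Sample values are assumed to be integers (or Fractions)."""
    m = len(values) - 1
    assert values[0] == 0 and values[m] == 0 and all(v > 0 for v in values[1:m])
    triangles = []
    for j in range(1, m):
        h = values[j]
        if not (values[j - 1] > h and values[j + 1] > h):
            continue                      # not a strict local minimum
        k = j + 1                         # first return to level h after t2
        while values[k] > h:
            k += 1
        t3 = k - 1 + Fraction(values[k - 1] - h, values[k - 1] - values[k])
        i = j - 1                         # last visit to level h before t2
        while values[i] > h:
            i -= 1
        t1 = i + Fraction(h - values[i], values[i + 1] - values[i])
        triangles.append((t1 / m, Fraction(j, m), t3 / m))
    return triangles

for t1, t2, t3 in triangles_from_samples([0, 2, 1, 3, 2, 4, 0]):
    print(t1, t2, t3)
```

In this example the second minimum is higher than the first, and its triangle lands inside the arc $(t_2, t_3)$ of the first triangle, as the nesting remark above predicts.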

This “function → triangulation” mapping is useful for two reasons. First, it 
gives us a general strategy for defining random triangulations indirectly, by first 
defining random functions and then applying the mapping. Such an indirect 
approach is useful because random functions (better known as stochastic processes) 
have been the topic of half the research in mathematical probability theory for the 
last fifty years, so we have related a new idea to a well-studied one. Secondly, and 
more concretely, the mapping “function → triangulation” turns out to be just the 
continuous analog of a known correspondence between triangulations of the n-gon 
and discrete walks, which we now describe. 


3. TRIANGULATIONS, TREES, WALKS AND CATALAN NUMBERS. The 
combinatorial results in this section have been explained very elegantly by Martin 
Gardner in Chapter 20 of [8], so we shall be brief. It is convenient to consider 
triangulations of the (n + 1)-gon, with vertex-set $S_{n+1} = \{1, 2, \ldots, n + 1\}$. As 
mentioned before, the number of triangulations of $S_{n+1}$ is given by the Catalan 
number $c_n$ defined at (1). Various other combinatorial sets have exactly the same 
size, and the one of ultimate interest to us is the set $W_n$ of positive walks of length 
2n whose first return to 0 is at time 2n. A walk has steps +1 or −1: Figure 4 
illustrates one such walk for n = 11. One can specify an explicit one-to-one 
correspondence between $W_n$ and the set $A_{n+1}$ of triangulations of $S_{n+1}$. This is 
most simply done in three stages, passing through two sets of trees whose size is 
also $c_n$. We shall specify each map in one direction only, leaving the reader to 
verify that it is indeed one-to-one. 


Figure 4 

Figure 5 


Map 1. This map takes the walk in Figure 4 to the tree in Figure 5. Imagine 
drawing a tree as the walk progresses. After the first step of the walk we draw the 
root of the tree. In general, when the walk makes a +1 step we draw a new edge 
from the current vertex to a new vertex. When the walk makes a −1 step we 
retrace our pencil from the current vertex down one edge toward the root. Thus 
vertices a, b, c, d, e of the tree are first drawn at the steps of the walk indicated in 
Figure 4. Note that the three children b, c, d of a are produced in a specific order 
as “first child, second child, third child”, and so the tree in Figure 5 is an ordered 
tree. Map 1 is a one-to-one correspondence between $W_n$ and the set $OT_n$ of rooted 
ordered trees on n vertices. 
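
Map 1 translates almost word for word into code. In the Python sketch below, an ordered tree is encoded as nested lists (a vertex is the list of its children, in order); that encoding, and the function names, are choices made for this illustration.

```python
def walk_to_ordered_tree(steps):
    """Map a walk in W_n (steps +1/-1, length 2n, positive until time 2n)
    to a rooted ordered tree, encoded as nested lists of children."""
    heights = [0]
    for s in steps:
        heights.append(heights[-1] + s)
    assert len(steps) >= 2 and heights[-1] == 0
    assert all(h > 0 for h in heights[1:-1]), "walk is not in W_n"
    root = []
    path = [root]                 # vertices from the root to the current vertex
    for s in steps[1:-1]:         # the first step draws the root, the last leaves it
        if s == 1:
            child = []
            path[-1].append(child)    # new edge to a new vertex
            path.append(child)
        else:
            path.pop()                # retrace one edge toward the root
    return root

def count_vertices(tree):
    return 1 + sum(count_vertices(child) for child in tree)

tree = walk_to_ordered_tree([1, 1, -1, 1, 1, -1, -1, -1])   # n = 4
print(tree, count_vertices(tree))
```

The example walk has length 8, and the resulting tree has 4 vertices: a root with two children, the second of which has one child.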


Map 2. Figure 6 shows a tree which is a binary tree in the following sense: each 
vertex is either a leaf (with no children) or an interior vertex (with exactly 2 
children, distinguished as “left” and “right”). The first child–next sibling map 
takes the ordered tree in Figure 5 to the binary tree in Figure 6. Each vertex v 
(except the root) of the ordered tree is associated with an interior vertex v′ of the 
binary tree. If v has children in the ordered tree then there is a first child w, and 
in the binary tree we make the left edge from v′ go to w′; if not, the left edge leads 
to a leaf. If v has a next sibling x in the ordered tree then in the binary tree we 
make the right edge from v′ go to x′; if not, the right edge leads to a leaf. Thus in 
Figure 6 the vertices a, b, c, d (we’ve omitted the primes) occur along a path, 
because b is the first child of a, then c is the next sibling of b, then d is the next 
sibling of c. To start the construction, the root of the binary tree is the first child of 
the root of the ordered tree. Map 2 is a one-to-one correspondence between $OT_n$ 
and the set $BT_n$ of binary trees with n − 1 internal vertices and hence with n 
leaves. 
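
Here is Map 2 as a short recursive sketch in Python. The encodings are assumptions of the illustration: an ordered tree is a nested list (a vertex is the list of its children), a leaf of the binary tree is `None`, and an interior vertex is a pair `(left, right)`.

```python
def first_child_next_sibling(siblings, i=0):
    """Binary tree attached to the i-th vertex in a list of siblings."""
    if i == len(siblings):
        return None                                     # nothing there: a leaf
    left = first_child_next_sibling(siblings[i], 0)     # first child of the vertex
    right = first_child_next_sibling(siblings, i + 1)   # next sibling of the vertex
    return (left, right)

def ordered_to_binary(root):
    # the root of the binary tree is the first child of the ordered tree's root
    return first_child_next_sibling(root, 0)

def count_leaves(b):
    return 1 if b is None else count_leaves(b[0]) + count_leaves(b[1])

ordered = [[], [[]]]              # 4 vertices: root, two children, one grandchild
binary = ordered_to_binary(ordered)
print(binary, count_leaves(binary))
```

For this ordered tree on n = 4 vertices the binary tree has 3 interior vertices and 4 leaves, in line with the count stated for $BT_n$.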


Map 3. There is a one-to-one correspondence between $BT_n$ and the set $A_{n+1}$ of 
triangulations of the points $S_{n+1} = \{1, 2, \ldots, n + 1\}$. This map takes the binary 
tree of Figure 6 to the triangulation of Figure 1, and is illustrated by Figure 7 
which shows the tree and the triangulation superimposed. The idea is that chords 
of the triangulation are identified with edges of the binary tree. Each chord 
(i, i + 1) on the boundary of the convex hull (except (n + 1, 1)) is identified with 
an edge of the binary tree leading to a leaf. Each interior vertex v of the binary 
tree can be identified as a point inside a triangle of the triangulation; the three 
edges of the tree at v correspond to the chords of the triangle, the edge leading to 
the root being the chord of the triangle separating the interior of the triangle from 
the distinguished edge (n + 1, 1). Finally, the root of the tree is identified as a 
point in the interior of the triangle containing the edge (n + 1, 1). 


Figure 7 
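
One concrete way to realize a correspondence of this kind is sketched below in Python. The labeling convention is an assumption of the sketch and may differ from the one used in Figure 7: the leaves, read from left to right, are taken to be the sides (1, 2), (2, 3), ..., (n, n + 1), a leaf is `None`, and an interior vertex is a pair `(left, right)`.

```python
def binary_to_triangulation(tree):
    """Triangles of the polygon on vertices 1..n+1 encoded by a binary tree
    with n leaves. Returns (n + 1, list of triangles)."""
    triangles = []

    def walk(node, a):
        # node covers the boundary from vertex a to the returned vertex b
        # and sits just inside the chord (a, b)
        if node is None:
            return a + 1                  # a leaf is the side (a, a + 1)
        m = walk(node[0], a)              # left subtree covers a..m
        b = walk(node[1], m)              # right subtree covers m..b
        triangles.append((a, m, b))       # triangle with chords (a,m), (m,b), (a,b)
        return b

    last = walk(tree, 1)                  # for the root, (a, b) is the side (1, n + 1)
    return last, triangles

print(binary_to_triangulation((None, ((None, None), None))))
```

For this binary tree with 4 leaves the output is a triangulation of the pentagon into the three triangles (2, 3, 4), (2, 4, 5) and (1, 2, 5); the root's triangle is the one containing the distinguished edge.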


4. BROWNIAN MOTION AND EXCURSION. Where does this combinatorial 
skullduggery get us? Picking a walk at random from $W_n$ gives us a constrained 
random walk of length m = 2n (constrained by positivity and the “first return at 
time 2n” requirement). A simpler object is the corresponding unconstrained 
random walk of length m, where all $2^m$ possible walks are equally likely. Such 
random walks are fundamental to probability theory, and their limit Brownian 
motion is central to much of the advanced probability theory studied by 
mathematicians in the last fifty years. To understand the limiting area involved, consider 
the practical issue of drawing, for large m, the m-step walk of Figure 4 on a 
typical piece of paper with available width (left-to-right) 1 unit and height 
(top-to-bottom) somewhat greater. To fit the paper we clearly want to make each step 
have width 1/m. It’s less clear how high up or down each step should be, but it 
turns out that the maximum height of the walk is of order $\sqrt{m}$, so a good choice is 
to make the steps have height $\pm\sqrt{1/m}$. For any finite m we draw a piecewise 
linear path, but we can now imagine the $m \to \infty$ limit as being a continuous, but 
“jerky” rather than “smooth”, path. This limit procedure, applied to 
unconstrained random walk, gives a random continuous path, called Brownian motion⁵. 
We need the variation of this result saying that the constrained random walk 
converges to Brownian motion constrained to satisfy (2), a process called Brownian 
excursion. Our loose description of the paths of these processes as “jerky” is 
reflected in precise results which say e.g. that the paths are nowhere differentiable 
and satisfy (3). 
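
The rescaling described in this section (step width 1/m, step height $\pm\sqrt{1/m}$) can be tried directly. The sketch below, in Python, does this for the unconstrained walk only; the function name and the fixed seed are choices made for the illustration.

```python
import random
from math import sqrt

def scaled_walk(m, seed=0):
    """Points (time, height) of an unconstrained m-step +/-1 walk, rescaled so
    that each step has width 1/m and height +/- sqrt(1/m)."""
    rng = random.Random(seed)         # fixed seed, so the sketch is reproducible
    height, points = 0, [(0.0, 0.0)]
    for k in range(1, m + 1):
        height += rng.choice((-1, 1))
        points.append((k / m, height / sqrt(m)))
    return points

for m in (100, 10_000, 100_000):
    path = scaled_walk(m)
    print(m, max(abs(y) for _, y in path))
```

If the square-root scaling is the right one, the printed maximal heights should stay of the same order as m grows, rather than blowing up or shrinking to zero.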


5. THE RANDOM TRIANGULATION OF THE CIRCLE. Granted the existence 
of Brownian excursion, we can see how to combine the ingredients we’ve assem- 
bled. Section 3 specifies a mapping taking constrained random walk to random 
triangulation of $S_n$. Section 2 specifies a mapping from continuous functions to 
triangulations of the circle, and applying this mapping to Brownian excursion gives 
a random triangulation of the circle. So this random triangulation of the circle 
must be the limit of the uniform random triangulation of $S_n$, because Brownian 
excursion is the limit of the constrained random walk. 

Of course, the prose in the paragraph above skips a lot of technicalities, but the 
only conceptually important thing to check is that the mapping from Brownian 
excursion to the continuous triangulation is “essentially the same” as the mapping 
from the discrete walk to the discrete triangulation. To check this, consider a 
triangle in the triangulation of $S_{n+1}$ for which none of the sides is very short. 
Using Map 3, this triangle corresponds to an interior vertex v of the binary tree. 
Each of the three edges from v leads toward some proportion of the leaves, and 
these three leaf-proportions are (give or take one leaf) the arc-lengths subtended 
by the sides of the triangle. In other words, a triangle of non-negligible area 
corresponds to a vertex v with children w, x such that, partitioning the leaves as 
descendants of w, descendants of x or descendants of neither, none of these three 
components has a negligible proportion of the leaves. Mapping⁶ now to the 
ordered tree using Map 2, v and x are siblings, with parent u, say, and u has a 
similar property in the ordered tree as v had in the binary tree. That is, if we 
partition the vertices of the ordered tree as descendants of one child (v) of u, or 
descendants of another child (x) of u, or not descendants of u at all, then⁷ (give or 
take a few vertices, the descendants of the other children of u) these three 
proportions are the arc-lengths of the original triangle. Finally consider Map 1, 
taking the ordered tree to the walk. The subtrees at v and at x correspond to 
nearby subintervals $[\alpha_1, \alpha_2]$ and $[\beta_1, \beta_2]$ of {0, 1, ..., 2n} for which the walk is at 
the same height (1 plus the height of vertex u) at the different endpoints of the 
subintervals, and the walk is above that height during the subintervals. The lengths 
of the subintervals, relative to 2n, are the sizes of the subtrees at v and x, and 
hence are approximately the arc-lengths of the original triangle. As $n \to \infty$ such 
subintervals become adjacent intervals $[t_1, t_2], [t_2, t_3]$ associated with a local 
minimum of a continuous function, and this is exactly the correspondence between 
functions and triangulations defined in Section 2. 

⁵ A rigorous discussion of Brownian motion and the nowhere-differentiability and dense local 
minima properties can be found in any good textbook on Probability at the first-year-graduate level, e.g. 
Durrett [6]. The Brownian excursion is less common in textbooks, but some discussion and references 
are in Bhattacharya and Waymire [4]. 

⁶ This isn’t the best way to make a rigorous argument: see the end of Section 8.1. 


6. THE FRACTAL PROPERTY OF THE RANDOM TRIANGULATION OF THE 
CIRCLE. Our discussion here will be very sketchy. Given $\varepsilon > 0$ consider the set 
$S_\varepsilon$ of points on the circumference of the circle which are endpoints of some chord 
in the random triangulation with length at least $\varepsilon$. If we argue that $S_\varepsilon$ has 
dimension 1/2 then the reader should have no difficulty believing that the 
triangulation itself (a closed subset of the disc) has dimension 3/2, because each 
point of $S_\varepsilon$ corresponds to a chord in the triangulation. In terms of the construction 
of the triangulation from a function f chosen at random by Brownian 
excursion, chords correspond to intervals [s, s′] for which f(s) = f(s′) and f(t) > 
f(s) on s < t < s′. (Although such an interval may not be part of a local minimum 
interval-pair, it will be a limit of intervals which are.) Consider such intervals 
straddling time 0.5: these are the intervals $[s_y, s'_y]$ where 0 < y < f(0.5) and 


$$s_y = \sup\{t < 0.5 : f(t) = y\}, \qquad s'_y = \inf\{t > 0.5 : f(t) = y\}.$$

So we need to argue that 

$$\text{the set } \{s_y : 0 < y < f(0.5)\} \text{ has dimension } 1/2 \qquad (4)$$


and then replacing 0.5 with an arbitrary rational shows that $S_\varepsilon$ has dimension 1/2. 

Fortunately (4) can be deduced from standard facts about functions g(t) chosen 
at random by Brownian motion. The most important of these facts is 

(a) ([6] Exercise 7.4.4) The zero-set {t: g(t) = 0} has dimension 1/2. The most 
intuitive explanation of (a) is that simple random walk has order $n^{1/2}$ visits to 0 in 
the first n steps. 

Other facts require the concept of distribution-preserving transformation. If a 
number u is chosen uniformly at random from [0,1] then the number 1 − u is also 
random and uniform on [0,1], this being a special property of the particular 
transformation u → 1 − u. In other words, 1 − u has the same distribution as u. 
An analogous fact about Brownian motion is 

(b) Lévy’s identity ([6] Theorem 7.4.7). Given g(t), define
$g^*(t) = \sup_{0 \le s \le t} g(s)$. Then $g^*(t) - g(t)$ and $|g(t)|$ have the same
distribution. In particular, (a) and (b) imply that the set $\{t: g(t) = g^*(t)\}$ has
dimension 1/2. This is essentially the same as saying

the set $\{t_y: 0 < y\}$ has dimension 1/2 (5)

where $t_y = \inf\{t > 0: g(t) = y\}$. Next, Brownian motion has a “time-reversal”
property

(c) Define $\hat{g}(t) = g(0.5 - t)$. If g(t) is chosen according to Brownian motion
then so is $\hat{g}(t)$.

This property and (5) give us (4) for functions chosen according to Brownian
motion. Of course we really need (4) for Brownian excursion, but the conditioning
involved in producing Brownian excursion from Brownian motion doesn’t affect
“local” properties of the random functions, so (4) still holds for Brownian excursion.

⁷ The argument would break down if there were a vertex with three children, each of whose
descendants comprised a non-trivial proportion of the population. But this has probability → 0 as
n → ∞.


7. QUANTITATIVE CALCULATIONS. Consider three vertices $1 \le i_1 < i_2 < i_3 \le n + 1$
of the (n + 1)-gon. The chance that the triangle with these three corners occurs in
the random triangulation of the (n + 1)-gon is


$$p(n; i_1, i_2, i_3) = \frac{c_{i_2 - i_1}\, c_{i_3 - i_2}\, c_{n+1+i_1-i_3}}{c_n} \tag{6}$$

because the edge from $i_1$ to $i_2$ creates a (non-regular) $(i_2 - i_1 + 1)$-gon outside the
triangle. Noting that (1) and Stirling’s formula give

$$c_n \sim \pi^{-1/2}\, n^{-3/2}\, 2^{2n-2}$$

we can take asymptotics in (6) to get

$$p(n; i_1, i_2, i_3) \sim n^{-3}\, \phi(t_1, t_2, t_3) \quad \text{as} \quad \left(\frac{i_1}{n+1}, \frac{i_2}{n+1}, \frac{i_3}{n+1}\right) \to (t_1, t_2, t_3)$$

where

$$\phi(t_1, t_2, t_3) = \frac{1}{4\pi}\, (t_2 - t_1)^{-3/2} (t_3 - t_2)^{-3/2} (1 + t_1 - t_3)^{-3/2}, \qquad 0 < t_1 < t_2 < t_3 < 1. \tag{7}$$
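Formulas (6) and (7) lend themselves to a quick numerical sanity check. The sketch below is illustrative only. It assumes that formula (1), which is not reproduced in this section, reads $c_n = (2n-2)!/(n!\,(n-1)!)$, the number of triangulations of the (n + 1)-gon; the function names `c`, `p` and `phi` are hypothetical. It confirms that the probabilities in (6) sum to n − 1, the number of triangles in any triangulation of the (n + 1)-gon, and compares $n^3 p$ with $\phi$ for one moderately large n.

```python
from fractions import Fraction
from itertools import combinations
from math import comb, pi

def c(n: int) -> int:
    """Assumed form of formula (1): c_n = (2n-2)! / (n! (n-1)!)."""
    return comb(2 * n - 2, n - 1) // n

def p(n: int, i1: int, i2: int, i3: int) -> Fraction:
    """Formula (6): chance that triangle (i1, i2, i3) occurs in the (n+1)-gon."""
    return Fraction(c(i2 - i1) * c(i3 - i2) * c(n + 1 + i1 - i3), c(n))

def phi(t1: float, t2: float, t3: float) -> float:
    """Formula (7): limiting frequency spectrum of triangles."""
    return ((t2 - t1) * (t3 - t2) * (1 + t1 - t3)) ** -1.5 / (4 * pi)

# Exact check: every triangulation of the (n+1)-gon has n - 1 triangles,
# so the probabilities in (6) must sum to n - 1 over all vertex triples.
n = 12
total = sum(p(n, *tri) for tri in combinations(range(1, n + 2), 3))
print(total)  # 11

# Asymptotic check of (7): the ratio n^3 p / phi should be close to 1.
n, tri = 300, (50, 150, 250)
ratio = float(p(n, *tri)) * n**3 / phi(*(i / (n + 1) for i in tri))
print(ratio)
```

Exact rational arithmetic is used for (6) so that the sum check is an identity rather than a floating-point approximation.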


The function $\phi$ represents the frequency spectrum of triangles in the random
triangulation of the circle. That is, representing the vertices of a triangle by their
distances $t_i$ around the circumference (as in Section 2),

$$\phi(t_1, t_2, t_3)\, dt_1\, dt_2\, dt_3 = \text{mean number of triangles } (t'_1, t'_2, t'_3) \text{ with } t'_i \in [t_i, t_i + dt_i],\ i = 1, 2, 3.$$


Various quantitative problems about the random triangulation of the circle have 
straightforward answers involving $\phi$. Consider Question 1, and measure “length”
by arc-length, so the maximal chord length is between 1/3 and 1/2. A moment’s
thought reveals that the longest chord is just the longest edge of the triangle
containing the center of the disc. So for 1/3 < x < 1/2 the maximal chord-length
is less than x iff the triangulation contains a triangle $(t_1, t_2, t_3)$ such that

$$\max(t_2 - t_1,\ t_3 - t_2,\ 1 + t_1 - t_3) < x \tag{8}$$


because such a triangle necessarily contains the center of the disc. So the 
probability that the maximal chord-length is less than x is the integral of $\phi$ over
the domain (8). With a little help from MATHEMATICA we find that the integral 
has the explicit expression

$$6\pi^{-1}\left(\arctan 3^{-1/2} - \arctan (1 - 2x)^{1/2}\right) - \frac{(3x - 1)(1 - 2x)^{1/2}}{\pi x (1 - x)} \tag{9}$$

and that the probability density function of the maximal chord-length is

$$\frac{3x - 1}{\pi x^2 (1 - x)^2 (1 - 2x)^{1/2}}, \qquad \frac{1}{3} < x < \frac{1}{2}. \tag{10}$$


The latter shows that the maximal chord-length distribution is strongly biased 
toward the upper end of the interval [1/3, 1/2]. Numerically, the median works 
out as 0.479 and the chance of being in the lower half of the interval (Question 1) 
is only 0.126. 
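The two numerical values just quoted can be reproduced from the closed form. The following Python sketch is a check under stated assumptions, not the computation used in the paper: it treats the explicit expression for the integral as the distribution function F(x) of the maximal chord-length on [1/3, 1/2], evaluates it at the midpoint 5/12 of that interval, and finds the median by bisection. The names `chord_cdf` and `chord_median` are hypothetical.

```python
import math

def chord_cdf(x: float) -> float:
    """P(maximal chord arc-length < x) for 1/3 <= x <= 1/2, from the closed form."""
    root = math.sqrt(1.0 - 2.0 * x)
    return ((6.0 / math.pi) * (math.atan(3.0 ** -0.5) - math.atan(root))
            - (3.0 * x - 1.0) * root / (math.pi * x * (1.0 - x)))

def chord_median(tol: float = 1e-12) -> float:
    """Bisection for the x with chord_cdf(x) = 1/2 (the CDF is increasing)."""
    lo, hi = 1.0 / 3.0, 0.5
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if chord_cdf(mid) < 0.5:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

lower_half = chord_cdf(5.0 / 12.0)   # 5/12 is the midpoint of [1/3, 1/2]
median = chord_median()
print(f"P(max chord < 5/12) = {lower_half:.3f}")   # 0.126
print(f"median              = {median:.3f}")       # 0.479
```

The expression vanishes at x = 1/3 and equals 1 at x = 1/2, as a distribution function on this interval must.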

We leave to the reader a similar problem. 


Question 3. What is the chance that the largest area of a triangle in the random 
triangulation is greater than half the area of a square inscribed in the circle? 


8. DISCUSSION 


8.1 The weak convergence paradigm. We’ve discussed triangulations, but we could 
try the same approach in a vast range of settings. Given a sequence of discrete 
(typically combinatorial) random structures of size n, does there exist a continuous 
structure representing the $n \to \infty$ limit? If so, then for many questions about the
size-n structure one can obtain the $n \to \infty$ limit by simply asking the same question
of the limit structure. This is the weak convergence paradigm⁸. To explain the
name, recall from elementary probability that the Normal distribution is, in a
certain sense, the limit of Binomial distributions. This kind of convergence is called
“convergence in distribution” or “weak convergence”, applied to random numbers.
But we can also talk about weak convergence of random elements of an
abstract metric space. This abstract theory⁹ was developed in the 1950s and 1960s,
with special emphasis on the case of random functions (from $[0,\infty)$ to $\mathbb{R}$) because
these are just stochastic processes by another name. 

Trees are fundamental in combinatorics. If you are willing to regard Figures 1,
4, 5 and 6 as different pictures of the same object, then we have implicitly been
studying asymptotics of random trees. The weak convergence paradigm is based
upon representing a “size n” combinatorial object as an element of a metric space,
scaled in such a way that the objects are comparable for different n. Informally,
this is the idea of being able to draw different sized objects on the same sized piece
of paper. It is trivial to do this for trees, provided you are willing to picture the tree
as a triangulation or as a path (formally, the metric spaces are “closed subsets of
the disc” and “continuous functions $[0,1] \to \mathbb{R}$”). But if you insist on picturing
trees as in Figures 5 and 6, you have problems. How do you actually draw a
2000-edge tree in the style of Figure 5 which “looks right,” and what metric space
do such trees inhabit? It’s not easy¹⁰ to say. But by drawing trees as walks or
triangulations we can literally see some interesting numerical characteristics of the
tree, and indeed the two pictures reveal different aspects of the tree. For instance,
the height of the tree is just the maximum height of the walk. And the longest
chord in the triangulation corresponds to the edge of the tree which most nearly
partitions the vertices into equal components.

⁸ My friend Mike Steele prefers to call it “the objective method”, because it involves constructing a
limit object.

⁹ The classic text is Billingsley [5].

¹⁰ The best way I know to “draw large trees as trees” is in Aldous [1] pages 4-5.

As presented here, one might expect to get Brownian excursion asymptotics for 
random trees in only the two special models of random trees which arose in 
Section 3, viz uniform random trees in $OT_n$ and in $BT_n$. In fact, much more is true.
Given any model for “random tree” one can apply Map 1 to get a (non-uniform, in 
general) random path. But despite the non-uniformity, it turns out that the 
Brownian excursion limit holds for a wide class of combinatorial models of random 
trees. This is a remarkable fact with no simple explanation. A survey of this field, 
aimed at a more sophisticated audience, is in [2]. In making a rigorous argument 
out of the prose in Section 5, it is technically easiest to use the fact that the 
non-uniform random walk obtained by applying Map 1 directly to the binary tree 
has the Brownian excursion limit. 


8.2 More about trees and triangulations. Gardner [8] mentions several other 
combinatorial sets whose sizes are given by the Catalan numbers, and Richard 
Stanley (personal communication) promises a longer list to appear in Volume 2 of 
[11]. Any calculation one can do with Brownian excursion leads to a limit result for 
random elements of any of these sets: whether such limit results are interesting is 
another matter. 

Let me also mention work of Sleator et al. [10], who use the representation of 
binary trees as triangulations to obtain bounds on the number of “rotation steps”
(a natural operation on trees as data structures in computer science) needed to 
move from one n-tree to another. 

A rigorous treatment of some of the new results mentioned in this paper will be 
presented in [3]. 
Hypatia and Her Mathematics


Michael A. B. Deakin 


1. INTRODUCTION. The first woman mathematician of whom we have reason- 
ably secure and detailed knowledge is Hypatia of Alexandria. Although there is a 
considerable amount of material available about her, very much of that is fanciful, 
tendentious, unreferenced or plain wrong. These limitations are to be found even 
in works that we might hope to be authoritative; for example, the entry in the 
Dictionary of Scientific Biography (DSB) [11]. Even where the account given is 
more careful and accurate [14, 19, 20], one is disappointed to be told so little of 
Hypatia’s Mathematics. 

This article will direct the reader’s attention to the best accessible sources and 
will describe what is known about her mathematical activities. 


2. THE HISTORICAL BACKGROUND. In about 330 B.C., Alexander the Great
conquered northern Egypt and, via a deputy (Ptolemy I Soter), founded a city 
(Alexandria) in the Nile delta. This almost immediately became home to the 
Alexandrian Museum, an institution of higher learning, rather akin to the medieval 
universities of some 1500 years later. Euclid was an early (probably the first) 
“professor” of mathematics. 

The Museum continued for many centuries. In 30 B.C., Cleopatra’s suicide
allowed the Roman Empire to occupy Alexandria, but this event destroyed neither 
the city’s Greek heritage nor its intellectual tradition. In the years that followed, 
two of the greatest of late Greek mathematicians flourished in Alexandria. 
Diophantus was active around A.D. 250 and produced in particular his Arithmetica
at this time. Several generations later, Pappus (c.300—c.350) also worked there. 

A later mathematician, Theon of Alexandria, was the last person definitely 
known to have been associated with the Museum. Because he recorded two 
eclipses (one of the sun and one of the moon) and because he is also credited with 
achievements during the reign of Theodosius I, it is thought that he was at the 
height of his powers in the decade 360-370. Theon may well have been the last 
“president” of the Museum. His daughter, Hypatia, was associated with the
Neo-platonic School—a different institution. 

Alexandria, in the years around A.D. 400, was a turbulent mix of cultures.
Christians were in the majority, but they were divided among themselves. There 
were also persons whom the Christians regarded as “pagans”; these could be 
anything from believers in the Olympian pantheon to adherents of various schools 
of “Neoplatonic” thought. Beyond these there were also Jews and Gnostics.

The Roman Empire, of which Alexandria was a part, was under external 
pressure from the Huns and the Visigoths. It split in 395 into the Western Empire 
(ruled from Rome) and the Eastern Empire (ruled from Constantinople). The 
official religion was Christianity: it had been established under Constantine. But 
there had been relapses; in particular, Julian the Apostate had reigned over the 
combined empire from 361 to 363. 
At the time of Hypatia’s death, the local governor was Orestes, a Christian not
unsympathetic to other views, but whose authority was under challenge from that 
of the less tolerant Cyril (St. Cyril of Alexandria) who acceded to the bishopric in 
412. The divisions that beset the city were prone to erupt into sectarian violence; 
the great libraries associated with the Museum were one by one destroyed, the last 
going up in smoke in 392 when the temple of Serapis was put to the torch during a 
riot. Another such disturbance was to claim Hypatia’s life in the second decade of 
the fifth century. She died, brutally hacked to pieces, at the hands of a Christian 
lynch-mob. 

Following this, very possibly in part because of it, the thrust of Neoplatonist 
thought and education moved from Alexandria to Athens. Three names require 
mention. Proclus (410?—485) was the last of the great mathematicians of Greek 
antiquity. He frequented the Neoplatonic School at Athens and is best remem- 
bered for a commentary on Book 1 of Euclid’s Elements. After Proclus came 
Isidorus and his pupil Damascius (philosophers both of them rather than mathe- 
maticians, although the latter may have some claim on a place in mathematical 
history [6, pp. 312—313]). In 529, the emperor Justinian, enforcing Christianity as 
the state religion, closed the Neoplatonic School and Damascius went into exile in 
Persia. 


3. THE PRIMARY SOURCES. The oldest accounts of Hypatia come to us from 
either the Suda (or Suidae) Lexicon or from the writings of the early Christian 
Church. For an accessible account of them, giving more detail than I provide here, 
see Mueller [14]. 


Medallion of Hypatia in the Introduction to Halma’s edition of Theon’s “Commentary on the
Almageste”. (Artist unknown)


Briefly, the Suda was a 10th-century encyclopedia, alphabetically arranged, and 
drawing on earlier sources. In the case of Hypatia, these are in part known. (One is 
a now lost work, a life of Isidorus by Damascius.) The relevant entry is unusually 
long, but is not seen as reliable in all its aspects (see [25]); indeed in places it 
contradicts itself. 

The other sources are to be found in the main in a compilation known as the
Patrologiae Graecae [13], or PG for short. This gives earlier accounts (particularly 
of her death) than are available in the Suda and also preserves letters to her 
and about her from the hand of one of her pupils, Synesius of Cyrene. Also by 
Synesius is a letter published as a separate document included with the others in 
FitzGerald’s translation [4]. 


4. LIFE AND LEGEND. The best-recorded event in Hypatia’s life is her death 
and the manner of it. The fullest account tells us that a crowd of Christian zealots 
led by one Peter the Reader seized her, stripped her and proceeded to dismember 
her and burn the pieces of her corpse. Another says she was burned alive, but this
would seem to be a less accurate version. 

The political background to this action has been the cause of much speculation. 
Gibbon [5] is by no means alone in attributing the guilt for the murder to Cyril, but 
Rist [20] disputes this, which does mean taking issue with the Suda. Rist’s account, 
in essence, has it that, like victims of violence in Belfast or Beirut today, she was 
seized not with any great selectivity at all, but rather because she was a well-known 
public figure, prominent on the other side of a religious divide. This to my mind is 
quite compatible with the statement quoted by Gibbon to the effect that she was 
killed because of her outstanding ability. We need not posit any specific jealousy to 
say this, and Rist thinks it is unlikely that precise differences of doctrine led to her 
death. Rist does toy with the idea that her mathematical activities were a partial 
cause, hypothesizing that these included astrology. This, to me, sets us on a path 
we have no reason to travel. 

The date of her death is now generally accepted to have been 415, although 
others have been suggested. See Mueller [14] for details. 

The date of her birth is much less certain. (This is to be expected—people are 
not, generally speaking, famous when they are born.) The eclipses described by 
Theon, Hypatia’s father, have been dated to 364. So, from the eclipses to the time 
of her death is an interval of 51 years. Valesius, an early commentator on the PG 
who had the wrong date for the eclipses, reckoned this interval at 47 years; 
rounding this to 45 produces a date of c.370, which is the generally-stated figure. 
Of course, astrology aside, we have no real reason to suppose that her birth 
coincided with the eclipses; nor have we any idea how old Theon (or more 
importantly his wife) was in 364. (I tend to agree with Mueller that a date of c.350 
is more plausible.) 

As to her life between these uncertain dates, we may readily summarize. She 
was a respected and eminent teacher, charismatic even, and beloved of her pupils 
(e.g., Synesius). We have evidence that she was regarded as physically beautiful, 
that she wore distinctive academic garb, that she taught not only mathematics but 
also Philosophy, that she gave public lectures and may have held some kind of
public office. 

She seems to have been determinedly celibate, indeed repelling one ardent 
suitor by confronting him with one of her used menstrual pads and lecturing him 
on the shameful and unclean nature of what he thought beautiful (the vagina). 

Although almost all the primary sources are Christian and tell of the life and 
death (at Christian hands) of a prominent advocate of a rival philosophy, they do 
so in such a way that we are left with a favorable impression of her. My reading of 
this is that the official discouragement of her teachings on the part of the Church 
authorities and of their (Christian) civic counterparts was far from complete. 

Certainly that favorable impression has informed various works of literature of 
which the best-known in English are Kingsley’s novel [10] and the passage from 
Gibbon. Also fiction is Hubbard’s telling of Hypatia’s story [9]. It formed a chapter 
in a popular reader early this century and has given us the most widely dissemi- 
nated “portrait” of Hypatia, attributed to an artist called Gasparo, of whom I am 
able to learn nothing. (Of course such “portraits” have exactly the same validity as 
(e.g.) Doré’s illustrations of the Bible.) 


5. HYPATIA’S PHILOSOPHY. The Philosophy expounded by Hypatia is known 
to have been Neoplatonist. There were various versions of Neoplatonism, all 
endowing Plato’s Theory of Forms with an explicitly religious dimension. Richeson 
[19] describes one such system; Rist [20] suggests that Hypatia actually preached 
another. 

Richeson does however make a particularly insightful remark on the connection 
between Neoplatonist Philosophy and Mathematics. The nature of Mathematics is 
to abstract—to derive ideas from material things. Thus Geometry, although it has 
its origin in the practical world of land surveyors and inspectors of weights and 
measures, transcends these beginnings. The Elements deals with a world that is no 
longer the world of the practical but rather the world of ideas. Thus Mathematics 
could be seen as a paradigm of that transcendence over the material that 
Neoplatonism enjoined. 


6. HYPATIA’S MATHEMATICS. That Hypatia was a mathematician is beyond 
doubt. The PG tell us that she learned her Mathematics from her father Theon 
and went on to excel him in the subject and to teach it to numerous students. 
Another such source is more critical: “Isidorus greatly outshone Hypatia, not just 
because he was a man and she a woman, but in the way a genuine philosopher will 
over a mere geometer.” This opinion, which will earn no praise from either women
or mathematicians, is thought to derive from Damascius’ life of Isidorus, the lost 
work that in part informed the Suda. (Marrou [12], following Tannery [25], supplies 
the following delightful gloss: “[it] means in plain language that Isidorus knew 
nothing of mathematics.”)

However, the Suda itself gives the most explicit account of Hypatia’s mathemat- 
ical works. It attributes to her the authorship of three works. The only things she is 
known to have written all deal with Mathematics or Astronomy. The books that 
many feel she must have authored on Neoplatonist Philosophy receive no mention. 
Others (e.g., Kramer [11]) have credited her with further works of Mathematics. 
For this there is no evidence, except in one specific instance to be described below. 
The relevant passage in the Suda is precisely twelve words long. And even this 
short excerpt is the subject of various alternative and disputed readings. However, 
there is a general consensus that Tannery [25] is correct in rendering it thus: “She 
wrote a Commentary on Diophantus, [one on] the astronomical Canon, and a
Commentary on Apollonius’s Conics.” 

“Commentaries” were what we would now refer to as “Editions” (with the
obvious difference that they needed to be copied by hand), and the author of a 
“Commentary” is perhaps better referred to as an “Editor.” Such “Editors” or 
“Commentators” did, however (to a greater or lesser extent, and with greater or 
lesser care to distinguish their own contributions from the original), provide new 
material of various sorts (witness Fermat’s famous marginal note to Diophantus). 
It should be noted that in many cases the original text has come down to us only 
through Commentaries or translations (often into Arabic). 

Theon, Hypatia’s father, was a prolific author of Commentaries. He wrote one 
on the Elements (which, in places, still provides our present text), on two other 
works by Euclid, the Data and the Optics, and on two works by Ptolemy, the 
Almagest and the Handy Tables. There were also works now lost or partly so; 
particularly germane to our story is a work on the astrolabe. For this and more, see 
Toomer [28]. 

The picture that emerges of Theon is one of an editor, teacher and textbook- 
writer rather than a research mathematician. So is he judged, often with more than 
a hint of disapproval. But this should not mean that his was a wasted life. His 
works were preserved, presumably because they were perceived as having lasting 
value. It is all too understandable, given the politics of late 4th-century Alexandria 
and the decay of the Museum, that the emphasis on research (possible in Pappus’s 
time) should be replaced by the priority of conserving knowledge. 

After considering her works seriatim, I shall offer the hypothesis that in her 
scholarly priorities Hypatia was very much her father’s daughter. This, as I hope I 
have just made clear, is not to denigrate her. 


7. APOLLONIUS’S CONICS. Apollonius lived around 200 B.c. and the Conics is 
the most important of his surviving works. See, for more detail, Toomer’s account 
[27]. There are very few sources for our present text and Hypatia’s Commentary is 
not one of them. Of the eight books that make up the Conics, the first four survive 
via a Commentary by Eutocius while three of the remaining four have come down 
to us via the Arabic. The other is lost, as is also, we must conclude, Hypatia’s 
Commentary, unless it is the lost original of Eutocius’ work. 


8. THE ASTRONOMICAL CANON. In the case of the “Astronomical Canon,” we
are much better placed. It is now generally assumed that Tannery’s interpolation 
(the words in brackets in Section 6) in the Suda entry is correct. This means that 
this work also was a Commentary. The most likely original is one of the works of 
Ptolemy, either the Almagest or the Handy Tables. It will be remembered that 
Theon wrote commentaries on both these works. 

Theon’s commentary on the Almagest has been printed in various editions. The 
best and most recent is by A. Rome [21, 22]. (But see also [23].) It comprises
separate Commentaries on the thirteen books that go to make up the Almagest. 
The titular inscriptions (as described by Rome from his study of the manuscripts) 
of the first and second books ascribe these works to Theon himself. Books 4—13 
contain no inscriptions. Only the very best manuscripts contain the Commentary 
on Book Three, and here the inscription tells us that the work is Theon’s “in the
recension of my philosopher-daughter Hypatia.”

Heath [8], reviewing Rome’s work, thus ascribed this chapter of the Commen- 
tary to Hypatia, with the inference that it was also the work alluded to in the Suda, 
and that Theon (recognizing his daughter’s work as superior to his own) had
suppressed his earlier effort in favor of hers. (The pity, from our point of view, is 
that we don’t have both versions before us; so we cannot see for ourselves where or 
how or to what extent Hypatia’s Commentary differed from Theon’s.) Rome 
himself discusses the matter at considerable length in his later work [22], but in 
such a way as not to rule out a possibility that has been canvassed: that father and 
daughter collaborated. 

Neugebauer [16, p. 838] accepts this as likely. However, he regards it as 
probable that what the Suda refers to is a commentary not on the Almagest at all, 
but on the Handy Tables. This is because the same word (Canon) is used for both 
works. (Delambre [2] had earlier noted this same concordance of wording, but as 
his work predates Tannery’s suggested interpolation, he credits Hypatia with a set 
of Astronomical Tables.) If the Suda were referring to a Commentary on the 
Almagest, so the argument goes, then it would speak of the Syntaxis, rather than 
the Canon. (Syntaxis is the Greek name for the work we now know by its Arabic 
designation.) Against this, however, is the Canon of Parsimony and the fact that 
Book 3 of the Almagest has a strongly tabular character. 


9. DIOPHANTUS’ ARITHMETIC. We may also have some of Hypatia’s own 
writing from the Commentary on Diophantus. Diophantus’ major work is the 
Arithmetic, originally comprising thirteen books. Of these only six now survive from 
the Greek, and possibly part of another, now listed as separate, the Polygonal 
Numbers. Tannery [26] suggested that all existing manuscripts known to him 
derived from a common source and that that source was Hypatia’s Commentary. 
His careful “family tree” of the manuscripts was later modified in one detail and 
made available in the amended form in Heath’s Edition [7]. The presumption was 
that Books 7-13 are lost because Hypatia’s Commentary did not include them, 
much as Eutocius’ Commentary extended only to the first four books of the Conics. 
This hypothesis enjoyed a deal of support, and Vogel’s Article on Diophantus in 
the DSB simply accepts it. 

The basis for this theory was the Greek text and the fact that the Suda 
reference to Hypatia’s Commentary is the only mention of so ancient an edition. 
Sesiano [24, pp. 71-75] however queries this account. This is a matter of great 
controversy. The old theory will be presented first, but see the remarks at the end 
of this section. 

On the old story, the mathematical world of today owes Hypatia a great debt, 
for without her we would have much less of the works of Diophantus. But there is 
an obvious corollary. If what survives for us is Hypatia’s Commentary, then some 
of her work may appear there. It may be possible to see what is hers. One com- 
plication is that a later scribe was thought to have attempted to reconstruct 
Diophantus’ original text and thus to have systematically omitted material he 
judged to be interpolated. But “the distinction between text and scholia being 
sometimes difficult to draw, he included a good deal which should have been left 
out” [7, p. 14]. 

On this account, the most likely of the supposed interpolations to have come 
from Hypatia’s hand are two “student exercises” at the start of Book II. The first 
asks for the solution of the pair of simultaneous equations: 


$$x - y = a, \qquad x^2 - y^2 = (x - y) + b,$$


where a, b are known. The next is a minor generalization. It requires the solution 
of the pair of simultaneous equations:

$$x - y = a, \qquad x^2 - y^2 = m(x - y) + b,$$


where a, m and b are known. There is some evidence to link this problem to 
Hypatia: a nine-word phrase in the original Greek is identical with one from 
Euclid’s Data, which her father had edited. 
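Both exercises yield to the same elementary step, which a short sketch makes explicit. Since $x^2 - y^2 = (x - y)(x + y) = a(x + y)$, the second equation gives $x + y = m + b/a$, and x and y follow from their sum and difference. The Python below is a modern illustration only, not a reconstruction of the method in the Greek text; the function name `solve_difference_problem` is hypothetical and the sample values are chosen simply for illustration.

```python
from fractions import Fraction

def solve_difference_problem(a, b, m=1):
    """Solve x - y = a, x^2 - y^2 = m (x - y) + b for x and y (requires a != 0).

    Since x^2 - y^2 = (x - y)(x + y) = a (x + y), the second equation
    gives x + y = m + b / a; x and y then follow from sum and difference.
    """
    a, b, m = Fraction(a), Fraction(b), Fraction(m)
    total = m + b / a                       # this is x + y
    return (total + a) / 2, (total - a) / 2

# First exercise (m = 1) with illustrative values a = 2, b = 20.
x, y = solve_difference_problem(a=2, b=20)
print(x, y)  # 13/2 9/2
```

Setting m = 1 recovers the first exercise, which is why the second is only a minor generalization.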

Recent work by Roshdi Rashed, Sesiano and others has suggested that some of 
the lost books of Diophantus in fact survive in Arabic translations. This has led to 
very great and indeed bitter controversy. What is at issue (apart from the personal 
rivalries involved) is whether Diophantus or someone else wrote the newly discov- 
ered works and where they might fit into the fragment previously published. 
Sesiano and others are inclined to the view that if anything of Hypatia’s Commen- 
tary survives then it survives in the Arabic. There are no clear indications of what 
might be by her and what by Diophantus or by other scholiasts. Many of Sesiano’s 
conclusions are hotly disputed by Rashed [18]. However, tentative attributions of 
material to Hypatia all tend to accept the overall assessment reached above—that 
her contributions to mathematical knowledge itself were slight or non-existent. 


10. THE ASTROLABE. The other source for information about Hypatia’s mathe- 
matical activities is the correspondence of Synesius. 

There is a brief but telling reference to Hypatia in Synesius’ essay-letter 
De Dono Astrolabii. The name “astrolabe” was a term applied to a variety of
instruments. For a good overview of later developments, see [17]; earlier ones are
discussed by Neugebauer [15]. A simple attempt to replicate the motions of the
heavens in a mechanical model produces the device known as an “armillary
sphere”. Such an object is necessarily 3-dimensional and unwieldy, more suitable
for display purposes than for use as a practical instrument of observation or 
computation. 

However, once we have a theory of stereographic projection, the way is open for
the construction of a more practical two-dimensional device. This theory was given
by Ptolemy in his Planisphaerium, which even includes tabular material. Whether
Ptolemy went on to develop the “little astrolabe” (i.e. the practical instrument) has
been argued. Neugebauer regards it as probable that he did.
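For readers unfamiliar with stereographic projection, a minimal modern sketch may help. It assumes the convention usual for planispheric astrolabes, namely projection from the south celestial pole onto the plane of the celestial equator; that convention, the coordinate names and the function `stereographic` are assumptions of this illustration and are not taken from the sources discussed here.

```python
import math

def stereographic(declination_deg: float, hour_angle_deg: float) -> tuple[float, float]:
    """Project a point of the unit celestial sphere onto the equatorial plane.

    The projection is taken from the south celestial pole, so the north
    pole maps to the origin and the celestial equator to the unit circle.
    """
    dec = math.radians(declination_deg)
    ha = math.radians(hour_angle_deg)
    x = math.cos(dec) * math.cos(ha)
    y = math.cos(dec) * math.sin(ha)
    z = math.sin(dec)
    return x / (1 + z), y / (1 + z)

# Points of equal declination land on a circle centred on the projected pole.
for dec in (23.44, 0.0, -23.44):   # the two tropics and the equator, roughly
    print(dec, math.hypot(*stereographic(dec, 0.0)))
```

Each circle of constant declination becomes a circle in the plane, which is what allows the sphere of the heavens to be drawn on a flat plate.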

The next figure is Theon. Ptolemy died in about 170 A.D., about two centuries 
before Theon’s active period. Theon wrote, as we have seen, Commentaries on 
the Almagest and the Handy Tables. The Suda also credits him with a treatise on 
the little astrolabe, and Arab sources refer in addition to a work of his on the 
armillary sphere. This set corresponds exactly to the set of works assigned to 
Ptolemy by the Arabs. 

There is thus considerable evidence that Theon was familiar with the theory of 
the little astrolabe. We might speculate that he invented it, but the picture of 
Theon that has come down to us is one of Theon as a disseminator and conserver 
of knowledge, rather than an innovator. Moreover, Neugebauer has given us
grounds to believe Ptolemy to have been the inventor. 

Although Theon’s work on the astrolabe is now regarded as lost, Neugebauer 
finds such similarities between later works that they must derive from a common 
source. This source he believes to be Theon. He further argues (because of the 
exact correspondence described above) that what Theon wrote was a Commentary 
on an earlier book by Ptolemy. 

This gives us the background to Synesius’ De Dono Astrolabii. Writing to 
Paionos, he states that he designed the astrolabe himself with help from Hypatia 
and had it crafted by the very best of silversmiths. The inference is that the theory
of the astrolabe and the details of its construction were passed down from 
Ptolemy, via Theon, to Hypatia, who in her turn taught Synesius. 


11. THE HYDROSCOPE. Letter 15 of Synesius begins: “I am reduced to this, 
that I have to have a hydroscope.” The letter then goes on to ask her to make him 
one, to quite detailed specifications. The question of what he needed is puzzling. 
The general presumption is that he was ill. 

The term “hydroscope” usually implies a clepsydra or water-clock, but this
seems inappropriate as a translation in this case. Why should he be, even if 
brought so low, in such urgent need of a water-clock? FitzGerald believes that 
Fermat (yes, the Fermat) [3] is right in suggesting that what Synesius needed was a 
hydrometer, that is to say, a densimeter. This makes much more sense of the 
specifications, which refer to the need to measure the weight of the water (the 
clepsydra measures the volume), and describe an instrument that sounds very like 
a hydrometer. 

The suggestion is that Synesius needed it in his illness somehow to measure a 
medicine he was taking (or less plausibly the salinity of his drinking water). 
Hydrometers are now used, as they well may then have been, to measure the 
alcoholic contents of fermented or distilled liquors. Possibly Synesius was making 
his own medicine by some such means. My friend and colleague Charles Hunter 
(Department of Anatomy, Monash University) however offers a novel 
suggestion—that the “hydroscope” was in fact a urinometer and that the dosage of 
some diuretic was calculated by reference to the specific gravity of the urine. 


12. ASSESSMENT. What we know of Hypatia is little enough; what we know of 
her Mathematics is only a small subset of that little. There is evidence that she was 
greatly regarded as a teacher and a scholar. The range of her acknowledged 
expertise was considerable. She edited works of Geometry, Algebra and Astronomy, 
knew how to make astrolabes and “hydroscopes”, and did a lot else besides. 
One cannot but be impressed with this breadth of interest. Moreover, at the time 
of her death (assuming with Toomer [28] that Theon pre-deceased her) she was in 
fact the greatest mathematician then living in the Greco-Roman world, very likely 
the world as a whole. 

She is variously described as a philosopher, a teacher of Philosophy, a mathe- 
matician and astronomer, a learned woman and a geometer. 

We can understand the term “philosopher” in two senses: it has the technical 
sense that it retains to this day, but it also has a generic meaning of “thinker”. 
Theon also is described in the sources as a philosopher. But this is surely in the 
second sense; Theon clearly emerges as a specialist mathematician and astronomer, 
as the Suda goes on to say. Hypatia does not (unless one accords weight to 
the quote in Section 6 above); the Suda is at some pains to make this clear. ‘She 
also took up other [non-mathematical] branches of philosophy and though a 
woman she cast [an academic robe] around herself and appeared in the centre of 
the city’ (Rist’s translation)—the Suda then proceeds to describe the Philosophy 
she taught, mentioning the work of Plato and Aristotle, in particular. 

However, if we restrict consideration to Mathematics alone, we may well query 
the usual judgment that Hypatia outclassed her father. It comes from the PG and 
modern sources regularly repeat it uncritically. We may also deduce it from 
Theon’s heading to his Commentary of Book 3 of the Almagest. 
We may still however dispute this opinion and indeed argue the opposite. That 
a fond father might recognise and promote his daughter’s improvement of one of 
his own works is understandable enough. That ecclesiastical historians, of whom 
we have no evidence of mathematical ability, might use fame or even notoriety as 
an index of talent is equally so. But this does not end the matter. 

While it is of course too much to posit a universal theory of natural selection of 
scholarly works (it being by no means always true that the best works are the 
survivors) nonetheless scholars of earlier times preserved, translated and taught 
from those works they adjudged as valuable. Much as we do today. In fact, we do 
know something of the principle of natural selection that operated. Because the 
focus had moved from research to conservation, those works were preserved that 
were well regarded as textbooks [29]. Many research works from the period are 
lost. 

We have no evidence of research Mathematics on the part of either father or 
daughter. What we can reconstruct of their Mathematics suggests to us that they 
edited, preserved, taught from and supplied minor addenda to the works of others. 
A great deal of Theon’s work survives and at most a small part of Hypatia’s. In 
other words Theon was seen as the better text-writer, even if he himself generously 
demurred in one case. 

Where Hypatia does quite clearly outshine Theon is in her reputation as a 
teacher. She was revered as such and no similar endorsement of Theon has come 
down to us. (It is perfectly possible that this is the basis of the original statement.) 
We are left with a well-attested account of a popular, charismatic and versatile 
teacher. And that, I suggest, is the best picture we can form of her. 
Addendum: Too late for mention in the main article, I was made aware of the lengthy discussion of 
Hypatia by W. R. Knorr [Textual Studies in Ancient and Medieval Geometry (Boston: Birkhäuser, 1989)]. 
Beginning from a stylistic analysis of Book Three of Theon’s Commentary on the Almagest, Knorr 
builds an elaborate and detailed, though speculative, argument to attribute several other lost works to 
Hypatia. In particular, he suggests that Eutocius’ Commentary on Apollonius’ Conics in fact derives 
from Hypatia’s earlier Commentary, the one mentioned in the Suda. 

I thank Win Frost of the University of Newcastle (Australia) for bringing Knorr’s work to my 
attention. 
“The main duty of the historian of mathematics, 
as well as his fondest privilege, is 
to explain the humanity of mathematics, to 
illustrate its greatness, beauty and dignity, 
and to describe how the incessant efforts 
and accumulated genius of many generations 
have built up that magnificent monument, 
the object of our most legitimate 
pride as men, and of our wonder, humility 
and thankfulness, as individuals. The study 
of the history of mathematics will not 
make better mathematicians but gentler 
ones, it will enrich their minds, mellow 
their hearts, and bring out their finer 
qualities.” 


—G. Sarton 
Calculus II and Euler also 
(with a Nod to Series Integral 
Remainder Bounds) 


Richard Barshinger 


A classical theorem of Maclaurin and Cauchy [8, p. 45] states that, if $f(x)$ is 
positive and decreases to zero, then 

$$\gamma_f = \lim_{n\to\infty}\left\{\sum_{k=1}^{n} f(k) - \int_{1}^{n} f(x)\,dx\right\}$$

exists. The constant $\gamma_f$ is called an Euler constant, the original belonging to 
$f(n) = 1/n$ and having the value $\gamma = 0.57721566490\ldots$. [$\gamma$ arises, in applied 
mathematics, in the formulation of Bessel functions and the gamma function, 
among others.] Papers such as [2], [3], [4], and [14] have discussed these constants, 
as well as more generally considering rates of growth, for divergent series in 
particular. 

This paper considers a suitable approach by which the computation of the 
canonical Euler constant (corresponding to $\lim_{n\to\infty}\{\sum_{k=1}^{n} 1/k - \ln n\}$, a classical 
indeterminacy of the form $\infty - \infty$) can find its way, in a pedagogically sound and 
useful fashion, into a first calculus course. Though a number of authors have used 
various intermediate to advanced techniques to explore some aspects of $\sum_{n=1}^{\infty} 1/n$ 
and $\gamma$, we use an elementary geometrical technique for accelerating the convergence 
of the above limit. 


Evaluation of $\gamma$, Method 0 (brute force, almost!). Any student knows, or has to 
be taught, that most any computational device will claim that an infinite series 
whose terms decrease to zero will converge. Consequently, it is not sufficient 
simply to compute successive evaluations of $\sum_{k=1}^{n} 1/k - \ln n$. It is, however, clear 
that, for all $n$, 

$$\sum_{k=1}^{n}\frac{1}{k} - \ln(n+1) < \gamma < \sum_{k=1}^{n}\frac{1}{k} - \ln n.$$


It is easy to show that the two sequences are strictly monotonically increasing and 
monotonically decreasing, respectively (see, for example, [13], p. 669), and that 
both converge to $\gamma$. The following table of bounds can easily be classroom 
generated, when using, for example, the Sequences program in MicroCalc [6] and 
run on a 33 MHz 486 microcomputer. 


     n    lower bound   upper bound   γ 
  1800    0.576938      0.577493      0.577 (rounded) 
  2313    0.577000      0.577423      0.577 (truncated) 
  7557    0.577150      0.577282 
 14778    0.577182      0.577249      0.5772 (rounded) 


Computing about 15,000 terms of each of two sequences is, if nothing else, rather 
esthetically unattractive, however; so, alternatively, we might average the above 
pairs of values, which gives $\gamma = 0.577$ (truncated) as early as the 10th iterate. [This 
is based on $\sum_{k=1}^{n} 1/k - \ln\sqrt{n(n+1)}$ and involves the geometric mean between $n$ 
and $n + 1$.] 
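
The Method 0 bounds are easy to reproduce outside MicroCalc. What follows is a minimal Python sketch, not the author's program; the names `method0_bounds` and `averaged_estimate` are illustrative assumptions, and `GAMMA` is simply the value of the constant quoted above.

```python
import math

GAMMA = 0.57721566490  # value of Euler's constant quoted in the text


def method0_bounds(n):
    """Return (H_n - ln(n+1), H_n - ln n), the Method 0 lower and upper bounds."""
    harmonic = math.fsum(1.0 / k for k in range(1, n + 1))
    return harmonic - math.log(n + 1), harmonic - math.log(n)


def averaged_estimate(n):
    """Average of the two bounds, which equals H_n - ln sqrt(n(n+1))."""
    lower, upper = method0_bounds(n)
    return 0.5 * (lower + upper)


if __name__ == "__main__":
    # Rows of the table above, plus the averaged estimate for comparison.
    for n in (1800, 2313, 7557, 14778):
        lower, upper = method0_bounds(n)
        print(f"{n:6d}  {lower:.6f}  {upper:.6f}  {averaged_estimate(n):.6f}")
```

Running the loop for other values of $n$ shows how slowly the gap between the two bounds closes, which is the motivation for the methods that follow.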


Error Bounds for $\gamma$, Method 1. From a problem in [12, p. 344 & p. 621], based on 
Figure 1, it easily follows that $1/2 < \gamma < 1$. It is quite straightforward to improve 
substantially on these very crude bounds. Figure 2 shows that we may calculate a 
lower bound for $\gamma$ by drawing secant lines from the endpoints of $f(x) = 1/x$ on 
the interval $[n, n + 1]$ to the midpoint of the curve. The area above the secant 
lines and below the horizontal line $y = 1/n$ is smaller than the contribution to the 
value of $\gamma$ obtained by taking the area between the curve and $y = 1/n$. Similarly, an upper 
bound for $\gamma$ may be obtained by calculating the area between $y = 1/n$ and the 
two intersecting lines that are drawn tangent to the curve, at $x = n$ and $x = n + 1$ 
respectively. Summing over $n$, we obtain 


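$$\sum_{n=1}^{\infty}\left[\frac{1}{4}\left(\frac{1}{n} - \frac{1}{n+1}\right) + \frac{1}{2}\cdot\frac{1}{n(2n+1)}\right] < \gamma < \sum_{n=1}^{\infty}\frac{1}{n(2n+1)}. \tag{1}$$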


Papers such as [1], [5], [9], [10] & [11] have considered various ways to accelerate 
the convergence of a series. Let $f(n)$ be the generator of the terms of a convergent 
series and $R_n$ be the remainder after the $n$th partial sum $S_n$. Assume $f$ and $|f'|$ 
both decrease to zero ($f$ is concave up). These inequalities, presented for archival 
purposes, then summarize the papers cited above. 


[f<[f-oy<fo fern < fo feats —af er 


n+ 


0 n+1 
<R,<f  ftif(n+1—-3f"" f 
n+1/2 n+1/2 


< fo f+ flat) + aif n+ DO 


<ff<[F- Lf(n +1) —4f'(n + 1)| < [of sf +1) <|f 


(2) 
Figure 1          Figure 2 
The weakest bounds are the basic elementary integral error bounds; the strongest 
bounds are newly computed by the author and are based on a suggestion in [1, 
p. 89]. 

Depending on the series in question, various bounds in (2) may give substantial, 
or only marginal, improvement over the use of other bounds also given in the 
above. For pedagogical purposes the author prefers the following inequality 
because of its clear improvement to the basic integral error bounds given in 
virtually all calculus texts. If $S = S_n + R_n$, papers [9] and [10] imply that 

$$S_n + \int_{n+1}^{\infty} f + \tfrac{1}{2} f(n+1) < S < S_n + \int_{n}^{\infty} f - \tfrac{1}{2} f(n+1). \tag{3}$$


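We adapt (3) to relationship (1) by letting $\underline{S}$, $S$ and $\overline{S}$ be the sums of three series 
for which $\underline{S} < S < \overline{S}$. Also let $\underline{f}(x)$, $\overline{f}(x)$, $\underline{S}_n$ and $\overline{S}_n$ be the generators and partial 
sums for $\underline{S}$ and $\overline{S}$ respectively. Then 

$$\underline{S}_n + \int_{n+1}^{\infty} \underline{f} + \tfrac{1}{2}\,\underline{f}(n+1) < S < \overline{S}_n + \int_{n}^{\infty} \overline{f} - \tfrac{1}{2}\,\overline{f}(n+1). \tag{4}$$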


Applying (4) to (1) and summing the telescoping series, we obtain 

$$\frac{1}{4} + \frac{1}{2}\left[\sum_{k=1}^{n}\frac{1}{k(2k+1)} + \ln\frac{2n+3}{2n+2} + \frac{1}{(2n+2)(2n+3)}\right] < \gamma < \sum_{k=1}^{n}\frac{1}{k(2k+1)} + \ln\frac{2n+1}{2n} - \frac{1}{(2n+2)(2n+3)}. \tag{5}$$


This table easily follows. 


  n    lower bound   upper bound 
  1    0.553238      0.688798      (all values are rounded) 
  2    0.555647      0.632667 
 10    0.556824      0.614033 
 49    0.556852      0.613709 
 50    0.556853      0.613709 
 77    0.556853      0.613707 
 78    0.556853      0.613706 


After iteration 78 there is no further improvement; hence this method gives 
$0.55685 < \gamma < 0.61371$, with the lower bound conventionally truncated and the 
upper bound always rounded up. 
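
The table for (5) can be regenerated with a few lines of code. The following Python sketch evaluates the two sides of (5) as reconstructed above; the function name `method1_bounds` is an assumption made for illustration, and the limiting values $5/4 - \ln 2$ and $2 - 2\ln 2$ are the exact values of the bounds in (1) noted later in the paper.

```python
import math


def method1_bounds(n):
    """Return the lower and upper bounds for gamma given by inequality (5)."""
    partial = math.fsum(1.0 / (k * (2 * k + 1)) for k in range(1, n + 1))
    half_term = 1.0 / ((2 * n + 2) * (2 * n + 3))
    lower = 0.25 + 0.5 * (partial + math.log((2 * n + 3) / (2 * n + 2)) + half_term)
    upper = partial + math.log((2 * n + 1) / (2 * n)) - half_term
    return lower, upper


if __name__ == "__main__":
    for n in (1, 2, 10, 49, 50, 77, 78):
        lower, upper = method1_bounds(n)
        print(f"{n:3d}  {lower:.6f}  {upper:.6f}")
    # The bounds stall at these limits, which bracket gamma but do not reach it.
    print(1.25 - math.log(2.0), 2.0 - 2.0 * math.log(2.0))
```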

Method 1 is thus less satisfactory even than Method 0, with its almost 15,000 
iterations for four place rounded accuracy, because its bounds do not converge to 
$\gamma$ at all. The following modification 
of the above technique, however, leads to bounds that converge to $\gamma$ itself. 


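Convergence to $\gamma$, Method 1’. The difficulty with Method 1 is that the intervals 
(that generate those series giving lower and upper bounds for $\gamma$) are fixed at unit 
width, so that, on the interval $[n, n + 1]$, the contributions to the bounds are poor 
when $n$ is small. A way to remedy this is to replace the partial sums $\underline{S}_n$ and $\overline{S}_n$ in 
(4) by $\gamma_n = \sum_{k=1}^{n} 1/k - \ln(n + 1)$. From Figure 2, we have $\underline{S}_n < \gamma_n < \overline{S}_n$, 
$\underline{R}_n < \gamma_R < \overline{R}_n$ (where $\gamma_R = \gamma - \gamma_n$), and, with (4) above, easily see that 

$$\begin{aligned}
\underline{S}_n + \int_{n+1}^{\infty} \underline{f} + \tfrac{1}{2}\,\underline{f}(n+1)
 &< \gamma_n + \int_{n+1}^{\infty} \underline{f} + \tfrac{1}{2}\,\underline{f}(n+1)
 < \gamma_n + \underline{R}_n < \gamma_n + \gamma_R = \gamma \\
 &< \gamma_n + \overline{R}_n
 < \gamma_n + \int_{n}^{\infty} \overline{f} - \tfrac{1}{2}\,\overline{f}(n+1)
 < \overline{S}_n + \int_{n}^{\infty} \overline{f} - \tfrac{1}{2}\,\overline{f}(n+1).
\end{aligned} \tag{6}$$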
Consequently, the bounds given by that part of (6) which is 

$$\gamma_n + \int_{n+1}^{\infty} \underline{f} + \tfrac{1}{2}\,\underline{f}(n+1) < \gamma < \gamma_n + \int_{n}^{\infty} \overline{f} - \tfrac{1}{2}\,\overline{f}(n+1) \tag{7}$$

will converge to $\gamma$. By combining (1) with (7) we obtain 

$$\sum_{k=1}^{n}\frac{1}{k} - \ln(n+1) + \frac{1}{4(n+1)} + \frac{1}{2}\ln\frac{2n+3}{2n+2} + \frac{1}{(4n+4)(2n+3)} < \gamma < \sum_{k=1}^{n}\frac{1}{k} - \ln(n+1) + \ln\frac{2n+1}{2n} - \frac{1}{(2n+2)(2n+3)}.$$


From this we compute 


   n    lower bound   upper bound   γ 
   1    0.568425      0.662318 
   2    0.573701      0.600722 
   9    0.576969      0.578069 
  10    0.577014      0.577887      0.577 (truncated) 
  14    0.577111      0.577528 
  15    0.577125      0.577483      0.577 (rounded) 
  35    0.577199      0.577257 
  36    0.577200      0.577254      0.5772 (truncated) 
 107    0.577214      0.577220 
 108    0.577214      0.577219      0.5772 (rounded) 
 355    0.577215      0.577216 
 356    0.577216      0.577216      0.577216 (rounded) 
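
As a check on the Method 1’ table, the two bounds can be computed directly. This is a hedged Python sketch of the inequality obtained by combining (1) with (7); `method1_prime_bounds` is a hypothetical name, and `GAMMA` is the value of the constant quoted at the start of the paper.

```python
import math

GAMMA = 0.57721566490  # value of Euler's constant quoted in the text


def method1_prime_bounds(n):
    """Return the Method 1' lower and upper bounds for gamma after n terms."""
    gamma_n = math.fsum(1.0 / k for k in range(1, n + 1)) - math.log(n + 1)
    lower = (gamma_n
             + 1.0 / (4 * (n + 1))
             + 0.5 * math.log((2 * n + 3) / (2 * n + 2))
             + 1.0 / ((4 * n + 4) * (2 * n + 3)))
    upper = (gamma_n
             + math.log((2 * n + 1) / (2 * n))
             - 1.0 / ((2 * n + 2) * (2 * n + 3)))
    return lower, upper


if __name__ == "__main__":
    for n in (1, 2, 9, 10, 14, 15, 35, 36, 107, 108, 355, 356):
        lower, upper = method1_prime_bounds(n)
        print(f"{n:4d}  {lower:.6f}  {upper:.6f}")
```

Unlike the Method 1 bounds, both columns here keep moving toward $\gamma$ as $n$ grows.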


In order to accelerate the convergence further, we could bisect the interval 
$[n, n + 1]$ and repeat Method 1 for the subintervals $[n, n + 1/2]$ and $[n + 1/2, 
n + 1]$, from which 

$$\frac{1}{8}\sum_{n=1}^{\infty}\left(\frac{1}{n} - \frac{1}{n+1}\right) + \frac{1}{4}\sum_{n=1}^{\infty}\frac{48n^2 + 44n + 9}{n(2n+1)(4n+1)(4n+3)} < \gamma < \sum_{n=1}^{\infty}\frac{8n+3}{n(4n+1)(4n+3)},$$


which converge, however, to values strictly lower/higher than $\gamma$. We then employ 
Method 1’, using the integral remainder bounds given by (4) and replacing the 
partial sums by $\gamma_n$ to get 

$$\gamma_n + \frac{1}{8(n+1)} + \frac{1}{4}\ln\frac{(2n+3)(4n+5)(4n+7)}{32(n+1)^3} + \frac{48n^2 + 140n + 101}{8(n+1)(2n+3)(4n+5)(4n+7)} < \gamma < \gamma_n + \frac{1}{2}\ln\frac{(4n+1)(4n+3)}{16n^2} - \frac{8n+11}{2(n+1)(4n+5)(4n+7)}.$$


Convergence to a six-place (rounded) accurate value for $\gamma$ is by the 183rd iterate. 
Informally, this appears to imply that the method is $O(h)$ in the step size $h$ (at least 
for this example). 
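
The bisected bounds are awkward to evaluate by hand, so a short program is a natural companion. The Python sketch below implements the last pair of inequalities as reconstructed above; `bisected_bounds` is an illustrative name, not anything from the paper, and the sketch only confirms that the bounds bracket the quoted value of $\gamma$ and tighten with $n$.

```python
import math

GAMMA = 0.57721566490  # value of Euler's constant quoted in the text


def bisected_bounds(n):
    """Return the Method 1' bounds built on the half-width subintervals."""
    gamma_n = math.fsum(1.0 / k for k in range(1, n + 1)) - math.log(n + 1)
    m = n + 1
    cubic = (2 * n + 3) * (4 * n + 5) * (4 * n + 7)
    lower = (gamma_n
             + 1.0 / (8 * m)
             + 0.25 * math.log(cubic / (32.0 * m ** 3))
             + (48 * n * n + 140 * n + 101) / (8.0 * m * cubic))
    upper = (gamma_n
             + 0.5 * math.log((4 * n + 1) * (4 * n + 3) / (16.0 * n * n))
             - (8 * n + 11) / (2.0 * m * (4 * n + 5) * (4 * n + 7)))
    return lower, upper


if __name__ == "__main__":
    for n in (1, 10, 100, 183):
        lower, upper = bisected_bounds(n)
        print(f"{n:4d}  {lower:.6f}  {upper:.6f}")
```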
Other Examples and Extensions. The techniques described in Methods 1 and 1’ 
above can be used to evaluate the Euler constant for a convergent series and, 
therefore, to accelerate finding the value of the series itself. For example, consider 
$\sum_{n=1}^{\infty} 1/n^2$, whose value is $\pi^2/6$. The computations for the Euler constant, given by 
Method 1’, converge to 0.644934 (rounded) by the 79th iteration. Adding a value of 
one thus gives an approximate value to the series. By contrast, calculations done in 
[5] and based on a slightly weaker form of (4) generate the same value by the 124th 
iteration. The above method thus converges faster, though at the expense of more 
analytical work. 

Method 1 may also be used to accelerate the convergence of series that we do 
not normally associate with convergence tests for series of positive terms. For 
example, consider the alternating harmonic series $\sum_{n=1}^{\infty} (-1)^{n+1}/n = \ln 2$. We may 
rewrite this series in the form $\sum_{n=1}^{\infty} [1/(2n - 1) - 1/(2n)] = \sum_{n=1}^{\infty} 1/[2n(2n - 1)]$. 
[It is not by accident that we choose this example; its similarity to the series in (1) 
is clear. In fact, the bounds in (1) can be shown to be related to the alternating 
harmonic series and to have the exact values of $5/4 - \ln 2$ and $2 - 2\ln 2$, respectively.] 
When we apply (3) to the series in this form, we obtain 

$$\sum_{k=1}^{n}\frac{1}{2k(2k-1)} + \frac{1}{2}\ln\frac{2n+2}{2n+1} + \frac{1}{(4n+4)(2n+1)} < \ln 2 < \sum_{k=1}^{n}\frac{1}{2k(2k-1)} + \frac{1}{2}\ln\frac{2n}{2n-1} - \frac{1}{(4n+4)(2n+1)}.$$


Convergence to $\ln 2 = 0.693147$ (rounded) is by the 87th iterate. [If, instead, we 
use the strongest error bounds from (2) for a similar computation, we obtain the 
above value as early as the 25th iterate, a substantial improvement. In addition, see 
[7] for another approach to the evaluation of $\ln 2$ by using various other series to 
estimate remainders.] 
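
The $\ln 2$ bounds obtained from (3) can be tried out in the same way. The sketch below is an illustration only; `ln2_bounds` is a hypothetical name, and the code uses `math.log(2.0)` purely as the reference value against which the bracket is checked.

```python
import math


def ln2_bounds(n):
    """Return the bounds for ln 2 from (3) applied to sum 1/[2k(2k-1)]."""
    partial = math.fsum(1.0 / (2 * k * (2 * k - 1)) for k in range(1, n + 1))
    half_term = 1.0 / ((4 * n + 4) * (2 * n + 1))
    lower = partial + 0.5 * math.log((2 * n + 2) / (2 * n + 1)) + half_term
    upper = partial + 0.5 * math.log((2 * n) / (2 * n - 1)) - half_term
    return lower, upper


if __name__ == "__main__":
    for n in (1, 5, 25, 87):
        lower, upper = ln2_bounds(n)
        print(f"{n:3d}  {lower:.6f}  {upper:.6f}")
```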


Conclusions. It is to be hoped that, with the ease and availability of machine 
computing, a somewhat more sophisticated approach to the evaluation of series 
may take place in the current calculus classroom. Error bounds such as those given 
in (3) or, indeed, any mix or match of bounds given in (2) are within the grasp of 
the student. In addition, the sec/tan method (Method 1’) for accelerating the 
convergence of the Euler constant perhaps merits some consideration because of 
its easily visualized geometry. 


REFERENCES 


 1. R. P. Boas, Jr., Estimating Remainders, Math. Mag., 51 (1978) 83-89. 
 2. R. P. Boas, Jr., Partial sums of infinite series and how they grow, this MONTHLY, 84 (1977) 
    237-258. 
 3. R. P. Boas, Jr., Growth of partial sums of divergent series, Math. of Comp., 31 (1977) 257-264. 
 4. R. P. Boas, Jr., and J. W. Wrench, Jr., Partial sums of the harmonic series, this MONTHLY, 78 
    (1971) 864-870. 
 5. R. J. Collins, Approximating series, Coll. Math. J., 23 (1992) 153-157. 
 6. H. Flanders, MicroCalc ver. 5.1, MathCalcEduc, Ann Arbor, MI, 1990. 
 7. C. Goldsmith, Calculation of ln 2 and π, Math. Gaz., 55 (1971) 434-436. 
 8. G. H. Hardy, Orders of Infinity, 2nd ed., Cambridge Univ. Press, New York, 1924. 
 9. R. K. Morley, The remainder in computing by series, this MONTHLY, 57 (1950) 550-551; repr. in 
    Selected Papers on Calculus, 1968, 324-325. 
10. R. K. Morley, Further note on the remainder in computing by series, this MONTHLY, 58 (1951) 
    410-412. 
11. D. A. Smith, INTERFACE: Calculus and the Computer, 2nd ed., Saunders, New York, 1984. 
12. S. K. Stein and A. Barcellos, Calculus and Analytic Geometry, 5th ed., McGraw-Hill, New York, 
    1992. 
13. G. B. Thomas and R. L. Finney, Calculus and Analytic Geometry, 7th ed., Addison-Wesley, 
    Reading, MA, 1989. 
14. S. R. Tims and J. A. Tyrrell, Approximate evaluation of Euler’s Constant, Math. Gaz., 55 (1971) 
    65-67. 

Penn State-Scranton 
120 Ridge View Drive 
Dunmore, PA 18512 


The Prisoner’s Paradox Revisited 


Awaiting the dawn sat three prisoners wary, 
A trio of brigands named Tom, Dick and Mary. 
Sunrise would signal the death knell of two, 
Just one would survive, the question was who. 

Young Mary sat thinking and finally spoke. 
To the jailer she said, “You may think this a joke, 
But it seems that my odds of surviving ’til tea, 
Are clearly enough just one out of three. 

But one of my cohorts must certainly go, 
Without question, that’s something I already know. 
Telling the name of one who is lost, 
Can’t possibly help me. What could it cost?” 

The shriveled old jailer himself was no dummy, 
He thought, “But why not?” and pointed to Tommy. 
“Now it’s just Dick and I,” Mary chortled with glee, 
“One in two are my chances, and not one in three!” 

Imagine the jailer’s chagrin, that old elf. 
She’d tricked him, or had she? Decide for yourself. 


Richard E. Bedient 
Department of Mathematics 
and Computer Science 


Hamilton College 
Clinton, NY 13323 
A Focusing Property of the Ellipse 


Marc Frantz 


INTRODUCTION. In this article some elementary mathematics will be used to 
establish and quantify a focusing property of ellipses and ellipsoids. This property 
is evidently unknown, even though it is quite simple and apparently significant. 
Possible areas of application include solar energy collection and laser technology; 
we also use our results to solve an open problem recently posed in this journal. It 
is a well-known fact from geometry that a light ray which leaves a focus $F_1$ of an 
ellipse will be reflected to the other focus $F_2$. This is illustrated by the curve 
$F_1R_1F_2$ in Figure 1. The goal of this article is to describe the long-term behavior 
of such a light ray, and to describe the corresponding long-term behavior of an 
idealized spherical wave emitted from one focus of an ellipsoid of revolution. 


Figure 1 


In a qualitative sense, the essential features of the “future history” of such a ray 
are depicted in Figure 2, where the ray $F_1R_1R_2R_3R_4R_5$ has been drawn until just 
before the fifth reflection at $R_5$. Given the ellipse and foci, the ray is easy to 
sketch by hand, since it must always travel from focus to focus. In the process of 
sketching such rays, one discovers that the same thing always happens to any ray, 
namely, its path eventually flattens out until it is essentially moving back and forth 
along the major axis of the ellipse. 


Figure 2 

To make this assertion more precise, we define the departure angle from a focus, 
of a ray which has just left that focus, to be the smallest positive angle subtended 
by the departing ray and a “reference ray” from that focus to the opposite focus. 
For example, the successive departure angles of the ray in Figure 2 are $\angle F_2F_1R_1$, 
$\angle F_1F_2R_2$, $\angle F_2F_1R_3$, $\angle F_1F_2R_4$, and $\angle F_2F_1R_5$, which is nearly equal to $\pi$. Thus 
departure angles range from 0 to $\pi$, and for each departure angle in $(0, \pi)$ there 
are two corresponding rays which can depart from a particular focus. This 
ambiguity will cause no problem for us, since what we wish to show is that the 
successive departure angles of almost any ray tend to $\pi$, just as they did in 
Figure 2. 


Proposition 1. Let a light ray leave a focus of an ellipse with departure angle 
$\theta_0 \in (0, \pi)$, and let the successive departure angles of the ray be $\theta_1, \theta_2, \ldots$. Then 
$\theta_n \uparrow \pi$. 


Proof: From Figure 3 we see that if $\theta_n \in (0, \pi)$, then $\theta_n = \theta_{n+1} - \alpha_n < \theta_{n+1} < \pi$, 
and thus $\theta_{n+1} \in (0, \pi)$ and $\theta_n \uparrow$. Since the sequence $\{\theta_n\}$ is bounded, it must 
converge, and hence $\alpha_n \to 0$. By the law of sines applied to triangle $F_1R_{n+1}F_2$, we 
then have 

$$\frac{\sin\theta_n}{s_n} = \frac{\sin\alpha_n}{d(F_1, F_2)}.$$


Now the sequence of lengths $s_n$ is clearly bounded, so $\sin\theta_n \to 0$, and since $\theta_n \uparrow$, 
this implies $\theta_n \uparrow \pi$. 


Figure 3 


Remark. An immediate application of Proposition 1 is the solution of an open 
problem posed by J. E. Connett in [1]. The negative form of the question asked 
there is: Can a perfectly reflecting container be designed which will admit a beam 
of parallel light rays from some particular direction, but will not allow any ray from 
that beam to escape, regardless of how many times it is reflected? Such a 
counterexample is indeed possible, and a two-dimensional version of such an 
object appears in Figure 4. 


Figure 4. A light trap. 


The smooth curve in the figure is constructed using a “primary ellipse” 
with equation $x^2/a^2 + y^2/b^2 = 1$, where $a > b > 0$, a parabola with equation 
$(x + c)^2 = 4a(y + a)$, where $c = \sqrt{a^2 - b^2}$, and a “secondary ellipse” with equation 
$(x + c)^2/(a - c)^2 + y^2/a^2 = 1$, which is used merely to make a smooth 
connection. The arcs $V_1A$ and $BV_2C$ belong to the primary ellipse, which has foci 
$F_1 = (-c, 0)$ and $F_2 = (c, 0)$. Arc $CD$ belongs to the parabola, which has focus $F_1$ 
and vertex $D$, and arc $DV_1$ belongs to the secondary ellipse. The reader may 
check that the connecting points are: $V_1 = (-a, 0)$, $D = (-c, -a)$, $C = 
(c, -b^2/a)$, and that the arcs have common tangents at these points. Endpoints $A$ 
and $B$ belong to the curve and are directly above the focal points. Thus any light 
ray entering vertically must strike the parabola and be reflected to $F_1$. The ray 
then strikes the upper left quadrant of the primary ellipse, and is reflected through 
$F_2$ to the lower right quadrant of the same ellipse. By Proposition 1 and the 
geometry of the figure, we see that subsequent reflection points are either in the 
upper left quadrant converging downward to vertex $V_1$, or in the lower right 
quadrant converging upward to vertex $V_2$. Hence the light ray gradually approaches 
a horizontal trajectory and never leaves the “container”. A three-dimensional 
version of the container is easily made by using the curve to generate a 
cylinder. 
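
The check left to the reader can also be done numerically. The Python sketch below writes each of the three curves implicitly as $G(x, y) = 0$ and tests, at $V_1$, $D$ and $C$, that both adjoining curves pass through the point and that their gradients are parallel (which is equivalent to a common tangent). The function name `light_trap_residuals` and the sample values of $a$ and $b$ are assumptions chosen for illustration.

```python
import math


def light_trap_residuals(a, b):
    """Check the three junctions of the light-trap curve for given a > b > 0.

    Returns (largest |G(P)|, largest |cross product of gradients|) over the
    junction points V_1, D and C. Both numbers should be zero up to rounding.
    """
    c = math.sqrt(a * a - b * b)

    def primary(x, y):
        return x * x / a**2 + y * y / b**2 - 1.0, (2 * x / a**2, 2 * y / b**2)

    def parabola(x, y):
        return (x + c) ** 2 - 4 * a * (y + a), (2 * (x + c), -4.0 * a)

    def secondary(x, y):
        value = (x + c) ** 2 / (a - c) ** 2 + y * y / a**2 - 1.0
        return value, (2 * (x + c) / (a - c) ** 2, 2 * y / a**2)

    junctions = [
        ((-a, 0.0), primary, secondary),       # V_1
        ((-c, -a), parabola, secondary),       # D
        ((c, -b * b / a), primary, parabola),  # C
    ]
    worst_value = worst_cross = 0.0
    for (x, y), curve1, curve2 in junctions:
        v1, g1 = curve1(x, y)
        v2, g2 = curve2(x, y)
        cross = g1[0] * g2[1] - g1[1] * g2[0]
        worst_value = max(worst_value, abs(v1), abs(v2))
        worst_cross = max(worst_cross, abs(cross))
    return worst_value, worst_cross


if __name__ == "__main__":
    print(light_trap_residuals(2.0, 1.0))
```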

Let us now turn our attention to ellipsoids and examine the aspect of light 
intensity in terms of the convergence of $\{\theta_n\}$ described in Proposition 1. What we 
will see (Proposition 3) is that the intensity of a light wave emitted from a focus of 
an ellipsoid of revolution increases with each reflection at one particular point on 
the wave, called the point of concentration. This intensification, or focusing effect, 
is described by a simple, discrete-time exponential law which has apparently 
dramatic consequences. An illustration of this will be given at the end of the 
article. 

To begin with, given any initial departure angle $\theta$ it is possible to derive an 
explicit formula for the $n$th subsequent departure angle $\theta_n$; this formula is the tool 
that will enable us to give a precise description of the focusing process and to 
derive the exponential law mentioned above. To this end, consider the diagram in 
Figure 5, where we have drawn an arbitrary ellipse with eccentricity $\epsilon$ and major 
axis of length $2a$, with $d(F_1, F_2) = 2a\epsilon$. A ray leaves $F_1$ with departure angle $\theta$ 
and travels a distance $r$ before being reflected at $R_1$. Using polar coordinates with 
origin $F_1$, we can determine that 

$$r = \frac{a(1 - \epsilon^2)}{1 - \epsilon\cos\theta}. \tag{*}$$


Figure 5 


It is well-known that the ray must travel a distance $2a - r$ to $F_2$, where it arrives 
at an angle $\pi - \theta_1$ and departs at an angle $\theta_1$. 

Let us now derive a formula for the departure angle $\theta_1$ in terms of the 
departure angle $\theta$. Applying the law of cosines to the triangle $F_1R_1F_2$, and using 
the fact that $\cos(\pi - \theta_1) = -\cos\theta_1$, we have 

$$r^2 = (2a\epsilon)^2 + (2a - r)^2 + 2(2a\epsilon)(2a - r)\cos\theta_1.$$

The solution for $\cos\theta_1$ can be put in the form 

$$\cos\theta_1 = \frac{1 - a(1 + \epsilon^2)/r}{2a\epsilon/r - \epsilon}.$$


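Substituting the expression in (*) for $r$ gives 

$$\cos\theta_1 = \frac{1 - \dfrac{(1 + \epsilon^2)(1 - \epsilon\cos\theta)}{1 - \epsilon^2}}{\dfrac{2\epsilon(1 - \epsilon\cos\theta)}{1 - \epsilon^2} - \epsilon} = \frac{(1 + \epsilon^2)\cos\theta - 2\epsilon}{(1 + \epsilon^2) - 2\epsilon\cos\theta}. \tag{**}$$

To get a form of this equation which will be more useful to us, we now multiply the 
numerator and denominator of the last expression in (**) by $2/(1 - \epsilon)^2$ to obtain 

$$\cos\theta_1 = \frac{\dfrac{2(1 + \epsilon^2)}{(1 - \epsilon)^2}\cos\theta - \dfrac{4\epsilon}{(1 - \epsilon)^2}}{\dfrac{2(1 + \epsilon^2)}{(1 - \epsilon)^2} - \dfrac{4\epsilon}{(1 - \epsilon)^2}\cos\theta}. \tag{***}$$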
Defining the constant $\mu$ by 

$$\mu = \left(\frac{1 + \epsilon}{1 - \epsilon}\right)^2, \tag{1}$$

and noticing that $2(1 + \epsilon^2)/(1 - \epsilon)^2 = \mu + 1$ and $4\epsilon/(1 - \epsilon)^2 = \mu - 1$, we see 
that (***) can be written 

$$\cos\theta_1 = \frac{(\mu + 1)\cos\theta - (\mu - 1)}{(\mu + 1) - (\mu - 1)\cos\theta}.$$

Finally, defining the function $f_\mu: [0, \pi] \to [0, \pi]$ by $f_\mu(\theta) = \theta_1$, we can compute $\theta_1$ 
by 

$$\theta_1 = f_\mu(\theta) = \arccos\left(\frac{(\mu + 1)\cos\theta - (\mu - 1)}{(\mu + 1) - (\mu - 1)\cos\theta}\right), \tag{2}$$

since $\theta_1 \in [0, \pi]$. 

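If we use the standard notation $f_\mu^1(\theta) := f_\mu(\theta)$, $f_\mu^2(\theta) = f_\mu(f_\mu(\theta)), \ldots$, then the 
successive departure angles of a ray are $\theta, f_\mu^1(\theta), f_\mu^2(\theta), \ldots$. If we also adopt the 
conventions $f_\mu^0(\theta) = \theta$ and $f_\mu^{-n}(\theta) = (f_\mu^n)^{-1}(\theta)$, we have 


Proposition 2. Let $\mu > 1$, and let $f_\mu: [0, \pi] \to [0, \pi]$ be defined as in (2). Then for 
each $\theta \in [0, \pi]$ and each $n \in \mathbb{Z}$, 

$$f_\mu^n(\theta) = \arccos\left(\frac{(\mu^n + 1)\cos\theta - (\mu^n - 1)}{(\mu^n + 1) - (\mu^n - 1)\cos\theta}\right). \tag{3}$$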

Proof: Since (3) obviously holds when $n = 0$ or $n = 1$, suppose that (3) is true 
when $n = k$ for some $k > 0$. By straightforward algebra it follows that for any 
$\theta \in [0, \pi]$, 

$$f_\mu^{k+1}(\theta) = f_\mu^{k}(f_\mu(\theta)) = \arccos\left(\frac{(\mu^{k+1} + 1)\cos\theta - (\mu^{k+1} - 1)}{(\mu^{k+1} + 1) - (\mu^{k+1} - 1)\cos\theta}\right),$$

and hence (3) is true for any $n > 0$. It is equally straightforward to check that for 
any $n > 0$, defining $f_\mu^{-n}$ according to (3) gives 

$$f_\mu^{-n}(f_\mu^{n}(\theta)) = f_\mu^{n}(f_\mu^{-n}(\theta)) = \theta$$

for each $\theta \in [0, \pi]$. Thus each $f_\mu^n$ is invertible, and (3) is proved. 
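
Proposition 2 lends itself to a quick numerical sanity check. The following Python sketch compares repeated application of (2) with the closed form (3), and checks that the exponent $-n$ undoes the exponent $n$. The function names and the sample eccentricity 0.3 are assumptions made for the example, and the clamping of the cosine to $[-1, 1]$ is a floating-point safeguard that is not part of the mathematics.

```python
import math


def mu_from_eccentricity(eps):
    """The constant mu defined in (1)."""
    return ((1.0 + eps) / (1.0 - eps)) ** 2


def f_mu_n(theta, mu, n):
    """Closed form (3) for the n-th iterate; n may be any integer."""
    p = mu ** n
    c = math.cos(theta)
    x = ((p + 1.0) * c - (p - 1.0)) / ((p + 1.0) - (p - 1.0) * c)
    return math.acos(max(-1.0, min(1.0, x)))  # clamp against rounding error


def f_mu(theta, mu):
    """One reflection step, formula (2)."""
    return f_mu_n(theta, mu, 1)


if __name__ == "__main__":
    mu = mu_from_eccentricity(0.3)
    theta = 1.0
    for n in range(6):
        # Successive departure angles climb toward pi.
        print(n, f_mu_n(theta, mu, n))
```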


Remarks. It is not difficult to use (3) to prove Proposition 1. For example, if 
$\theta \in (0, \pi)$, then by elementary calculus, $\lim_{n\to\infty} f_\mu^n(\theta) = \pi$. It is also clear from (3) 
that each $f_\mu^n$ is a homeomorphism of $[0, \pi]$ with fixed points 0 and $\pi$; we shall use 
this fact later. Both of these results are suggested by the graphs in Figure 6. 

Before giving a precise statement of the promised exponential law, it will help 
to look at some pictures and see how the focusing process evolves. In Figure 7 are 
21 cells from a movie, and these should be viewed from left to right, starting at the 
top. In the first cell (marked A), 16 particles simultaneously depart from the 
left-hand focus $F_1$, all at the same speed. The particles have a uniform angular 
spacing, and thus the initial departure angles are $0, \pi/8, 2\pi/8, \ldots, \pi$, where the 
angles $\pi/8, \ldots, 7\pi/8$ each correspond to a pair of particles which belong to 
opposite half-planes determined by the major axis. 


Figure 6. Graphs of $f_\mu^n(\theta)$ for $n = 1, 2, 3$. 

Figure 7 


Initially, the particles “ride” on a circular wave, but as the wave begins to be 
reflected, it consists of two circular arcs: one centered at, and diverging from, $F_1$, 
and the other centered at, and converging toward, the other focus $F_2$. This follows 
from the fact that the total path length from focus to focus is the same for each 
particle, and thus the particles must eventually converge on $F_2$ in a circular 
pattern. The particles on the diverging arc become more dispersed, while those on 
the converging arc become more concentrated. 
By the end of the third row of cells, the wave has become a small circle 
converging on $F_2$, and in cell B the wave has crossed itself (we allow the 
“particles” to pass through each other) and is now diverging from $F_2$. The 
departure angles, now being first iterates of $f_\mu$, are mostly clustered about $\pi$ 
(recall that at $F_2$, the departure angle $\pi$ is to the right). As the wave diverges from 
$F_2$, the particles at first begin to disperse, but very soon most of them are reflected 
and begin to converge back toward $F_1$. 

In cell C, the particles have just passed through $F_1$, and their departure angles 
are computed by $f_\mu^2(\theta)$. Except for the particle which had initial departure angle 
$\theta = 0$, the particles are now contained within an arc of about 70°. Let us observe 
what this process implies for a spherical wave: 

A spherical light wave of uniform intensity, emitted from one focus of a perfectly 
reflecting, prolate ellipsoid of revolution, will eventually be reduced to a single, highly 
concentrated pulse oscillating back and forth along the major axis. 

Of course, when the wave is emitted, there should be no emitting device left at 
the focus, or else it would interfere with the focusing process. Thus one might 
imagine two thin wires with a tiny gap at $F_1$, where a spark starts the process. 
Perhaps better still, one might imagine the wave to be emitted from a 
matter-antimatter collision at $F_1$. 

Having visualized this process, we are now ready for the precise statement of 
the exponential law. Let us assume that we have a prolate ellipsoid of revolution 
with $\mu > 1$, and that a sphere of unit radius centered at either focus can be 
contained inside the ellipsoid. For the purpose of analysis we will consider only 
spherical waves of unit radius departing alternately from $F_1$ and $F_2$, as roughly 
depicted in cells A, B, and C of Figure 7. On any such wave, let us refer to the 
point farthest from the opposite focus (departure angle $\pi$) as the point of 
concentration. By the intensity at a point of the wave we mean the power (energy 
per unit time) per unit area transported by the wave at that point. What we will 
show is that, if the initial wave has uniform intensity $I_0$, then the intensity $I_n$ at the 
point of concentration after the $n$th reflection is given by 

$$I_n = I_0\mu^n. \tag{4}$$

Thus $\mu$ is an excellent measure of the focusing efficiency of an ellipsoid. Moreover, 
from (1) we see that $\mu \uparrow \infty$ as $\epsilon \uparrow 1$, so elongated ellipsoids focus more 
efficiently. 


Proposition 3. Let $\mu > 1$ and let $I_0$ and $I_n$ be as defined above. Then for each 
$n \ge 0$, $I_n$ is given by (4). 


Proof: Let us think of all the departing spherical waves of unit radius described 
above as being superimposed on one another, with their points of concentration at 
the left. Then a small spherical cap (a special case of a zone, shaded in profile in 
Figure 8) centered at this point, with its boundary located at $\theta$, has area 
$A_0 = 2\pi(1 + \cos\theta)$. This follows from a standard theorem of solid geometry 
which says that a zone with altitude $h$ on a sphere of radius $r$ has area $2\pi rh$; in 
this case, $r = 1$ and $h = 1 + \cos\theta$. 

Since for any $n \ge 1$, the function $f^n$ is a homeomorphism of $[0, \pi]$, we may also 
consider it to be a homeomorphism of the sphere which increases the $\theta$-coordinates 
of points, where $\theta \in [0, \pi]$ is a polar angle measured from the extreme 
right-hand point of the sphere. Thus after $n$ reflections, a larger spherical cap 
centered at $\pi$ will be mapped by $f^n$ onto the smaller one. The boundary of this 
larger cap is located at $f^{-n}(\theta)$, and hence by the zone formula and (3) its area is 

$$A_{-n} = 2\pi\left[1 + \cos\left(f^{-n}(\theta)\right)\right] = 2\pi\left[1 + \frac{(\mu^{-n} + 1)\cos\theta - (\mu^{-n} - 1)}{(\mu^{-n} + 1) - (\mu^{-n} - 1)\cos\theta}\right].$$

Figure 8 

Now the original intensity $I_0$ was constant, so the energy flux through the larger 
cap was initially $I_0 A_{-n}$. After this cap is mapped by $f^n$ onto the smaller one, the 
total energy flux is the same (the same rays now pass through the smaller cap), but 
the area has decreased to $A_0$. Hence the average intensity over the cap is now 
$I_0 A_{-n}/A_0$. The intensity $I_n$ at the point of concentration is then given by 

$$\begin{aligned}
I_n &= \lim_{\theta \to \pi} \frac{I_0 A_{-n}}{A_0} \\
    &= I_0 \lim_{\theta \to \pi} \frac{(\mu^{-n} + 1) - (\mu^{-n} - 1)\cos\theta + (\mu^{-n} + 1)\cos\theta - (\mu^{-n} - 1)}{\left[(\mu^{-n} + 1) - (\mu^{-n} - 1)\cos\theta\right](1 + \cos\theta)} \\
    &= I_0 \lim_{\theta \to \pi} \frac{2 + 2\cos\theta}{(\mu^{-n} + 1) + 2\cos\theta - (\mu^{-n} - 1)\cos^2\theta}.
\end{aligned}$$

L’Hospital’s rule is clearly applicable, and thus 

$$I_n = I_0 \lim_{\theta \to \pi} \frac{-2\sin\theta}{-2\sin\theta + 2(\mu^{-n} - 1)\cos\theta\sin\theta}
      = I_0 \lim_{\theta \to \pi} \frac{-2}{-2 + 2(\mu^{-n} - 1)\cos\theta} = I_0 \mu^n.$$
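The limit in the proof can be checked numerically. The following is a minimal sketch, not part of the original article: the function name `cap_area_ratio` and the sample values of $\mu$ and $n$ are illustrative assumptions, and the expression for $\cos(f^{-n}(\theta))$ is the one read off from the area formula for $A_{-n}$ in the proof.

```python
import math

def cap_area_ratio(theta, mu, n):
    """Return A_{-n} / A_0 for a cap whose boundary sits at polar angle theta."""
    m = mu ** (-n)
    c = math.cos(theta)
    # cos(f^{-n}(theta)), taken from the area formula for A_{-n} in the proof
    cos_pre = ((m + 1) * c - (m - 1)) / ((m + 1) - (m - 1) * c)
    return (1 + cos_pre) / (1 + c)

# As theta approaches pi the ratio should approach mu**n (here 4**3 = 64).
for eps in (1e-1, 1e-2, 1e-3):
    print(eps, cap_area_ratio(math.pi - eps, mu=4.0, n=3))
```

Taking $\theta$ much closer to $\pi$ than this in floating point would lose accuracy, since both $1 + \cos\theta$ and the numerator tend to zero.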


To summarize our results, we may consider both the traditional focusing process 
and the one considered here as means of increasing the intensity of a wave emitted 
from a focus of an ellipsoid of revolution. The former process makes the intensity 
arbitrarily large over the entire wave, in a periodic fashion, by periodically making 
the wave arbitrarily small (compressing it to a focus). The latter process makes the 
intensity arbitrarily large at one particular point on the wave, in an exponential 
fashion, by reflecting the wave arbitrarily many times. Of course, this process is 
exponential in the discrete-time sense, since we have fixed our attention on a 
particular phase of the wavefront cycle (i.e., departing from either focus with unit 
radius), which occurs every 2a/v seconds, assuming that the major axis has length 
2a and light travels at v units per second. In between reflections and focal-point 
compressions, the intensity increases or decreases according to an inverse-square 
law, depending on whether that part of the wave is converging to or diverging from 
a focus. However, we may consider the wavefront at any time t, except when the 
wave is compressed to a point, identify the point of concentration, and show that at 
time t + 2a/v the intensity at the corresponding point has increased by a factor of 
$\mu$. Thus it seems appropriate to call this process "Phase-Exponential Reflection," 
and to call our make-believe antimatter-powered device a "Phase-Exponential 
Reflector" (PHASER). Under the idealized conditions we have described, 
PHASER intensification of light is rather dramatic, even for moderately elongated 
ellipsoids. To illustrate this, we will look at one final example. 

In Figure 9 we have sketched an ellipsoid of revolution with eccentricity 
$e = 24/25$. There also appears in that figure a wave of radius 1 departing from $F_1$. 
For this ellipsoid, $\mu = 2401$. Hence assuming that the wave initially departs from 
$F_1$ with uniform intensity $I_0$, we can use (4) to determine that when the wave next 
departs from $F_1$ (after two reflections), the intensity at the point of concentration 
will be increased by a factor of almost six million. 


$$I_2 = 5{,}764{,}801\, I_0$$


Figure 9 
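The arithmetic of this example can be reproduced exactly. The sketch below rests on an assumption: equation (1) is not reproduced in this part of the text, and the form $\mu = ((1+e)/(1-e))^2$ used here is inferred only from the quoted value $\mu = 2401$ at $e = 24/25$. The function names are illustrative.

```python
from fractions import Fraction

def focusing_factor(e):
    """mu as a function of eccentricity e.

    ASSUMPTION: equation (1) is taken to be mu = ((1 + e) / (1 - e))**2,
    which matches mu = 2401 for e = 24/25 but is not quoted from the article.
    """
    e = Fraction(e)
    return ((1 + e) / (1 - e)) ** 2

def intensity_gain(e, n):
    """I_n / I_0 = mu**n, from equation (4)."""
    return focusing_factor(e) ** n

e = Fraction(24, 25)
print(focusing_factor(e))    # mu for the ellipsoid of Figure 9
print(intensity_gain(e, 2))  # gain after two reflections
```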


The author is indebted to Professor Y. Ionin for an improvement of the original version of the proof 
of Proposition 1; to Professor Richard Patterson for a helpful critique; and to the referees for several 
suggestions and improvements. 
NOTES 


Edited by: John Duncan 


Reflections Can Be Trapped 


Roberto Peirone 


In [1] it is asked whether, roughly speaking, a beam of light rays can be trapped in 
a suitable container acting as a reflecting mirror. In this paper I prove that 
this is so. More precisely, I describe a Jordan curve of class $C^1$ (in fact, we can 
make this curve of class $C^\infty$), $\gamma : [0,1] \to \mathbb{R}^2$, such that $\gamma((1/2, 1)) = A$ is a 
horizontal open line segment and, supposing that $\gamma([0, 1/2])$ works as a reflecting 
mirror, every vertical upward light ray passing through $A$ enters the bounded 
region determined by $\gamma([0,1])$ and will not pass through $A$ again. Analogously, and 
this example is more closely related to the problem treated in [1], we can construct 
a compact connected surface $\sigma$ of class $C^1$ in $\mathbb{R}^3$ such that there exists an open 
rectangle $A$ contained in $\sigma$ and, supposing that $\sigma \setminus A$ works as a reflecting 
mirror, every light ray passing through $A$, orthogonal to $A$ and entering the 
bounded region determined by $\sigma$, will not pass through $A$ again. The precise statement 
of the problem discussed in [1] is whether in this type of situation, in fact in a 
slightly more general situation, it is possible to prove that some of the light rays of 
the beam will be reflected back out through $A$. As pointed out by the referee, it 
may be shown that if we allow just a minor deformation from a parallel ray, then it 
is impossible to trap the beam. The idea of the construction is to join, in a suitable 
way, an ellipse to a parabola whose focus is one of the foci of the ellipse (see Figure). 


The curve in the Figure represents $\gamma([0, 1/2])$. The arc between $P_1$ and $P_2$ is an 
arc of the ellipse whose foci are $F_1$ and $F_2$. The arc between $V$ and $P_1$ is an arc of 
the parabola whose vertex is $V$ and whose focus is $F_1$. Here, the straight line 
passing through $F_1$ and $F_2$ is horizontal and that passing through $V$ and $F_1$ is 
vertical. If $\gamma([0, 1/2])$ works as a reflecting mirror, then every vertical upward light 
ray as in the Figure will remain in the interior of the curve. For, after the first 
reflection, by a well-known property of the parabola, it will pass through $F_1$; after 
the second reflection, by a well-known property of the ellipse, it will pass through 
$F_2$; then it will pass through $F_1$ again, then through $F_2$, and so on. 

In order to construct $\sigma$ as required it is sufficient, for example, to consider a 
surface suitably built on a cylinder having $\gamma$ for base. 
Euler’s Theorem 


Katherine Heinrich and Peter Horak 


Fermat’s Little Theorem. If $p$ is a prime and $p \nmid a$, then $a^{p-1} \equiv 1 \pmod{p}$. 


Stated in a letter by Fermat in 1640, the first public proof of this result was 
given by Euler in 1736 (although it appears that Leibniz also had a proof which he 
did not publish). The details of the situation are described in [Dic]. Some time 
later (in 1760) Euler generalized Fermat’s result. 


Euler’s Theorem. If $a$ and $m$ are relatively prime, then $a^{\phi(m)} \equiv 1 \pmod{m}$, where 
$\phi(m) = |\{n : 1 \le n \le m \text{ and } n \text{ and } m \text{ are relatively prime}\}|$. 


Since then a variety of proofs have been presented, the most frequently cited 
depending on the use of the binomial theorem, group theory, or number-theoretic 
arguments (see [Dic]). We present a proof which is combinatorial in spirit. 
(When this proof is restricted to the case when m is prime, Thue’s proof [Thu] of 
Fermat’s Little Theorem is obtained.) 


Proof of Euler’s Theorem. We first prove the theorem in the case when $m$ is a 
prime power: $m = p^\alpha$, $p$ prime. Let $S^m = \{(a_1, a_2, \ldots, a_m) : 1 \le a_i \le a\}$ be the set of 
$a^m$ ordered $m$-tuples. Consider the mapping $\sigma$ acting on $S^m$ as follows. For 
$A = (a_1, a_2, \ldots, a_m)$, $\sigma(A) = (a_m, a_1, a_2, \ldots, a_{m-1})$. That is, $\sigma$ cycles the entries 
of $A$ one place to the right. Now we are going to count the number of orbits of $S^m$, 
under the action of $\sigma$, of the maximum size $m$. This will be done by counting the 
total number of elements which are members of smaller orbits. Suppose $A \in S^m$ 
belongs to an orbit consisting of $t < m$ elements. Then $\sigma^t(A) = A$ and $t \mid m$ (or 
$t = p^\beta$, $\beta \le \alpha - 1$). We can therefore express $A$ as 

$$A = (a_1, a_2, \ldots, a_t, a_1, a_2, \ldots, a_t, \ldots, a_1, a_2, \ldots, a_t),$$

and hence as 

$$A = (a_1, a_2, \ldots, a_r, a_1, a_2, \ldots, a_r, \ldots, a_1, a_2, \ldots, a_r),$$

where $r = p^{\alpha - 1}$. On the other hand, whenever $A$ has the form 

$$A = (a_1, a_2, \ldots, a_r, a_1, a_2, \ldots, a_r, \ldots, a_1, a_2, \ldots, a_r),$$

we have $\sigma^r(A) = A$. 

Therefore, there are $(a^m - a^r)/m$ orbits of size $m$. Thus $m \mid a^m - a^r$ and, as 
$(m, a) = 1$, $m \mid a^{m-r} - 1$. But $a^{m-r} - 1 = a^{p^\alpha - p^{\alpha-1}} - 1 = a^{\phi(m)} - 1$ and therefore 
$a^{\phi(m)} \equiv 1 \pmod{m}$. 

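Now let $m = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_t^{\alpha_t}$, $p_i$ prime, $1 \le i \le t$. Since 
$\phi(m) = (p_1^{\alpha_1} - p_1^{\alpha_1 - 1})(p_2^{\alpha_2} - p_2^{\alpha_2 - 1}) \cdots (p_t^{\alpha_t} - p_t^{\alpha_t - 1})$, each $\phi(p_i^{\alpha_i})$ divides $\phi(m)$. 
By the prime-power case, $p_i^{\alpha_i} \mid a^{\phi(p_i^{\alpha_i})} - 1$, and hence $p_i^{\alpha_i} \mid a^{\phi(m)} - 1$ for each $i$. 
Then $m \mid a^{\phi(m)} - 1$, or $a^{\phi(m)} \equiv 1 \pmod{m}$. 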
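The orbit count at the heart of the proof can be checked by brute force for small cases. This is a minimal sketch rather than anything from the article: the function name `count_full_orbits` and the sample values of $a$ and $m$ are assumptions chosen to keep the enumeration small.

```python
from itertools import product

def count_full_orbits(a, m):
    """Count orbits of size m among m-tuples over {1, ..., a} under cyclic shift."""
    seen = set()
    full = 0
    for A in product(range(1, a + 1), repeat=m):
        if A in seen:
            continue
        orbit = set()
        B = A
        while B not in orbit:
            orbit.add(B)
            B = (B[-1],) + B[:-1]  # cycle the entries one place to the right
        seen |= orbit
        if len(orbit) == m:
            full += 1
    return full

# m = 9 = 3**2, so r = 3; the proof predicts (a**m - a**r) / m full orbits.
a, m, r = 2, 9, 3
print(count_full_orbits(a, m), (a**m - a**r) // m)
print(pow(a, m - r, m))  # a**phi(m) mod m, with phi(9) = 9 - 3 = 6
```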
PICTURE PUZZLE 
(from the collection of Paul Halmos) 


To some people he is known as his brother’s brother. 
(see page 265.) 
Harald Bohr, the brother of physicist Niels Bohr. 
Universal Traversal Sequences 


Joan Feigenbaum and Nick Reingold 


In this article we discuss a purely combinatorial problem, the construction of short 
universal traversal sequences, and its relationship to questions about logspace 
computation. We state the problem formally, show how it arises naturally in 
complexity theory, and review some of the known partial results. A basic introduc- 
tion to complexity theory can be found in [6]. 

The P vs. NP problem is recognized by the mathematical world as the central 
open question in the theory of computation. Less widely known outside of 
computer science is the fact that the analogous question for space-bounded 
computation was resolved long ago¹: Savitch [8] shows that any language accepted 
by a nondeterministic Turing machine that uses space $O(s(n))$ is also accepted by 
a deterministic Turing machine that uses space $O((s(n))^2)$. Hence deterministic 
polynomial space is equivalent (in terms of language-recognition power) to nonde- 
terministic polynomial space: PSPACE = NPSPACE. 

However, one question about the relationship of nondeterministic space- 
bounded computation and its deterministic counterpart remains open: Is the 
quadratic “blow-up” in space complexity exhibited by Savitch’s construction neces- 
sary? This question turns out to be most interesting for computations that use very 
little space. Let L be the class of languages accepted by deterministic Turing 
machines using only O(log n) space, and NL be the class of languages accepted by 
nondeterministic Turing machines using only O(log n) space. It could be the case 
that Savitch’s theorem is optimal at this low end of the space-complexity spectrum. 
On the other hand, it could be that any language recognizable by a nondeterminis- 
tic Turing machine is also recognizable by a deterministic Turing machine with the 
same space complexity; if that’s true, then NL is exactly equal to L. The truth 
might also lie somewhere between these two extremes. 


Universal Traversal Sequences. We now consider a purely combinatorial problem. 
Consider d-regular, undirected graphs G = (V, E). Such a graph is called (d, n)-labeled 
if it has n vertices and the edges incident to each vertex are labeled by a 
permutation of {1, 2, ..., d}. Note that an edge has two labels, one associated with 
each of its endpoints, and that these labels may differ. A sequence of labels 
$\sigma = \sigma_1 \sigma_2 \cdots \sigma_m$ and a start vertex $s$ define a walk $s_0 s_1 \cdots s_m$ as follows. Let $s_0 = s$. 
For $1 \le i \le m$, if $\sigma_i = j$, then $s_i$ is the (unique) vertex such that there is an edge 
$e = \{s_{i-1}, s_i\}$ and the label of $e$ that is associated with $s_{i-1}$ is $j$. 

[Figure: a (2, 3)-labeled graph on the vertices A, B, C] 

For example, in the (2, 3)-labeled graph above, the sequence 12211 and start vertex A 
define the walk ABCBAB. 

¹ By computer science standards. 

A sequence $\sigma$ is called (d, n)-universal if, for every connected (d, n)-labeled 
graph (V, E) and every start vertex s ∈ V, the walk defined by $\sigma$ and s contains 
every vertex in V. For example, the sequence 112 is (2, 3)-universal. 
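The definitions of a walk and of universality can be exercised on the smallest case. The sketch below is illustrative only. The figure is not reproduced here, so the labeling `EXAMPLE` is a hypothetical one chosen to be consistent with the walk ABCBAB stated in the text. The universality check also assumes simple graphs, so that the only connected 2-regular graph on 3 vertices is the triangle, and it enumerates the 8 labelings of that triangle.

```python
from itertools import product

def walk(graph, start, labels):
    """Follow the edge labels from `start`; graph[v][j] is the neighbor of v via label j."""
    path = [start]
    for j in labels:
        path.append(graph[path[-1]][j])
    return path

# Hypothetical labeling of the triangle A, B, C (not copied from the figure).
EXAMPLE = {"A": {1: "B", 2: "C"}, "B": {1: "A", 2: "C"}, "C": {1: "A", 2: "B"}}

def triangle_labelings():
    """Yield all 8 ways of labeling the edges at each vertex of the triangle."""
    verts = "ABC"
    others = {v: [u for u in verts if u != v] for v in verts}
    for flips in product([False, True], repeat=3):
        g = {}
        for v, flip in zip(verts, flips):
            first, second = others[v][::-1] if flip else others[v]
            g[v] = {1: first, 2: second}
        yield g

def is_universal_for_triangles(labels):
    """True if the walk covers every vertex for every labeling and every start vertex."""
    return all(
        set(walk(g, s, labels)) == set("ABC")
        for g in triangle_labelings()
        for s in "ABC"
    )

print("".join(walk(EXAMPLE, "A", [1, 2, 2, 1, 1])))
print(is_universal_for_triangles([1, 1, 2]))
```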

For expository purposes, we also define a directed version of universal traversal 
sequences. A d-regular directed graph G = (V, A) is called (d, n)-labeled if it has 
n vertices and the arcs out of each vertex are labeled with a permutation of 
{1,2,..., d}. Note that, in the directed case, arcs have only one label. A sequence 
$\sigma$ and start vertex s define a walk in essentially the same way as they do in the 
standard definition. We say that $\sigma$ is (d, n)-directed-universal if, for every strongly 
connected (d, n)-labeled digraph (V, A) and every start vertex s, the walk defined 
by $\sigma$ and s contains every vertex in V. 

The connection between universal traversal sequences and L vs. NL is made via 
the s-t connectivity problem. An instance of this problem consists of a directed 
graph G and two vertices s and t in V(G). The question is whether there is a path 
in G from s to t. There is a straightforward nondeterministic logspace algorithm 
for this problem: Guess a path $s_0 s_1 \cdots s_{n-1}$ such that $s = s_0$, and then check that 
each arc $s_{i-1} \to s_i$ is present in G and that some $s_i = t$. This algorithm is clearly 
correct, and it only requires space O(log n), where n = |V(G)|. To see why so little 
space is required, note that the algorithm does not need to store the entire path at 
any time. Rather, it need only store the names of two consecutive vertices $s_{i-1}$ and 
$s_i$; once it has verified the presence of the arc $s_{i-1} \to s_i$, it can write over $s_{i-1}$ with 
its guess for $s_{i+1}$. 
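The space-saving idea can be illustrated by a deterministic stand-in for the verification step. This is a sketch under stated assumptions: nondeterministic guessing is replaced by an externally supplied stream of guesses, the function name `verify_guessed_path` is hypothetical, and the check accepts as soon as t is reached. Only the current vertex is kept; each guess is consumed once and then discarded.

```python
def verify_guessed_path(arcs, s, t, guesses):
    """Check a guessed path from s to t, storing only one vertex at a time.

    `arcs` is a set of (tail, head) pairs; `guesses` is any iterable of vertices
    standing in for the nondeterministic choices s_1, s_2, ...
    """
    current = s
    if current == t:
        return True
    for nxt in guesses:
        if (current, nxt) not in arcs:
            return False      # the guessed arc is not present in G
        current = nxt         # overwrite the stored vertex with the new one
        if current == t:
            return True
    return False

arcs = {(1, 2), (2, 3), (3, 1), (4, 1)}
print(verify_guessed_path(arcs, 1, 3, iter([2, 3])))
print(verify_guessed_path(arcs, 1, 4, iter([2, 3, 1])))
```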

The language STCONN of yes-instances of the s-t connectivity problem is in 
fact NL-complete. Furthermore, STCONN remains NL-complete if we assume 
that the input digraphs are regular. So, if we could exhibit a deterministic logspace 
algorithm for (regular) s-t connectivity, we would have shown that L is equal to 
NL. It would suffice to exhibit, for each constant d, a Turing machine that uses 
O(log n) space and generates a sequence $\tau_n$ that is (d, n)-directed-universal. Note 
that the restriction to O(log n) space implies that the length of $\tau_n$ is polynomial in 
n. Unfortunately, it can be shown that no family $\{\tau_n\}_{n \ge 1}$ of polynomial-length 
directed-universal sequences exists. (L might still be equal to NL, but the equality 
cannot be proven this way.) This raises the question of the existence of a family 
$\{\sigma_n\}_{n \ge 1}$ that is deterministically logspace-generable such that $\sigma_n$ is (d, n)-universal 
(which is weaker than (d, n)-directed-universal). Aleliunas et al. [1] give a beautiful 
probabilistic argument that, for any d, there is a polynomial-length family $\{\sigma_n\}_{n \ge 1}$ 
such that $\sigma_n$ is (d, n)-universal. Whether such a family can be generated in 
deterministic logspace remains open. 
Constructing Universal Traversal Sequences. Consider a random walk in a 
connected undirected graph. At each time step the next vertex is chosen uniformly 
from the neighbors of the current vertex. We say the walk covers G if every vertex 
is visited at least once during the walk. For any vertex v, let $C_v$ be the expected 
time at which a random walk starting at v covers G. The maximum over all v of $C_v$ 
is called the cover time of the graph. Aleliunas et al. [1] show that the cover time 
for any d-regular graph with n vertices is at most $n^2 d$. They then use this 
observation to prove the existence of (d, n)-universal traversal sequences of poly- 
nomial length as follows. 

Let $\sigma$ be a sequence of labels of length $4n^3d^2 \log_2 nd$ chosen uniformly from 
the set of all such sequences. We will show that the probability that $\sigma$ is 
(d, n)-universal is not zero. The probability that $\sigma$ is not (d, n)-universal is the 
same as the probability that there exists a (d, n)-labeled graph G and a vertex v 
such that a random walk of length $4n^3d^2 \log_2 nd$ starting at v does not cover G. 
This is the same as the probability that there exists a (d, n)-labeled graph G and a 
vertex v such that G is not covered by $2nd \log_2 nd$ consecutive random walks, 
each of length $2n^2d$, the first of which is started at v. Since any graph G has cover 
time at most $n^2d$, Markov’s inequality shows that a random walk of length $2n^2d$, 
starting from any vertex, has probability at most 1/2 of not covering G. If we take 
$2nd \log_2 nd$ such random walks consecutively, the probability that none of them 
covers G is at most $(1/2)^{2nd \log_2 nd} = (nd)^{-2nd}$. Thus, for any fixed G and v, the 
walk through G starting at v given by $\sigma$ has probability at most $(nd)^{-2nd}$ of not 
covering G. There are at most $(nd)^{nd}$ choices for v and G, so summing over all 
these choices shows that the probability that $\sigma$ is not universal is strictly less than 
one. 
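The bookkeeping in this argument can be made concrete for small parameters. The following sketch is illustrative: the function name is hypothetical, and it assumes that nd is a power of two so that $\log_2 nd$ is an integer and every quantity can be computed exactly.

```python
from fractions import Fraction

def union_bound_parameters(n, d):
    """Return (sequence length, failure bound for fixed G and v, union bound).

    ASSUMPTION: n*d is a power of two, so log2(n*d) is an exact integer.
    """
    nd = n * d
    k = nd.bit_length() - 1
    if 1 << k != nd:
        raise ValueError("this sketch assumes n*d is a power of two")
    num_walks = 2 * nd * k                  # 2nd log2(nd) consecutive walks
    walk_len = 2 * n * n * d                # each of length 2 n^2 d
    total_len = num_walks * walk_len        # 4 n^3 d^2 log2(nd)
    fail_fixed = Fraction(1, 2) ** num_walks     # each walk fails w.p. at most 1/2
    union = Fraction(nd) ** nd * fail_fixed      # sum over at most (nd)^(nd) choices
    return total_len, fail_fixed, union

total_len, fail_fixed, union = union_bound_parameters(4, 2)
print(total_len, fail_fixed == Fraction(1, 8 ** 16), union < 1)
```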

This proves the existence of polynomial-length universal traversal sequences 
and suggests the possibility that the language USTCONN (the yes-instances of the 
s-t connectivity problem for undirected graphs) is in L. However, the above proof 
does not show this, since it does not show how to generate the traversal sequences 
using only O(log n) space. Whether this is possible is still an open problem. There 
are two interesting partial results that are worth mentioning. For d = 2, Istrail [4] 
gives a construction of polynomial-length traversal sequences, but his sequences 
cannot be constructed in deterministic logspace. For d = n — 1, Karloff et al. [5] 
give an explicit construction of traversal sequences of length $n^{O(\log n)}$. 


Traversal Sequences and Pseudorandom Generators. The best explicit universal 
traversal sequences constructed so far are due to Nisan [7]. This construction 
exploits a connection, due to Babai et al. [2], between traversal sequences and 
pseudorandom generators. In this context, a pseudorandom generator is an algo- 
rithm that converts a small number of truly random bits into a long sequence of 
bits that appears random to any Turing machine that uses only a limited amount of 
space. We will not give a precise definition of “appears random to any Turing 
machine that uses only a limited amount of space.” The interested reader should 
see [2] for details. Nisan’s generators convert a truly random string of length 
$O(S \log R)$ into a string of length R that appears random to any Turing machine 
that uses space at most S. In particular, if S = O(log n) and R is polynomial in n, 
then Nisan’s generators can convert $O(\log^2 n)$ truly random bits into polynomially 
many bits that appear random to any Turing machine that uses only O(log n) 
space. We will now show that the concatenation of all the possible outputs of 
Nisan’s pseudorandom generator is a (d, n)-universal traversal sequence of length 
$n^{O(\log n)}$. 
Fix a (d, n)-labeled graph G and vertices u and v. Consider a logspace Turing 
machine that uses the contents of the random tape to perform a walk of length 
$4n^4d$ on G starting at u. If the Turing machine ever reaches v it returns 0, 
otherwise it returns 1. When the contents of the random tape are truly random, the 
Turing machine returns 1 with probability at most $1/4n^2$. If the random tape is 
actually the result of some pseudorandom generator, the probability should still be 
close to $1/4n^2$, else this logspace Turing machine could distinguish the pseudorandom 
inputs from the random ones. Thus the Turing machine with pseudorandom 
input will return 1 with probability at most, say, $1/2n^2$. Since there are at most $n^2$ 
choices for u and v, the probability that the pseudorandom generator produces a 
sequence that does not cover G is at most 1/2. This means that every (d, n)-labeled 
graph is covered by at least half of the possible outputs of the generator, so the 
concatenation of all of the outputs must be (d, n)-universal. Since there are 
$2^{O(\log^2 n)}$ possible inputs, and each output has polynomial length, the concatenation 
of all outputs has length $n^{O(\log n)}$. 


Further Reading. Wigderson [9] provides an extensive overview of the s-t connec- 
tivity problem, universal traversal sequences, L vs. NL, and many other related 
combinatorial and algorithmic problems. 
THE EVOLUTION OF... 
Edited by Abe Shenitzer 
Mathematics, York University, North York, Ontario M3J 1P3, Canada 


What Are Algebraic Integers and What 
Are They For? 


John Stillwell 


My title is stolen from Dedekind’s Was sind und was sollen die Zahlen? (1888), the 
classic book on the meaning of the natural numbers. His book is actually about set 
theory rather than number theory or algebra but, as we shall see, Dedekind had a 
lot more up his sleeve. His reflections on the algebraic properties of the natural 
numbers would have filled a much larger book, which unfortunately he never had 
time to write. The following is a sketch of the book that might have been. 

The natural numbers 0,1,2,3,... are the setting for the oldest and deepest 
mathematical problems. As early as 300 B.C., Euclid recognised that divisibility and 
primes are important concepts for natural numbers, and that even simple proposi- 
tions about them involve subtle methods of proof. For example, to prove the 
seemingly obvious proposition that if a prime p divides the product of natural 
numbers a,b then p divides a or p divides b, Euclid had to introduce the concept 
of greatest common divisor (gcd), and use the euclidean algorithm to express the 
gcd in a suitable form. It follows almost immediately from this proposition about 
prime divisors that each natural number has a unique prime factorisation. That is, if 
n is a natural number, then n can be expressed as a product of primes in exactly 
one way, up to the order of factors. Unique prime factorisation was isolated as the 
“fundamental theorem of arithmetic” in the classic Disquisitiones Arithmeticae of 
Gauss (1801). 

Gauss organised much of earlier number theory into a cohesive structure by 
introducing unifying ideas, such as congruence, and proving unifying theorems, 
such as quadratic reciprocity. But Gauss also burst the bounds of the structure he 
had created with an array of new and baffling theorems. For example, he proved 
several theorems about the nth roots of 1, bearing on the geometric construction 
of regular n-gons. He showed in passing that these irrational numbers had 
application to the natural numbers; they were even connected with quadratic 
reciprocity (Disquisitiones, section 356). How come? Gauss’s successors spent most 
of the 19th century searching for an answer to this question, for concepts to 
explain the effectiveness of irrational numbers in arithmetic. The most important 
of these was the concept of algebraic integer. 


What algebraic integers are for. Actually, Gauss was not the first to use irrational 
numbers to answer questions about the natural numbers. Euler’s Algebra (1770) 
contains several examples, the simplest being a proof of the following claim of 
Fermat: among natural numbers, 27 is the only cube that exceeds a square by 2. This 
claim is one of the famous marginal notes in Fermat’s copy of Bachet’s 
Diophantus, made around 1637. Fermat claimed to have a rigorous proof, but 
never revealed it. Euler’s proof is not exactly rigorous, but it is distinguished by the 
wild idea of using the number $\sqrt{-2}$. 

The problem amounts to showing that $x = 5$, $y = 3$ is the only solution of 

$$y^3 = x^2 + 2.$$

Euler factorises the right hand side into $(x + \sqrt{-2})(x - \sqrt{-2})$, then confidently 
assumes that $x + \sqrt{-2}$, $x - \sqrt{-2}$ must both be cubes, since their product is a 
cube and he can show that their gcd is 1. This implies, presumably, that 

$$x + \sqrt{-2} = (a + b\sqrt{-2})^3$$

for some integers $a, b$, and hence 

$$x + \sqrt{-2} = a^3 - 6ab^2 + (3a^2b - 2b^3)\sqrt{-2}.$$

Equating imaginary parts we get 

$$1 = 3a^2b - 2b^3 = b(3a^2 - 2b^2),$$

which implies $b = \pm 1$ and $3a^2 - 2b^2 = \pm 1$ since $+1$ and $-1$ are the only integer 
divisors of 1. It follows that $a = \pm 1$, which gives $x = 5$ as the only positive integer 
value of $a^3 - 6ab^2$. Q.E.D! 
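Both the cube expansion and the claim itself can be checked mechanically over a finite range. The sketch below is illustrative and does not replace the proof: elements $a + b\sqrt{-2}$ are stored as integer pairs `(a, b)`, the function names are assumptions, and the search covers only $x$ up to a chosen limit.

```python
def cube_in_z_sqrt_minus2(a, b):
    """Return (u, v) with (a + b*sqrt(-2))**3 = u + v*sqrt(-2)."""
    def mul(x, y):
        # (x0 + x1*s)(y0 + y1*s) with s*s = -2
        return (x[0] * y[0] - 2 * x[1] * y[1], x[0] * y[1] + x[1] * y[0])
    z = (a, b)
    return mul(mul(z, z), z)

def integer_cube_root(v):
    """Largest integer r with r**3 <= v, for v >= 0."""
    r = round(v ** (1.0 / 3.0))
    while r ** 3 > v:
        r -= 1
    while (r + 1) ** 3 <= v:
        r += 1
    return r

def solutions_up_to(limit):
    """All (x, y) with 1 <= x <= limit and y**3 == x**2 + 2."""
    found = []
    for x in range(1, limit + 1):
        v = x * x + 2
        y = integer_cube_root(v)
        if y ** 3 == v:
            found.append((x, y))
    return found

print(cube_in_z_sqrt_minus2(-1, 1))  # the case a = -1, b = 1 from Euler's argument
print(solutions_up_to(10_000))
```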


Well OK, we don’t know what these things $a + b\sqrt{-2}$ and their divisors really 
are, or how they work, but they seem amazingly good for answering questions 
about the natural numbers. The same goes for other irrational objects arising from 
the factorisation of polynomials. Euler gave many examples, the most spectacular 
being a proof of Fermat’s last theorem for cubes: 


$x^3 + y^3 = z^3$ has no solution in positive integers 


(Fermat himself had settled the case of fourth powers by his method of infinite 
descent). Euler proved this using the factorisation 

$$x^3 = z^3 - y^3 = (z - y)\left(z - \frac{-1 + \sqrt{-3}}{2}\,y\right)\left(z - \frac{-1 - \sqrt{-3}}{2}\,y\right),$$

and arguing, as above, that each factor in the right hand side must be a cube. This 
time the irrational objects involved are numbers of the form $a + b\omega$, where 

$$\omega = \frac{-1 + \sqrt{-3}}{2}$$

is a complex cube root of 1. Gauss’s work on the nth roots of 1 could be seen as a 
generalisation of this, involving objects of the form $a_0 + a_1\zeta + \cdots + a_{n-2}\zeta^{n-2}$, 
where $\zeta$ is a complex nth root of 1 and $a_0, a_1, \ldots, a_{n-2}$ are ordinary integers. 

Thus by 1800 it was clear that irrational objects were good for number theory. 
What was not clear was the nature of these objects and the reasons they behaved 
like integers, if indeed they did. A lot more work had to be done before the right 
concept of “algebraic integer” was isolated. 


Algebraic integers. Since we want to do arithmetic on algebraic integers, they must 
satisfy the requirement: 


1. The algebraic integers are closed under $+$, $-$ and $\times$. 


Also, since the aim in extending the concept of integer is to answer questions 
about ordinary integers, the relations between ordinary integers had better not 
change in the process of extension. In particular, an ordinary integer a should not 
become a divisor of an ordinary integer b if it is not one already. That is, the 
rational number b/a should not be an algebraic integer unless it is an ordinary 
integer. This gives us a second requirement: 


2. The rational algebraic integers are the ordinary integers. 


The numbers $a + b\sqrt{-2}$ satisfy these requirements, as do the numbers 
$a_0 + a_1\zeta + \cdots + a_{n-2}\zeta^{n-2}$. The only problematic operation is multiplication of 
numbers $a_0 + a_1\zeta + \cdots + a_{n-2}\zeta^{n-2}$, as this creates powers $\zeta^{n-1}, \zeta^n, \ldots$ . The trick is 
to rewrite $\zeta^{n-1}, \zeta^n, \ldots$ as combinations of $1, \zeta, \ldots, \zeta^{n-2}$ with ordinary integer 
coefficients, using the equation 

$$1 + \zeta + \cdots + \zeta^{n-2} + \zeta^{n-1} = 0$$

satisfied by $\zeta$. Then a product of numbers $a_0 + a_1\zeta + \cdots + a_{n-2}\zeta^{n-2}$ is seen to 
be another number of the same form. 

Now the numbers $\sqrt{-2}$ and $\zeta$ arise in the first place as the solutions of 
polynomial equations with ordinary integer coefficients, namely 

$$x^2 + 2 = 0, \qquad x^{n-1} + \cdots + x + 1 = 0.$$

Is the same true for the combinations $a + b\sqrt{-2}$ and $a_0 + a_1\zeta + \cdots + a_{n-2}\zeta^{n-2}$? 
The answer is yes, and even more is true: the combinations satisfy monic polyno- 


mial equations with ordinary integer coefficients. This follows from a general 
property of monic polynomial equations pointed out by Eisenstein (1850). 
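For the simplest case, $a + b\sqrt{-2}$, a monic polynomial can be written down directly: the number and its conjugate $a - b\sqrt{-2}$ are the roots of $x^2 - 2ax + (a^2 + 2b^2)$. This particular polynomial is a standard observation rather than something stated in the article, and the sketch below (with hypothetical function names, and pairs `(a, b)` standing for $a + b\sqrt{-2}$) checks it exactly over a small range.

```python
def mul(x, y):
    """Multiply x = (a, b) and y = (c, d), read as a + b*sqrt(-2) and c + d*sqrt(-2)."""
    a, b = x
    c, d = y
    return (a * c - 2 * b * d, a * d + b * c)

def monic_quadratic_value(a, b):
    """Evaluate x**2 - 2a*x + (a**2 + 2b**2) at x = a + b*sqrt(-2)."""
    x = (a, b)
    sq = mul(x, x)
    rational_part = sq[0] - 2 * a * a + (a * a + 2 * b * b)
    irrational_part = sq[1] - 2 * a * b
    return (rational_part, irrational_part)

print(monic_quadratic_value(3, -7))  # (0, 0) means the polynomial vanishes
```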


If $f(x) = 0$ is a monic polynomial equation with ordinary integer coefficients and 
roots $\alpha_1, \alpha_2, \ldots, \alpha_n$, and if $g(\alpha_1, \ldots, \alpha_n)$ is any polynomial in the roots with 
ordinary integer coefficients, then $g(\alpha_1, \ldots, \alpha_n)$ is also the root of a monic polynomial 
equation $h(x) = 0$ with ordinary integer coefficients. 


His proof applies Newton’s theorem on symmetric polynomials to the product 
$h(x)$ of all terms $x - g(\alpha_{\sigma(1)}, \ldots, \alpha_{\sigma(n)})$, where $\sigma$ is a permutation of $1, \ldots, n$, to 
conclude that the coefficients of $h(x)$ are ordinary integers. Obviously $h(x)$ is 
monic, and its roots include $g(\alpha_1, \ldots, \alpha_n)$. Since $\alpha_1 + \alpha_2$, $\alpha_1 - \alpha_2$ and $\alpha_1\alpha_2$ are 
particular cases of $g(\alpha_1, \ldots, \alpha_n)$, closure under $+$, $-$ and $\times$ will hold when 
algebraic integers are defined as follows. 


Definition. An algebraic integer is the root of a monic polynomial equation with 
ordinary integer coefficients. 

Just as easily, this definition meets the second requirement: 


A rational algebraic integer is an ordinary integer. 

Because if $a/b$ is a rational number (in lowest terms) that satisfies the equation 

$$x^n + c_1 x^{n-1} + \cdots + c_{n-1} x + c_n = 0$$

with ordinary integer coefficients, then substitution and rearrangement give 

$$a^n / b = -c_1 a^{n-1} - \cdots - c_{n-1} a b^{n-2} - c_n b^{n-1},$$

which is possible only if $b = 1$, since the right hand side is an ordinary integer. 

Eisenstein did not actually state the definition or the closure under $+$, $-$ and 
$\times$, but he probably didn’t need to. The class of solutions of monic polynomial 
equations was under consideration by other mathematicians at the time, and its 
basic properties were implicit in known results about polynomials. In particular, 
the nature of rational algebraic integers was well known as a result about 
polynomials 50 years earlier (Gauss assumed it without proof in article 11 of the 
Disquisitiones). More important was Eisenstein’s recognition that the algebraic 
integers shared properties with the ordinary integers. This pointed the way to a 
systematic explanation of the phenomena discovered by Euler and Gauss. 


Divisors and primes. Closure of the algebraic integers under $+$, $-$ and $\times$ means 
that they inherit the ring properties of the complex numbers. Hence they are a 
ring, like the ordinary integers, and satisfy the same basic propositions such as the 
commutative, associative and distributive laws. Unfortunately, many useful 
propositions are not logical consequences of these laws and in fact are false in the larger 
ring. For example, there are no primes in the ring of all algebraic integers, because 
every algebraic integer $\alpha$ has the factorisation $\alpha = \sqrt{\alpha}\,\sqrt{\alpha}$, and $\sqrt{\alpha}$ is also an 
algebraic integer. This difficulty can be avoided by working in smaller rings 
tailored to particular problems, such as the ring of integers $a + b\sqrt{-2}$ where 
Euler looked for solutions of $y^3 = x^2 + 2$. “Sufficiently small” rings of algebraic 
integers do have primes, and also prime factorisation, but the big question is 
whether the factorisation is unique. 
The first ring of algebraic integers to be examined in this light was 


$Z[i] = \{a + bi : a, b \in Z\},$


the ring of Gaussian integers. Gauss (1832) showed that Z[i] has a divisibility 
theory like that of Z, including unique prime factorisation, thanks to an analogue 
of the following division property of Z. If $a, b \in Z$ and $b \ne 0$ then there are
$q, r \in Z$ (“quotient” and “remainder”) with $a = qb + r$ and $0 \le |r| < |b|$. The
analogous division property of Z[i] is that for any $\alpha, \beta \in Z[i]$ with $\beta \ne 0$ there are
$\mu, \rho \in Z[i]$ with $\alpha = \mu\beta + \rho$ where $0 \le |\rho| < |\beta|$ and the absolute value | | is now
distance in the complex plane.

The division property of Z[i] can be seen by viewing the Gaussian integer 
multiples of $\beta$ as the corners of a lattice of squares in the plane. A typical square is
the one with corners at $0, \beta, i\beta, (1 + i)\beta$. The “remainder” $\rho$ is simply the
difference between $\alpha$ and the nearest corner in the lattice, and $|\rho| < |\beta|$ because
the distance from any point in a square to the nearest corner is less than the length 
of a side. With the division property established, one has a euclidean algorithm for 
gcd, and the rest of the route to unique prime factorisation is the same as in Z. 
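
The nearest-corner argument can be tried out numerically. The following Python sketch is an illustration only and is not part of the original article: it stores a Gaussian integer a + bi as the pair (a, b), and the names g_mul, g_norm, g_divmod and g_gcd are hypothetical. Rounding each coordinate of the quotient to the nearest integer selects a nearest lattice corner, so the squared length of the remainder is at most half the squared length of the divisor.

```python
def g_mul(u, v):
    """Product of Gaussian integers stored as (real, imaginary) pairs."""
    return (u[0] * v[0] - u[1] * v[1], u[0] * v[1] + u[1] * v[0])


def g_norm(u):
    """Squared absolute value a^2 + b^2 of a + bi."""
    return u[0] * u[0] + u[1] * u[1]


def g_divmod(alpha, beta):
    """Return (mu, rho) with alpha = mu*beta + rho and |rho| < |beta|."""
    n = g_norm(beta)
    if n == 0:
        raise ZeroDivisionError("division by the Gaussian integer 0")
    # alpha / beta = alpha * conj(beta) / |beta|^2, computed exactly.
    x, y = g_mul(alpha, (beta[0], -beta[1]))
    # Round each coordinate of the quotient to the nearest integer.
    mu = ((2 * x + n) // (2 * n), (2 * y + n) // (2 * n))
    mb = g_mul(mu, beta)
    rho = (alpha[0] - mb[0], alpha[1] - mb[1])
    return mu, rho


def g_gcd(alpha, beta):
    """Euclidean algorithm in Z[i]; the result is a gcd up to a unit."""
    while beta != (0, 0):
        _, rho = g_divmod(alpha, beta)
        alpha, beta = beta, rho
    return alpha
```

For instance, g_divmod((7, 3), (2, 1)) gives quotient 3 and remainder 1, and g_gcd((5, 0), (2, 1)) returns 2 + i, reflecting 5 = (2 + i)(2 − i).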

A similar geometric argument establishes a division property for $Z[\sqrt{-2}]$.
Hence $Z[\sqrt{-2}]$ also has a unique prime factorisation, and Euler’s treatment of
$y^3 = x^2 + 2$ is valid. Euler’s proof that $x^3 + y^3 \ne z^3$ can likewise be justified by
finding a euclidean algorithm for Z[w].

Alas, it is not always as simple as this. Unique prime factorisation fails in
$Z[\sqrt{-5}]$, and it also fails in $Z[\zeta]$ for nth roots $\zeta$ of unity from n = 23 onwards
(Kummer, 1844). The failure in $Z[\sqrt{-5}]$ was encountered implicitly by Fermat in
studying primes of the form $x^2 + 5y^2$ (see [2], p. 82). The failure in $Z[\zeta]$ was a
major stumbling block, though not the only one, in Kummer’s attempt to prove 
Fermat’s last theorem. 
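
A standard illustration of the failure in the ring of numbers a + b√−5, which the text does not spell out, is the pair of factorisations 6 = 2 · 3 = (1 + √−5)(1 − √−5). The Python sketch below is an added example under stated assumptions: a + b√−5 is stored as the pair (a, b), and all function names are hypothetical. It uses the multiplicative norm a² + 5b². Because no element has norm 2 or 3, none of the four factors can split into two non-units, and since the only units are ±1 the two factorisations are genuinely different. The test applied is a sufficient condition for irreducibility, not a general factoring routine.

```python
from math import isqrt


def mul(u, v):
    """(a + b*sqrt(-5)) * (c + d*sqrt(-5)) as a pair."""
    a, b = u
    c, d = v
    return (a * c - 5 * b * d, a * d + b * c)


def norm(u):
    """Multiplicative norm a^2 + 5 b^2."""
    a, b = u
    return a * a + 5 * b * b


def norm_is_attained(n):
    """True if a^2 + 5 b^2 = n has an integer solution."""
    for b in range(isqrt(n // 5) + 1):
        rest = n - 5 * b * b
        if isqrt(rest) ** 2 == rest:
            return True
    return False


def has_no_proper_factor(u):
    """Sufficient test: no proper divisor of norm(u) occurs as a norm."""
    n = norm(u)
    return n > 1 and not any(
        norm_is_attained(d) for d in range(2, n) if n % d == 0
    )


factors = [(2, 0), (3, 0), (1, 1), (1, -1)]
same_product = mul((2, 0), (3, 0)) == mul((1, 1), (1, -1)) == (6, 0)
all_irreducible = all(has_no_proper_factor(u) for u in factors)
```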


Algebraic numbers and functions. The story of how unique factorisation was lost, 
and then regained by Kummer and Dedekind’s theory of ideals, has often been 
told (for example in [1], pp. 818-824). I shall not repeat it here. Instead I shall 
sketch how Dedekind built the theory of algebraic integers to make unique 
factorisation possible, and what he discovered in the process.

Dedekind’s main idea was to embed rings of algebraic integers in algebraic
number fields, where concepts of linear algebra come to the surface. “Sufficiently
small” rings of algebraic integers lie in fields K of finite dimension over Q, and
any such field is of the form $K = Q(\alpha)$ where the degree of $\alpha$ equals the
dimension of K. The concept of field and the existence of the “primitive element”
$\alpha$ were already implicit in Abel and Galois, but Dedekind made the crucial
identification of degree with dimension by observing that $\{1, \alpha, \alpha^2, \ldots, \alpha^{n-1}\}$ is a
basis for K, where n is the degree of $\alpha$. This enabled him to give simple linear
algebra proofs of theorems previously dependent on properties of symmetric
functions, such as the closure of algebraic integers under +, − and ×. It also
enabled him to develop a “linear” approach to Galois theory, and to apply Galois 
theory to number theory. 

These ideas were presented in the supplements Dedekind wrote for the 2nd, 
3rd and 4th editions of Dirichlet’s Zahlentheorie (1871, 1878, 1893). They were 
extended in Hilbert’s Zahlbericht (1897) and presented there in almost modern 
form. Many 20th century algebraists, such as Emmy Noether, Artin and van der 
Waerden, learned their algebra from these works. In fact, Emmy Noether used to 
say “Es steht schon bei Dedekind” (“It’s already in Dedekind”). 

A great benefit to flow from Dedekind’s approach to algebraic numbers was a 
new approach to algebraic functions. Among algebraic functions of one variable, 
the polynomials play the role of the integers. This idea goes back at least as far as 
Stevin’s L’arithmétique (1585), where the euclidean algorithm is used to find the 
gcd of polynomials. Dedekind took the idea that polynomials are “integers”, and 
generalised to an analogy of algebraic integers. The ring C[z] of complex polyno- 
mials in z is extended to the field C(z) of rational functions, and each algebraic 
function lies in a finite-dimensional extension K of C(z). The functions analogous 
to algebraic integers are the entire functions—those with a finite value for each 
value of z. By carrying over concepts from number theory to function theory, 
Dedekind and Weber (1882) were able to give a completely algebraic definition of 
a Riemann surface. In doing so, they put many theorems about Riemann surfaces 
(that is, algebraic curves) on a sound basis for the first time, and laid the
foundation of modern algebraic geometry. 

This is very interesting, but is it number theory? Can this rarified form of 
geometry tell us anything about the ordinary integers? A full answer would cover 
most of 20th century mathematics, but the short answer of course is yes. All of 
these ideas, and much more, are needed for Andrew Wiles’ proof of Fermat’s last 
theorem.

Let $G_n$ be the undirected graph whose vertices are the unlabeled graphs on n
vertices (e.g., $G_4$ has 11 vertices), two of which are adjacent in $G_n$ if and only if one
can be obtained from the other by deleting an edge.

(a) Show that neither $G_4$ nor $G_5$ contains a Hamiltonian path.

(b)* Does $G_n$ contain a Hamiltonian path for any n > 5?


10371. Proposed by Emil Yankov Stoyanov, Antiem I Mathematical School, Vidin, 
Bulgaria. 


Let B’ and C’ be points on the sides AB and AC, respectively, of a given 
triangle ABC, and let P be a point on the segment B’C’. Determine the maximum 
value of 


$\min\{[BPB'], [CPC']\} / [ABC]$


where [F] denotes the area of F. 


10372. Proposed by Paul R. Chernoff and Jacob Feldman, University of California, 
Berkeley, CA. 


Let $(f_n)_{n=1}^{\infty}$ be a sequence of non-negative integrable functions on the unit
interval [0, 1]. Write $\int_0^1 f_n(x)\,dx = c_n$ and suppose that $\sum c_n < \infty$.

(a) Suppose also that $\sum \sqrt{c_n} < \infty$. Show that there is a convergent series of
non-negative terms $a_n$ such that, for almost all $x \in [0,1]$, $f_n(x) < a_n$ for all
sufficiently large n.

(b) Show that the conclusion of (a) may fail if $\sum \sqrt{c_n} = \infty$.


10373. Proposed by M. J. Pelling, Balliol College, Oxford, England. 


Let $E_n \subset I = [0,1]$ be a sequence of measurable sets in the unit interval with
measures $mE_n > \delta > 0$ bounded away from zero. Prove that there is a subsequence
$(E_{n_k})$ whose intersection has the cardinality of the continuum.


10374. Proposed by David L. Book, University of Maryland, College Park, MD. 


Given an integer N, characterize the smallest square in the plane containing N 
lattice points. 


NOTES 


Notes: (10373) Paul Halmos, “Large Intersections of Large Sets”, this MONTHLY 99
(1992), 307-312, has a discussion of several related questions. (10374) The pro- 
poser supplied an extended narrative which is reproduced below. On that basis, it 
is proposed to call such a configuration a Bolyai square. Consideration of particu- 
lar values of N should illustrate all possibilities of the position of a Bolyai square 
on the lattice.

Narrative on 10374. In the introduction to his translation of J. Bolyai’s “The
Science of Absolute Space” (Non-Euclidean Geometry, Ed. Roberto Bonola, 
Dover, New York, 1955), George Bruce Halsted wrote 


“The profound mathematical ability of Bolyai Janos showed itself physically not only in his 
handling of the violin, where he was a master, but also of arms, where he was 
unapproachable... Bolyai, when in garrison with cavalry officers, was provoked by thirteen of 
them and accepted all of their challenges on condition that he be permitted after each duel to 
play a bit on his violin. He came out the victor from his thirteen duels, leaving his thirteen 
adversaries on the square.” 


Although Halsted neglected to mention it, Bolyai caused his victims to fall in the 
elegant and symmetric pattern shown below: 


After concluding this bravura feat (as a result of which, not surprisingly, he was 
cashiered and lost his captain’s commission in the Austrian army), Bolyai was 
heard to remark that the square in question is the smallest one containing 13 
points of a square lattice. 


SOLUTIONS 


Another Appearance of Stirling Numbers 


10188 [1992, 60]. Proposed by Jürgen Eckhoff, University of California, Davis, CA.


Define a function h on the non-negative integers by putting h(0) = 0 and
$h(r) = \max\{i : i(i-1)/2 < r\}$ for $r \ge 1$. Let $f(r) = r + h(r) + 1$. For $1 \le k \le n$,
let T(n, k) be the number of integer sequences $0 = r_0 < r_1 < \cdots < r_{n-k}$ such
that $f(r_{i-1}) \le r_i \le (k+i)(k+i-1)/2$ for $i = 1, 2, \ldots, n-k$. Prove that T(n, k)
equals the Stirling number of the second kind S(n, k).
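
Before reading the solutions, the claim can be checked for small cases. The Python sketch below is an added illustration, not part of the published problem. It assumes the reading h(r) = max{i : i(i − 1)/2 < r} with a strict inequality, r_0 = 0, and the bounds f(r_{i−1}) ≤ r_i ≤ (k + i)(k + i − 1)/2; the function names are hypothetical. The Stirling numbers are computed from the familiar recurrence S(n, k) = k S(n − 1, k) + S(n − 1, k − 1), which is not the recurrence used in Solution I.

```python
from functools import lru_cache


def h(r):
    """h(0) = 0 and h(r) = max{i : i(i-1)/2 < r} for r >= 1."""
    if r == 0:
        return 0
    i = 1
    while (i + 1) * i // 2 < r:  # does i + 1 still satisfy the condition?
        i += 1
    return i


def f(r):
    return r + h(r) + 1


def T(n, k):
    """Count sequences 0 = r_0 < r_1 < ... < r_{n-k} obeying the bounds."""
    steps = n - k

    @lru_cache(maxsize=None)
    def count(i, prev):
        if i > steps:
            return 1
        upper = (k + i) * (k + i - 1) // 2
        return sum(count(i + 1, r) for r in range(f(prev), upper + 1))

    return count(1, 0)


def stirling2(n, k):
    """Stirling number of the second kind via the standard recurrence."""
    if n == k:
        return 1
    if k == 0 or k > n:
        return 0
    return k * stirling2(n - 1, k) + stirling2(n - 1, k - 1)


agree = all(
    T(n, k) == stirling2(n, k) for n in range(1, 8) for k in range(1, n + 1)
)
```

Under these assumptions the two tables agree for all 1 ≤ k ≤ n ≤ 7; for example T(4, 2) = 7 and T(5, 3) = 25.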


Solution I by Ivar Skau, Telemark College, Bø, Norway. We prove that T(n, k)
and S(n, k) satisfy the same recurrence. The Stirling number S(n, k) is the
number of partitions of an n-set into k non-empty parts and satisfies the recurrence
$S(n, k) = \sum_{r=k-1}^{n-1} \binom{n-1}{r} S(r, k-1)$, where the partitions are grouped by how
many elements are in blocks different from the nth element. The initial condition
here is S(n, 1) = 1 for $n \ge 1$. Note also that S(n, k) = 0 if n < k.

Let $\langle r_i \rangle$ be one of the sequences counted by T(n, k); call these (n, k)-sequences.
Let $H(s) = \{r : \binom{s}{2} < r \le \binom{s+1}{2}\}$; note that |H(s)| = s. If $r_{i-1} \in H(s)$, then
$h(r_{i-1}) = s$, and hence $r_i \notin H(s)$. Indeed, if $r_{i-1}$ is the jth element of H(s), then
$r_i$ cannot be any of the first j elements of H(s + 1). These observations imply that
for k = 1, the only legal sequence is the sequence with $r_i = \binom{i+1}{2}$. We also have
T(n, k) = 0 if n < k. Hence the initial conditions for S and T agree.

For k > 1, we obtain a recurrence for T(n, k). Let $R_k(n, i)$ be the number of
(n, k)-sequences with $r_{n-k} = \binom{n-1}{2} + i$, for $1 \le i \le n-1$. The comments above
about (n, k)-sequences and the sets H(s) have several implications:

1) $R_k(n, n-1) = T(n-1, k)$, because an (n, k)-sequence ending with
$\binom{n}{2}$ requires $r_{n-k-1} \le \binom{n-1}{2}$.

2) $R_k(n, 1) = T(n-2, k-1)$, because an (n, k)-sequence ending with
$\binom{n-1}{2} + 1$ requires $r_{n-k-1} \le \binom{n-2}{2}$.

3) $R_k(n, i) - R_k(n, i-1) = R_k(n-1, i-1)$, because an (n − 1, k)-sequence
extends to an (n, k)-sequence by appending $\binom{n-1}{2} + i$ but not $\binom{n-1}{2} + i - 1$
if and only if $r_{n-k-1} = \binom{n-2}{2} + i - 1$.

If we rewrite (3) as $R_k(n, i) = R_k(n, i-1) + R_k(n-1, i-1)$, we can iterate
the recurrence i − 1 times to obtain
$R_k(n, i) = \sum_{j=0}^{i-1} \binom{i-1}{j} R_k(n-j, 1) = \sum_{j=0}^{i-1} \binom{i-1}{j} T(n-j-2, k-1),$
where the last equality employs (2). If we set i = n − 1, apply (1), and set
r = n − 2 − j, this becomes
$T(n-1, k) = R_k(n, n-1) = \sum_{r=0}^{n-2} \binom{n-2}{r} T(r, k-1).$
Since T(r, k − 1) = 0 when r < k − 1, we can truncate the summation to begin at
r = k − 1. Now, since the choice of n was arbitrary, we find that T(n, k) satisfies
the same recurrence and initial conditions as S(n, k), so T(n, k) = S(n, k).


Solution II by Gruia Călinescu (student), University of Bucharest, Bucharest,
Romania. We establish a bijection g from the set S of partitions of an n-set into k
blocks to the set T of (n, k)-sequences. We turn a k-partition P of [n] = {1,...,n}
into an (n, k)-sequence $0 < r_1 < r_2 < \cdots < r_{n-k}$, where the elements $r_i$ are
positions in an ordering on pairs of distinct positive integers. Let L be the reverse
lexicographic ordering on these pairs, with (2, 1) < (3, 1) < (3, 2) < (4, 1) < ....
To describe the positions in L, we write 1 = L(2, 1), 2 = L(3, 1), etc. In the
notation of Solution I, note that $L(b, a) \in H(b-1)$, and hence h(L(b, a)) =
b − 1.

Beginning with the partition Q of [n] into singletons, we iteratively combine two 
blocks that are contained within the same block of P. After doing this n — k 
times, we will have P. We also maintain an order on the blocks of the current
partition Q; in the initial Q, the jth block is {j}. With i = n − k initially, let i
denote the number of steps remaining, so Q has k + i blocks.

Let C be the last block of Q that is not yet a block of P, and let B be the last 
block of Q that appears together with C in a single block of P. Let (s, t) be the
positions of C, B in Q, and let r, = L(s,t). To form the next Q, replace C by 
C UB in Q and eliminate B. The sets that were after B in Q (including C U B in 
place of C) have their position in Q reduced by 1. 

Since Q has k + i blocks, $r_i \le \binom{k+i}{2}$. Also $r_i > 0$. We show $r_i \ge f(r_{i-1})$ to
prove that $\{r_i\}_{i=1}^{n-k}$ is an (n, k)-sequence. Suppose $r_i = L(b, a)$ and $r_{i-1} = L(d, c)$.
Then $f(r_{i-1}) = L(d, c) + d$. When we compute $r_i$, every block after the bth block
of Q is already a block of P, so we have d < b. Similarly, if d = b − 1, then
c < a. If d < b − 1, then $L(b, a) \ge L(d, c) + d = f(r_{i-1})$, since the length of
H(d + 1) is d. If d = b − 1, then $L(b, a) = L(d, c) + d - 1 + a - c \ge f(r_{i-1})$.

Hence our map g produces an (n, k)-sequence. The map is injective; it is easy 
to compute from distinct partitions a point at which the resulting sequences will 
differ. To show g is surjective, we construct the inverse. Starting with Q being the 
ordered partition into singletons, we determine for i = n − k, ..., 1 the
unique blocks that could be combined to produce the next partition while producing
$r_i$. Since $r_i \le \binom{k+i}{2}$, the pair (b, a) with $r_i = L(b, a)$ is a 2-element set with
$a < b \le k + i$. Hence we combine the ath block and bth block of Q. After n − k
steps, we obtain a partition P with k blocks. 

The fact that $r_i \ge f(r_{i-1})$ for each i allows us to verify that these unions are
precisely the unions performed when g is applied to P. It implies that the index of 
the higher combined block always decreases. Also, if that index decreases by 
exactly one, then the index of the lower combined block decreases. Hence a block 
remains unchanged once the two combined blocks have lower indices than it, and 
therefore it remains when P is reached. These facts imply that the choices are the 
same as when g is used to compute the sequence from P.


Solved also by J. H. Steelman and the proposer.


An Identity for the Other Stirling Numbers


10195 [1992, 162]. Proposed by Andrew Granville, University of Georgia, Athens, 
GA. 


For $m \ge k \ge 1$, define numbers b(k, m) by

$b(1, m) = 1$ for $m \ge 1$,

$b(k+1, m) = \sum_{j=k}^{m-1} b(k, j) \left( \frac{1}{j} + \frac{1}{m-j} \right)$ for $m \ge k+1 \ge 2$.

For example, b(2, m) is twice the (m − 1)th partial sum of the harmonic series (for
$m \ge 2$).
(a) Prove that

$\sum_{k=1}^{m} \frac{(-1)^k}{k!} b(k, m) = 0$ for $m \ge 2$.


(b) Prove that $(m-1)!\,b(k, m) = k!\,\sigma(m, k)$, where $\sigma(m, k)$ is the unsigned
Stirling number of the first kind.

Composite solution by Robin J. Chapman, University of Exeter, Exeter, U.K., and
Anchorage Math Solutions Group, University of Alaska, Anchorage, AK. Recall
that $\sigma(m, k)$ is the number of permutations of [m] having k cycles. We prove first
that (a) follows from (b). Expressing b(k, m) in terms of $\sigma(m, k)$ converts the
desired identity to $\frac{1}{(m-1)!} \sum_{k=1}^{m} (-1)^k \sigma(m, k) = 0$ for $m \ge 2$. Since a permutation
of [m] with k cycles has sign $(-1)^{m-k}$, this is equivalent to the fact that for
$m \ge 2$ the symmetric group $S_m$ has equal numbers of odd and even permutations.

To prove (b), we show that $k!\,\sigma(m, k)$ satisfies the same recurrence as
$(m-1)!\,b(k, m)$. The quantity $k!\,\sigma(m, k)$ is the number of ordered lists of the
cycles comprising a permutation of [m] with k cycles. If k = 1, there are (m − 1)!
of these, which equals $(m-1)!\,b(1, m)$. If k > 1, we count these lists by the
number j of elements not in the first cycle. By picking these elements, forming an
ordered list of k cycles for a permutation of them, and permuting the elements of
the first cycle, we have

$(k+1)!\,\sigma(m, k+1) = \sum_{j=k}^{m-1} \binom{m}{j} k!\,\sigma(j, k)\,(m-j-1)!.$

Since $1/j + 1/(m-j) = m/(j(m-j))$, the numbers $(m-1)!\,b(k, m)$ satisfy the
same recurrence.
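
Both parts of the problem can be confirmed for small m with exact rational arithmetic. The Python sketch below is an added illustration, not part of the published solution. It assumes the recurrence reads b(k + 1, m) = Σ_{j=k}^{m−1} b(k, j)(1/j + 1/(m − j)), which reproduces the stated value of b(2, m); the function names b and cycles are hypothetical, and the unsigned Stirling numbers of the first kind are computed from their usual recurrence.

```python
from fractions import Fraction
from functools import lru_cache
from math import factorial


@lru_cache(maxsize=None)
def b(k, m):
    """The numbers b(k, m) for m >= k >= 1, as exact fractions."""
    if k == 1:
        return Fraction(1)
    return sum(
        (b(k - 1, j) * (Fraction(1, j) + Fraction(1, m - j))
         for j in range(k - 1, m)),
        Fraction(0),
    )


@lru_cache(maxsize=None)
def cycles(m, k):
    """Number of permutations of m elements with exactly k cycles."""
    if m == 0:
        return 1 if k == 0 else 0
    if k == 0:
        return 0
    return cycles(m - 1, k - 1) + (m - 1) * cycles(m - 1, k)


part_b = all(
    factorial(m - 1) * b(k, m) == factorial(k) * cycles(m, k)
    for m in range(1, 9) for k in range(1, m + 1)
)
part_a = all(
    sum(Fraction((-1) ** k, factorial(k)) * b(k, m) for k in range(1, m + 1)) == 0
    for m in range(2, 9)
)
```

For example b(2, 3) = 3, so that 2! · b(2, 3) = 6 = 2! · 3, there being three permutations of three elements with two cycles.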


Solved also by D. Callan, K. S. Kedlaya (student), P. Tracy, National Security Agency Problems 
Group, and the proposer. 


Saturated Chain Partitions and Fibonacci Numbers 


10199 [1992, 162]. Proposed by Richard Stanley, Massachusetts Institute of Technol- 
ogy, Cambridge, MA. 


Given a finite partially ordered set P, let f(P) denote the number of ways to 
partition the elements of P into pairwise disjoint nonempty saturated chains. 

(a) Prove that if $P_n$ is the product of two n-element chains, i.e., if
$P_n = \{(i, j) : 1 \le i \le n,\ 1 \le j \le n\}$, with $(i, j) \le (i', j')$ if and only if $i \le i'$ and $j \le j'$, then
$f(P_n) = \prod_{i=1}^{n} F_{2i}^2$, where $F_k$ is the kth Fibonacci number.

(b) If every element of P covers at most two elements and is covered by at most 
two elements, prove that f(P) factors into Fibonacci and Lucas numbers. 


Solution by William Y. C. Chen, Los Alamos National Laboratory, Los Alamos, 
NM. We first define an operation on a poset P that preserves f(P). Given an 
element x € P, let P’ be the poset obtained by replacing x by two incomparable 
elements x’, x”, one of which is a minimal element covered by the elements 
covering x in P, the other of which is a maximal element covering the elements 
covered by x in P. A partition of P becomes a partition of P’ by splitting the 
chain containing x, and this map is reversible, so f(P) = f(P’). We call this 
operation splitting x. Let P* be the poset obtained from P by splitting every 
element that is not maximal or minimal. It suffices to compute f(P*). 

(a) For $P_n$, the poset P* is a disjoint union of zigzags, where the zigzag $Z_k$ is
the poset with all elements either maximal or minimal whose comparability graph
is a k-vertex path. By considering the length of the chain containing a given
endpoint of this comparability graph, we find that $f(Z_k) = f(Z_{k-1}) + f(Z_{k-2})$ for
k > 2. Since $f(Z_1) = 1$ and $f(Z_2) = 2$, we have $f(Z_k) = F_{k+1}$. The poset P* is
the disjoint union of $Z_3, Z_5, \ldots, Z_{2n-1}, Z_{2n-1}, Z_{2n-3}, \ldots, Z_3$, one zigzag for each
consecutive pair of levels. Since $F_2 = 1$, we have $f(P_n) = \prod_{i=1}^{n} F_{2i}^2$.

(b) When every element of P covers at most two elements and is covered by at
most two elements, P* consists of disjoint zigzags and crowns, which are the
posets whose comparability graphs are even cycles. Let $Q_{2k}$ denote the crown with
2k elements. By considering whether a fixed edge of this comparability graph is a
chain in a partition, we find that $f(Q_{2k}) = f(Z_{2k}) + f(Z_{2k-2}) = F_{2k+1} + F_{2k-1}$.
Since $L_1 = 1$, $L_2 = 3$, and $L_n = L_{n-1} + L_{n-2}$ for n > 2, it is immediate by
induction that the nth Lucas number $L_n$ satisfies $L_n = F_{n+1} + F_{n-1}$. Thus we have
$f(Q_{2k}) = L_{2k}$, which completes the proof.
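
Part (a) can be checked by brute force for very small n. The Python sketch below is an added illustration and not the solver's method. It assumes the product in (a) is the product of the squares of F_2, F_4, ..., F_{2n} with F_1 = F_2 = 1, and it encodes a partition into saturated chains as a set of cover relations in which no element is the lower end of two chosen covers or the upper end of two chosen covers. The function names are hypothetical, and the search is exponential in the number of cover relations, so it is only practical for n ≤ 3.

```python
def fib(k):
    """kth Fibonacci number with F_1 = F_2 = 1."""
    a, b = 1, 1
    for _ in range(k - 1):
        a, b = b, a + b
    return a


def count_chain_partitions(n):
    """Count partitions of the n x n grid poset into saturated chains."""
    # (i, j) is covered by (i + 1, j) and by (i, j + 1).
    covers = [((i, j), (i + 1, j)) for i in range(1, n) for j in range(1, n + 1)]
    covers += [((i, j), (i, j + 1)) for i in range(1, n + 1) for j in range(1, n)]
    total = 0
    for mask in range(1 << len(covers)):
        used_as_lower, used_as_upper = set(), set()
        valid = True
        for idx, (lower, upper) in enumerate(covers):
            if (mask >> idx) & 1:
                if lower in used_as_lower or upper in used_as_upper:
                    valid = False
                    break
                used_as_lower.add(lower)
                used_as_upper.add(upper)
        total += valid
    return total


def predicted(n):
    """Product of F_{2i}^2 for i = 1, ..., n."""
    result = 1
    for i in range(1, n + 1):
        result *= fib(2 * i) ** 2
    return result


counts = [count_chain_partitions(n) for n in (1, 2, 3)]
```

The counts for n = 1, 2, 3 are 1, 9 and 576, in agreement with the product formula.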


Solved also by R. J. Chapman (U. K.), R. Stong, and the proposer. Part (a) only solved by J. C. Binz 
(Switzerland), and the Anchorage Math Solutions Group. 


Still Uniquely Fibonacci 


10203 [1992, 265]. Proposed by Ivan Vidav, University of Ljubljana, Ljubljana, 
Yugoslavia. 


Suppose that a, b, c and d are positive integers satisfying the two relations 


$b^2 + 1 = ac$ and $c^2 + 1 = bd$.
Prove that a = 3b —c and d = 3c — b. 


Solution by Nasha Komanda, Central Michigan University, Mt. Pleasant, MI. 
Note the equivalence of a = 3b − c to $b^2 + 1 = (3b - c)c$ to $b^2 + c^2 + 1 = 3bc$
to d = 3c − b. It therefore suffices to prove that if c divides $b^2 + 1$ and b divides
$c^2 + 1$, then $b^2 + c^2 + 1 = 3bc$. We may assume $b \le c$ and prove this assertion by
induction on b + c.

If b + c = 2, then b = c = 1 and the assertion holds. If c > 1, then $b \ne c$,
which implies $b \le c - 1$, then $b^2 + 1 < c^2$, and a < c. The equalities $b^2 + 1 = ac$
and $c^2 + 1 = bd$ imply that $(b^2 + 1)^2 = a^2(bd - 1)$ and therefore $1 \equiv -a^2$
(mod b), which means b divides $a^2 + 1$. Since a divides $b^2 + 1$ and a + b < b + c,
we can apply the induction hypothesis to conclude that $a^2 + b^2 + 1 = 3ab$.

Now, $(b^2 + 1)^2 = a^2c^2 = (3ab - b^2 - 1)c^2 = 3abc^2 - (b^2 + 1)c^2$. Therefore,
$(b^2 + 1)(b^2 + c^2 + 1) = 3abc^2 = 3bc(b^2 + 1)$. We conclude $b^2 + c^2 + 1 = 3bc$,
which completes the proof. 
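
The statement is easy to test by exhaustive search. The Python sketch below is an added illustration, not part of the published solution; the function name solutions and the search bound are hypothetical choices. It lists every quadruple (a, b, c, d) of positive integers with b, c up to a bound that satisfies both relations, so the two linear identities can be checked on each.

```python
def solutions(limit):
    """All (a, b, c, d) with b^2 + 1 = ac, c^2 + 1 = bd and 1 <= b, c <= limit."""
    found = []
    for b in range(1, limit + 1):
        for c in range(1, limit + 1):
            if (b * b + 1) % c == 0 and (c * c + 1) % b == 0:
                a = (b * b + 1) // c
                d = (c * c + 1) // b
                found.append((a, b, c, d))
    return found


quadruples = solutions(200)
identities_hold = all(
    a == 3 * b - c and d == 3 * c - b for a, b, c, d in quadruples
)
```

Among the quadruples found are (2, 1, 1, 2), (1, 1, 2, 5) and (1, 2, 5, 13), and the identities a = 3b − c and d = 3c − b hold for every one of them within the bound.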


Editorial comment. Several solvers noted that the solution to E3210 [1987, 457; 
1988, 879] showed that if $b \le c$, $b^2 \equiv -1 \pmod{c}$, and $c^2 \equiv -1 \pmod{b}$, then
$(b, c) = (F_{2n-1}, F_{2n+1})$, where the $F_n$ are Fibonacci numbers. It then follows
quickly that $b^2 + c^2 + 1 = 3bc$ and that b and c satisfy the conditions of this
problem. 

In addition to references provided in the solution to E3210, Gerry Meyerson 
mentioned W. Sierpinski and A. Schinzel, Sur l’équation $x^2 + y^2 + 1 = xyz$,
Matematiche, Catania 10 (1955), 30-36; MR 17, 711e.

Some solvers taking a more general approach used the fact that the continued 
fraction expansion of $(k - 2 + \sqrt{k^2 - 4})/2$ is purely periodic with period
(k — 2,1) for all integers k > 2. 

Robert J. Weber mentioned some related results: 

(1) $x^2 + y^2 = k(xy + 1)$ if and only if $k = l^2$,

(2) $x^2 + y^2 + 1 = k(xy + 1)$ if and only if $k = l^2 + 1$,

(3) $x^2 + y^2 = k(xy - 1)$ if and only if k = 5, and

(4) $x^2 + y^2 - 1 = kxy$ has infinitely many such solutions for every k > 2.

He, along with A. Sudbery and the IMO 1992 Indian Team, noted that (1) was a
problem in the 1988 International Mathematical Olympiad. 

Based on the similarity in method of solution between the Olympiad problem 
and the present one, A. Sudbery suggested that there might be a general proposi- 
tion which would unify these results. The additional references indicate that some 
work has already been done in this direction. A distillation of this brew may prove 
useful. 


Solved by 53 readers and the proposer.


Square Functions


10235 [1992, 571]. Proposed by Daniel Goffinet, Saint Etienne, France. 


(a) Determine the set $\mathcal{F}$ of those continuous maps f from $R^2$ to R such that,
for every rectangle ABCD, one has f(A) + f(C) = f(B) + f(D).

(b) Let KLMN be a quadrangle in the plane such that f(K) + f(M) = f(L) +
f(N) for every $f \in \mathcal{F}$. Is it true that KLMN must be a rectangle?


Solution by Robert B. Israel, University of British Columbia, Vancouver, B. C., 
Canada. Part (a): $\mathcal{F}$ consists of all functions of the form $f(x) = a + b \cdot x + c\,x \cdot x$,
where a and c are real and $b = (b_1, b_2) \in R^2$. Moreover, this is true even if
“rectangle” is replaced by “square”.


Proof: First note that all functions of the above form belong to $\mathcal{F}$. For if KLMN
is a rectangle, write L = K + u, N = K + v, M = K + u + v where u and v are
orthogonal vectors, and we have
$f(K) + f(M) = f(L) + f(N) = 2a + b \cdot (2K + u + v) + c(2K \cdot K + 2K \cdot u + 2K \cdot v + u \cdot u + v \cdot v)$.
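
The computation above can be checked exactly on integer rectangles. The Python sketch below is an added illustration, not part of the published solution; the helper names quad, rectangle and defect are hypothetical. A rectangle is generated from a corner K, an integer side vector u = (p, q) and the perpendicular side v = (−tq, tp), so all arithmetic stays in the integers.

```python
def quad(x, a, b, c):
    """f(x) = a + b.x + c x.x for a point x in the plane."""
    return a + b[0] * x[0] + b[1] * x[1] + c * (x[0] * x[0] + x[1] * x[1])


def rectangle(K, u, t):
    """Corners K, L, M, N with sides u and v = t * (u rotated by 90 degrees)."""
    v = (-t * u[1], t * u[0])
    L = (K[0] + u[0], K[1] + u[1])
    N = (K[0] + v[0], K[1] + v[1])
    M = (L[0] + v[0], L[1] + v[1])
    return K, L, M, N


def defect(f, K, L, M, N):
    """f(K) + f(M) - f(L) - f(N); zero when the rectangle identity holds."""
    return f(K) + f(M) - f(L) - f(N)


def sample(p):
    """An arbitrary member of the family with a = 2, b = (3, -1), c = 5."""
    return quad(p, 2, (3, -1), 5)


always_zero = all(
    defect(sample, *rectangle((k1, k2), (p, q), t)) == 0
    for k1 in range(-2, 3) for k2 in range(-2, 3)
    for p in range(-3, 4) for q in range(-3, 4) for t in range(-2, 3)
)
# By contrast, the product of the coordinates fails on the unit square.
product_defect = defect(lambda p: p[0] * p[1], *rectangle((0, 0), (1, 0), 1))
```

Here always_zero is True, while product_defect equals 1, so the function sending (x_1, x_2) to x_1 x_2 does not satisfy the rectangle identity, consistent with the description of the family.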

Now suppose $f \in \mathcal{F}$. Let r > 0 be given. We can uniquely determine a, b and
c so that the function $g(x) = a + b \cdot x + c\,x \cdot x$ agrees with f(x) on the four points
(0, 0), (r, 0), (−r, 0) and (0, r), namely

$a = f(0, 0),$
$b_1 = \frac{f(r, 0) - f(-r, 0)}{2r},$
$b_2 = \frac{2f(0, r) - f(r, 0) - f(-r, 0)}{2r},$
$c = \frac{f(r, 0) + f(-r, 0) - 2f(0, 0)}{2r^2}.$

Note that since both f and g satisfy the given identity, if they agree at three 
vertices of a square then they agree at the fourth vertex. Now I claim that g agrees 
with f at all points of the square lattice $rZ^2$. First we obtain agreement at (0, −r)
by using the square (−r, 0), (0, r), (r, 0), (0, −r). Next, we use mathematical induction
to show that for every positive integer N, there is agreement at all lattice
points with $|x| + |y| \le Nr$. We already have the case N = 1. Given that it is true
for N, we obtain agreement at (mr, (N + 1 − m)r) with $1 \le m \le N$ by using the
squares ((m − 1)r, (N + 1 − m)r), (mr, (N + 1 − m)r), (mr, (N − m)r), ((m −
1)r, (N − m)r). Reflecting these squares about the x and/or y axes, we obtain
also the points (±mr, ±(N + 1 − m)r). We obtain (0, (N + 1)r) by using the
square (0, (N + 1)r), (r, Nr), (0, (N − 1)r), (−r, Nr), and similarly we obtain
(0, −(N + 1)r) and (±(N + 1)r, 0).

The definition of g appeared to depend on r. In fact, however, every rational r 
will give the same g. For if $r_1$ is an integer multiple of $r_2$, then $r_1 Z^2 \subset r_2 Z^2$, so the g
defined for $r_2$ must agree with the g defined for $r_1$. Since any two positive
rationals are integer multiples of a single positive rational, they must have the 
same g. 

Thus we have that f and g agree on all points with rational coordinates. By 
continuity, they agree everywhere. 

Part (b): Yes, it must be a rectangle. 

Considering first a function of the form $f(x) = b \cdot x$, we have
$b \cdot (K - N) = b \cdot (L - M)$. Since this is true for all b, K − N = L − M, i.e., KLMN is a
parallelogram. Thus we can write L = K + u, N = K + v, M = K + u + v. Now
using the function $f(x) = (x - K) \cdot (x - K)$, we have
$0 = f(K) + f(M) - f(L) - f(N) = 2u \cdot v$. Thus u and v are orthogonal, i.e., KLMN is a rectangle.


Editorial comment. Vu Ha Van showed that one obtains the same solution of (a)
under the assumption that the functions in $\mathcal{F}$ are Lebesgue measurable rather
than continuous. However, using non-measurable functions g: R → R satisfying
g(x + y) = g(x) + g(y), it is easy to produce additional functions $f: R^2 \to R$
satisfying f(A) + f(C) = f(B) + f(D) for every parallelogram. 


Solved also by F. Alouges & R. Cerf (France), R. J. Chapman (U.K.), M. Dindos (Slovakia), C. P. 
Grant, M. Hejny (Slovakia), H. H. Johnson, S. Kanetkar, I. Kastanas, P. Kinion, N. Komanda, O. P. 
Lossers (The Netherlands), K. Merryfield, M. Mocsy (Hungary), H. V. Vu (student, Hungary), 
University of Wyoming Problem Circle, and the proposer. Four incorrect solutions were also received. 


Conservative Paths in Continuous Vector Fields 


10239 [1992, 674]. Proposed by Ismor Fischer, Naval Postgraduate School, Mon- 
terey, CA. 


A continuous vector field F (in $R^2$ or $R^3$) and a simple closed curve $\Gamma$ are given.
Show that, for every point $x \in \Gamma$, there exists a point $y \in \Gamma$ and a path $\gamma$ from x
to y (nontrivial if x = y) such that the work $W = \int_\gamma F \cdot dr$ is zero.


Solution by Frédéric Brulois, California State University—Dominguez Hills, 
Carson, CA. We prove the stronger (and more simply stated) 


Proposition. Let U be an open set of $R^n$, $x \in U$, and $F: U \to R^n$ a continuous
vector field on U. Then there exists a nontrivial (real analytic) path $\gamma: [0,1] \to U$
from x back to x such that $\int_\gamma F \cdot dr = 0$.


Proof: Let C be a circle through x which is small enough that the convex hull of
C is contained in U. Let $\beta: R \to U$ be a constant speed parameterization of C
such that $\beta(0) = \beta(1) = x$ and $\beta(t) \ne x$ for 0 < t < 1. For each $s \in [0,1]$, define
$\alpha_s: [0,1] \to R^n$ by setting

$\alpha_s(t) = (1 - s)\beta(t) + s\beta(-2t)$, for $0 \le t \le 1$.

Clearly $\alpha_0 = \beta$, while $\alpha_1$ maps twice around the circle C in the direction opposite
that of $\beta$. Also $\alpha_s$ maps into the convex hull of C, hence into U. It is easily seen
that each $\alpha_s$ defines a nontrivial path. Finally $\alpha_s$ is a real analytic map since $\beta$ is.

Now consider the function

$f(s) = \int_{\alpha_s} F \cdot dr = \int_0^1 F(\alpha_s(t)) \cdot \alpha_s'(t)\,dt$, for $0 \le s \le 1$.

The function $(s, t) \mapsto F(\alpha_s(t)) \cdot \alpha_s'(t)$ is continuous, hence uniformly continuous on
the compact set [0,1] × [0,1]. Thus f is continuous on [0, 1]. Now

$f(0) = \int_\beta F \cdot dr$ and $f(1) = \int_{\alpha_1} F \cdot dr = -2 \int_\beta F \cdot dr$.

Therefore, by the Intermediate Value Theorem, there exists $s_0 \in [0,1]$ such that
$f(s_0) = 0$. Take $\gamma = \alpha_{s_0}$. Then $\int_\gamma F \cdot dr = f(s_0) = 0$.

The proposition shows that we may always take y = x: the curve $\Gamma$ in the
statement of the problem turns out to be irrelevant! The phenomenon is purely
local. 


Editorial comment. It would be tempting to use $\beta(-t)$ instead of $\beta(-2t)$ in
the definition of $\alpha_s$. However, in that case $\alpha_{1/2}$ would consist of a curve which
moves along a line segment once in each direction. Such a curve should be 
considered as a trivial path from x to itself, since the work along such a path is 
zero for all vector fields. 


Solved also by I. Kastanas, O. P. Lossers (The Netherlands), A. Nijenhuis, Western Maryland 
College Problems group, and the proposer. 


Collaborating editors: David F. Appleyard, Paul T. Bateman, Duane M. Broline, 
Barry W. Brunson, Frank S. Cater, Gulbank D. Chakerian, Underwood Dudley, 
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. 
Griggs, Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, 
Daniel J. Kleitman, Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer,
Stephen L. Portnoy, J. O. Shallit, John Henry Steelman, Kenneth B. Stolarsky, 
David E. Tepper, Douglas B. Tyler, Daniel Ullman, and William E. Watkins. 


“Mathematics was born and nurtured in a
cultural environment. Without the perspective
which the cultural background
affords, a proper appreciation of the content
and state of present-day mathematics
is hardly possible.”
—R. L. Wilder


REVIEWS


Edited by Darrell Haile 
Indiana University, Bloomington IN 47405 


Revolutions in Mathematics. Edited by Donald Gillies, Oxford University
Press, New York, 1992, viii + 353, $98.00


Reviewed by Michael S. Mahoney 


The theme of this book may need explication for potential readers who are not 
historians or philosophers of science. The title should have a question mark. At 
issue among the authors is precisely whether, and in what sense, revolutions occur 
in mathematics. That question hangs on what one means by “revolution,” and 
behind that lies the larger question of the extent to which the history of mathemat- 
ics is amenable to the historiographical approaches connoted by the notion of 
revolutions in science. 

The touchstone is Thomas S. Kuhn’s model of scientific change as set forth in 
The Structure of Scientific Revolutions in 1962, amended in a postscript added to 
the work in 1970, and subsequently articulated by a host of commentators, both 
friendly and not. In Structure Kuhn argued that science is inherently conservative. 
It consists normally of puzzle-solving: finding answers to problems well defined by 
a unanimously accepted paradigm (or, in a later version, disciplinary matrix) that
dictates how they should be solved and that offers assurance that they have 
solutions, however difficult or recalcitrant they might appear. Practice departs 
from the norm when puzzles resist efforts at solution by ever more skillful 
practitioners. Unsolved problems may be tolerated for a time as anomalies, or 
problems that don’t fit the rules, but at some point, a solution appears to some 
practitioners as crucial to the continued viability of the field. Either it is resolved, 
or some fundamental change in the rules is required to make it tractable. The 
paradigm has reached a crisis, the point of decision whether it will survive. Either 
the problem succumbs, or the paradigm does. Where a solution demands a new 
paradigm, or rather itself constitutes a new paradigm, a revolution occurs, marking 
a radical discontinuity in the development of the scientific field in question. 

The discontinuity takes both intellectual and social form. The old paradigm may 
lose meaning entirely, as phlogiston did when replaced by the oxygen theory of 
combustion. But, even where portions of the old paradigm are incorporated in the 
new, they acquire new or altered meaning through a change in the basic concepts 
of the field, as in the shift from classical to relativistic mechanics. Practitioners of 
the old paradigm find their skills obsolete, their expertise no longer of interest. 
That is where scientific revolutions become as bloody as intellectual combat is 
likely to get. Reputations are at stake, as is the work of whole careers, and the 
people involved understandably resist losing them. 

Kuhn’s model appealed to historians because it opened scientific change to 
operative factors external to the science at issue. How problems attract attention, 
how they become critical, whence new paradigms arise, how the new is compared 
to the old: none of these questions can be addressed, much less answered, from 
within either the old or the new paradigm. Revolutions signal a break in the 
conceptual autonomy that seems to insulate normal science from influences 
beyond its own rules. Through revolutions, historians could root science in place 
and time, in social structures and cultural contexts. 

Sensitive to the new historiography of science of which Kuhn was both symptom 
and stimulus, Michael Crowe in 1974 presented to a colloquium on the history of 
modern mathematics at the American Academy of Arts and Sciences his “Ten 
‘laws’ concerning patterns of change in the history of mathematics,” which ap- 
peared in Historia Mathematica the following year and forms Chapter 1 of the 
present work. The tenth law asserts that “Revolutions never occur in mathematics.” 
Revolutions require that “some previously existing entity (be it king, constitution, 
or theory) must be overthrown and irrevocably discarded” [19]. But while that may 
occur in mathematical symbolism, terminology, methodology, or even historiogra- 
phy, it does not happen in mathematics itself. The substance of the old retains its 
validity, albeit in a different form, while the new takes a place alongside it. 

Crowe’s assertion drew two major responses, republished here with the authors’ 
afterthoughts (Chapters 2–5). In “T.S. Kuhn’s theories and mathematics,” first 
published in 1976, Herbert Mehrtens disagreed with Crowe’s separation of sub- 
stance from form in mathematics: “... there are events in the history of mathemat- 
ics that might be termed ‘revolutions’, and ...there is no point in distinguishing 
these events with respect to their being ‘in’ mathematics or somewhere else” [26]. 
Yet, Mehrtens quickly added that the term “revolution” had more emotive than 
analytic force as an historiographical concept, since the implicit political analogy 
was difficult to sustain. “If there is to be any serious use of the metaphor,” he 
reiterates in his 1992 appendix, “then it should aim at the structures of power and 
legitimacy before and after the event. In mathematics and the sciences one has to 
press the metaphor hard to make it work in this sense” [43]. Kuhn’s notions of 
“normal science” and “anomaly” go a long way in explaining Crowe’s ten laws, 
Mehrtens argues, but “crisis” and hence “revolution” have limited explanatory 
power. In “Conceptual revolutions and the history of mathematics: two studies in 
the growth of knowledge,” read as a paper in 1974 and expanded for publication in 
1984, Joseph Dauben acknowledged the conservative, or cumulative, nature of 
mathematics, while at the same time arguing that episodes such as the Greek 
discovery of incommensurable magnitudes and Cantor’s theory of transfinite sets 
transgressed the accepted limits of mathematical thought at the time, causing a 
reformulation of what had gone before and provoking the sort of resistance to 
change that Kuhn had signaled as a concomitant of revolution. His appendix adds 
two more episodes to the list: Cauchy’s rigorous foundation of the calculus and 
Robinson’s non-standard analysis. 

The remaining chapters, specially commissioned for this volume, take up the 
issue from there, addressing what might be called “Crowe’s quandary.” Clearly, 
there have been radical changes in mathematical thought and practice, changes 
that one would want to call “revolutionary” in the common usage of the term: the 
calculus, non-Euclidean geometry, or abstract algebra, to name a few. Clearly, too, 
they have left older mathematics in place. The school curriculum bears witness to 
the continuing validity of arithmetic, geometry, and algebra as they were practiced 
five hundred years ago and as they today continue to serve as general foundation 
for the subject. (Interestingly, none of the contributors notes the pedagogical 
issues implicit in the notion of revolutions, namely, how students shift their 
thinking as the curriculum recapitulates these episodes of conceptual and 
epistemological upheaval.) 

How then does mathematics change while remaining the same? What changes 
deserve to be called ‘revolutionary’? By what definition or model of “revolution”? 
Very little consensus emerges from the nine contributors. For the most part, they 
share a sense of revolution as a radical break with the past, a point at which new 
ways of thinking become possible, but they have little in common beyond that. In 
several instances, some see revolutions where others do not. Where they agree on 
what episodes were revolutions, in particular, non-Euclidean geometry, they differ 
on what made them revolutionary. 

Paolo Mancosu’s detailed exposition (Chapter 6) of Descartes’s analytic geometry 
leads the reader at first to think that something revolutionary was going on, 
but, after reviewing both pre-Kuhnian and post-Kuhnian claims to that effect, 
Mancosu has his doubts. His scepticism is tempered by his acknowledged disregard 
of Descartes’s theory of equations and of arguments that Descartes’s new geome- 
try was part of a larger, revolutionary shift from geometric to algebraic modes of 
thinking. Emily Grosholz (Chapter 7) makes more of that shift, as algebra provided 
the mediating element in Leibniz’s unification of geometry and number theory to 
form his calculus, which in turn provided a vehicle for uniting mechanics with 
mathematics. In thus opening new domains along the borders of its constituents, 
the calculus also transformed those older fields, thus giving it its revolutionary 
impact. But why it is particularly revolutionary, rather than simply creative or 
innovative, is hard to discern from Grosholz’ focus on Leibniz, which leaves out of 
the picture contemporaries pursuing the same ends in the same and different ways. 

Giulio Giorello (Chapter 8) mixes Kuhn’s model with the political metaphor to 
perceive in the eighteenth-century debate over fluxions a reflection of England’s 
“Glorious Revolution” of 1689, which retained the monarchy by reconciling it with 
the new forms of political legitimacy embodied in Parliament. Here Newton’s 
fluxions as he presented them challenged the established “paradigm of legitimacy”, 
which Berkeley sought to defend in a “counter-revolution”. Maclaurin saved 
Newton’s revolution by reconciling the new modes of reasoning with traditional 
standards of geometric rigor. 

Following Crowe’s lead, Caroline Dunsmore (Chapter 11) allows revolutions 
only at the level of the “metamathematical values of the community that define 
the telos and methods of the subject, and encapsulate general beliefs about its 
nature” and not at the “object-level” of “concepts, terminology and notation, 
definitions, axioms, and theorems, methods of proof and problem-solutions, and 
problems and conjectures” [211]. By her criteria, the discovery of incommensurables, 
negative and imaginary numbers, non-commutative algebra, and transfinite 
sets constitute revolutions, while symbolic algebra and the calculus do not. By 
contrast, looking at algebraic number theory and non-Euclidean and projective 
geometry in the nineteenth century, Jeremy Gray (Chapter 12) argues that “al- 
though the objects of study remained superficially the same, the way they were 
defined, analysed theoretically, and thought about intuitively was entirely 
transformed” [245], constituting a new framework of mathematics incompatible with its 
predecessor. 

Two contributors come at the question from altogether different directions. 
Examining the emergence of non-Euclidean geometry, Yuxing Zheng (Chapter 9) 
sees two sorts of innovation in mathematics: that which runs in the same direction 
as current thinking and that which runs in the opposite direction. The latter 
provokes crisis by introducing contradictions within mathematics, and revolution 
consists in the restoration of harmony by the expansion of concepts to transform 
the contradictions into alternatives. Zheng calls this the “harmonious principle of 
counter-way thinking”. Considering “The geometrical vision of space” (Chapter 
10), Luciano Boi resists any notion of “revolution” that applies sociological or 
historical categories to the development of mathematics. Rather, a wholly internal 
process of “hermeneutics” leads to two forms of growth: “an indefinite ‘rewriting’ 
of mathematical problems in the framework of one particular tradition or in 
different traditions” and “a radical change in the way a mathematical concept or 
field is conceived” [200-201]. In the latter case, the measure of the “revolutionary” 
nature of that change is the capacity of the new concept or field to provide fruitful 
models for the other fields, thus embracing what has gone before while at the same 
time rewriting or reconstruing it. 

The volume’s editor, Donald Gillies, has the next to last word before turning 
the discussion back to Michael Crowe for his afterthoughts. In arguing for “The 
Fregean revolution in logic”, Gillies seeks to reconcile radical change with 
preservation of the old order by contrasting the “Russian” model of revolution with the 
“Franco-British”. The former discarded the monarchy, the latter retained or 
restored it “with greatly diminished powers”. While science has had its Russian 
revolutions, mathematics is restricted to the Franco-British sort. Gillies’s candidate 
for revolution tests that limitation, pointing to the all but complete displacement of 
Aristotelian syllogistic logic by Frege’s propositional and first-order predicate 
calculus, reinforced by Peano’s determination of previously undetected exceptions 
to some syllogisms. In undermining the uniqueness of Aristotle’s logic, the new 
system had metalevel implications similar to those of non-Euclidean geometry and 
for that reason provoked resistance from several quarters, thus fitting both 
Dunsmore’s and Dauben’s models of revolutions. 

According to Garrett Birkhoff, his father’s initial response to lattice theory was 
to ask “what one could do with lattices that couldn’t be done without them” 
(Papers on Algebra and Topology, 577–8). One may ask a similar question of 
Revolutions in Mathematics. What does the notion of “revolutions” do for history 
of mathematics that can’t be done without it? Mehrtens alone raises the issue and 
concludes, “Nothing”. As the other authors debate the possibility of revolutions, 
testing their arguments against a series of well-known episodes, it never becomes 
clear just what is at stake historiographically, despite Gillies’s admirable introduc- 
tory review of the different positions. Grosholz unintentionally provides a hint 
when she concedes at the start of her essay that “the term ‘revolutionary’ has 
come to have an honorific sense in the philosophy of science, and I would like to 
see Leibniz properly honoured.” [116] It sounds like a throw-away line, but it 
captures a tacitly shared assumption among most of the authors, with the obvious 
exception of Mehrtens, who observes in his “Appendix” (1992) that “Revolutions 
are no longer what they were” [43]. All of the developments proposed as revolutions 
here are “good” revolutions. They aim toward mathematics as it is currently 
conceived and thus, perhaps ironically, have a cumulative effect. To read through 
Revolutions in Mathematics is to watch the modern curriculum develop one 
revolutionary step at a time. None of the revolutions leads in the wrong direction. 
There are no failed revolutions, nor is there significant opposition, reaction, or 
counter-revolution. Giorello casts Berkeley in that last role, but only metaphori- 
cally and ultimately in ways supportive of the new calculus. Only Herbert Breger 
puts opposition in the spotlight with his study of Paul Finsler’s intuitive set theory 
(Chapter 13), “a restoration that failed”, less because of its inconsistency than 
because of its lack of fit with the newly dominant style of formalist mathematics 
espoused by Hilbert. 

Historians of political revolutions take the opposition more seriously. For them, 
revolutions may succeed or fail, and successive revolutions—and restorations and 
counter-revolutions—may take a society in different, even opposite, directions. 
The present is neither explanation nor criterion of revolutions in the past. They 
are a matter of choices among alternatives at the time, and all choices equally 
require explanation. The notion of revolutions in mathematics might acquire 
analytic force if historians paid more attention than they have to those who 
resisted the transformations that constitute the present-day subject. Studying 
“reaction” as well as “revolution” would bring out the factors of stasis that define 
change and that show up, in mathematics especially, as continuations of the “old 
order”, thus producing Crowe’s quandary. As the contributions to this volume 
show, whether or not we call these formative episodes “revolutions”, they ulti- 
mately involve clashes of intellectual and social values and hence bring into sharp 
focus how those values shape mathematics. Viewing them in that light means 
taking the values of both sides seriously and bearing in mind that at any juncture 
things could have gone otherwise and still have been mathematics. 


History of Science Department 
Princeton University 
Princeton, NJ 08544 


“Regarding all these basic topics in infinitesimal calculus which we teach today 
as canonical requisites..., the question is never raised, ‘Why so?’ or ‘How does 
one arrive at them?’ Yet all these matters must at one time have been goals of an 
urgent quest, answers to burning questions, at the time, namely, when they were 
created. If we were to go back to the origins of these ideas, they would lose that 
dead appearance of cut-and-dried facts and instead take on fresh and vibrant life 
again.” 


–O. Toeplitz 


TELEGRAPHIC REVIEWS 


Edited by Arnold Ostebee and Paul Zorn 
with the assistance of the Mathematics Departments of 
Carleton, Macalester, and St. Olaf Colleges 


Telegraphic Reviews are designed to alert readers in a timely manner to new books 
and computer software appropriate to mathematics teaching and research. Special 
codes classify reviews by subject area and appropriate use: 

T : Textbook 
C : Computer Software 
S : Supplementary Reading 
P : Professional Reading 
L : Undergraduate Library 
13: Grade Level 
1-4: Semester 
** : Special Emphasis 
?? : Questionable 


Readers are advised that price information is subject to change. Selected books and 
software packages receive a second, more extensive review in the Monthly. 


Books and software submitted for review should be sent to Book Reviews Editor, 
American Mathematical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield, 
MN 55057-1098. 


General, S*(16-17), C. Introduction to Maple. 
André Heck. Springer-Verlag, 1993, xiii + 
497 pp, $39. [ISBN 0-387-97662-0] An 
excellent introduction to the computer alge- 
bra/graphics system Maple. Assumes little pre- 
vious experience with Maple, but moves rapidly 
to use of Maple to solve complex problems. 
Several in-depth examples worked in detail. 
Shows nicely not only how Maple works, but 
why it does what it does. MPR 


General, P. I.M. Gelfand Seminar. Eds: 
Sergei Gelfand, Simon Gindikin. Adv. in So- 
viet Math., V. 16. AMS, 1993. Part 1, xiv + 
241 pp, $71 [ISBN 0-8218-4118-1]; Part 2, xiv 
+ 208 pp, $65. [ISBN 0-8218-4119-X] 14 in- 
vited papers honoring Gelfand’s 80th birthday. 


General, P. Proceedings of the St. Petersburg 
Mathematical Society, Volume I. Eds: O.A. 
Ladyzhenskaya, A.M. Vershik. Transl. Ser. 2, 
V. 155. AMS, 1993, xiii + 223 pp, $99. [ISBN 
0-8218-7505-1] 8 papers in diverse mathe- 
matical areas. 


General, S(13). Mathematik-Vorkurs: Übungs- 
und Arbeitsbuch für Studienanfänger. Wolfgang 
Schäfer, Kurt Georgi. BG Teubner Leipzig, 
1993, 470 pp, DM 44 (P). [ISBN 3- 
8154-2038-5] A compendium, for indepen- 
dent study, of mathematics taught in Ger- 
man secondary schools. Algebra, trigonome- 
try, logic, analytic geometry, vectors, calculus. 
Many exercises, with solutions. JD-B 

General, P. Mathematical TeX by Example. 
Arvind Borde. Academic Pr, 1993, xi + 352 pp, 
$29.95 (P). [ISBN 0-12-117645-2] Shows 
how to use TeX to typeset mathematics. Input 
commands and typeset output appear on facing 
pages. AO 


General, P. Transactions of the Tenth Army 
Conference on Applied Mathematics and Com- 
puting. US Army Research Office (POB 
12211, Research Triangle Park, NC 27709- 
2211), 1993, xxi + 770 pp, (P). 42 pa- 
pers on topics in natural language processing, 
wavelet analysis, variational methods, small 
sample asymptotics, computational fluid dy- 
namics, control theory, parallel programming, 
computational algebraic geometry, and mathe- 
matical aspects of materials science, etc. 


General, P. Mathematical Computation with 
Maple V: Ideas and Applications. Ed: Thomas 
Lee. Birkhäuser, 1993, viii + 199 pp, 
$34.50 (P). [ISBN 0-8176-3724-9] Proceed- 
ings of the 1993 Maple Summer Workshop and 
Symposium at the University of Michigan. 


General, P. An Introductory Guide to Scien- 
tific Visualization. R.A. Earnshaw, N. Wise- 
man. Springer-Verlag, 1992, xvi + 156 pp, $49. 
[ISBN 0-387-54664-2] Written for novices. 
First part: basic ideas, vocabulary, and applica- 
tions. Second part: descriptions of many com- 
mercial and public domain systems. AO 


Recreational Mathematics, L*. The Wohas- 
cum County Problem Book. George T. Gilbert, 
Mark I. Krusemeyer, Loren C. Larson. Dol- 
ciani Math. Expos., No. 14. MAA, 1993, ix + 
233 pp, $26 (P). [ISBN 0-88385-316-7] 130 
challenging, original problems with clear, ele- 
gant solutions (often several per problem). A 
few problems use basic linear or abstract al- 
gebra; most require nothing beyond calculus; 
many are accessible to high school students. A 
must for any problem solver. DH 


Precalculus, T(13: 1). College Algebra and 
Trigonometry, Third Edition. Michael Sulli- 
van. Dellen, 1993, xxi + 1034 pp. [ISBN 0- 
02-418305-9] New in this edition: more em- 
phasis on visualization; exercises and examples 
using graphing calculators; discussion/writing 
questions; some revised and/or reorganized ma- 
terial. AO 


Precalculus, S(13). Schaum’s Outline of The- 
ory and Problems of Mathematical Methods for 
Business and Economics. Edward T. Dowling. 
McGraw-Hill, 1993, ix + 384 pp, $12.95 (P). 
[ISBN 0-07-017697-3] Includes topics from 
high school algebra, linear algebra, linear pro- 
gramming, calculus. 1742 problems, most with 
complete solutions. DH 


Education, P, L. Developments in School 
Mathematics Education Around the World. 
Eds: Izaak Wirszup, Robert Streit. Univ. of 
Chicago School Math. Proj. NCTM, 1992, 
xv + 501 pp, $22 (P). [ISBN 0-87353-356- 
9] Proceedings of the Third UCSMP Interna- 
tional Conference on Mathematics Education 
(Chicago, 1991). 


Education, S(14-17). Making Sense of Data. 
Mary M. Lindquist, et al. Addenda Ser., Grades 
K-6. NCTM, 1992, viii + 48 pp, $9.50 (P), 
[ISBN 0-87353-318-6]; Number Sense and 
Operations. Grace M. Burton, et al. Ad- 
denda Ser., Grades K-6. NCTM, 1993, viii + 
55 pp, $9.50 (P), [ISBN 0-87353-319-4]; Pat- 
terns. Terrence G. Coburn, et al. Addenda 
Ser., Grades K-6. NCTM, 1993, ix + 53 pp, 
$9.50 (P), [ISBN 0-87353-320-8]; Geometry 
and Spatial Sense. John Del Grande, et al. Ad- 
denda Ser., Grades K-6. NCTM, 1993, viii + 
55 pp, $9.50 (P). [ISBN 0-87353-317-8] The 
contents of these four books duplicate the con- 
tents of the seven K-6 grade level books in 
the Addenda Series (TR, April 1993). Grade 
level books were divided into four topic areas; 
topic books are divided into seven grade levels. 
Topic books also include additional background 
information, extensive bibliographies. MW 


Education, P. Mathematicians and Education 
Reform, 1990-1991. Eds: Naomi D. Fisher, 
Harvey B. Keynes, Philip D. Wagreich. CBMS 
Issues in Math. Educ., V. 3. AMS and MAA, 
1993, x + 185 pp, $62 (P). [ISBN 0-8218- 
3503-3] 14 articles on projects and issues in 
mathematics education reform. 


Education, S, P*, L. The Geometric Supposer: 
What Is It a Case Of? Eds: Judah L. Schwartz, 
Michal Yerushalmy, Beth Wilson. Tech. in 
Educ. Ser. Lawrence Erlbaum Assoc, 1993, xi 
+ 254 pp, $49.95. [ISBN 0-8058-0720-9] Reports 
on use of the Geometric Supposer: problems 
in learning and teaching, how learning behavior 
changes. Authors include developers, 
math educators, teachers, and students. JNC 


Education, P. Numerical Cognition. Ed: 
Stanislas Dehaene. Blackwell, 1993, 209 pp, 
$18.95 (P). [ISBN 1-55786-444-6] 5 papers 
survey recent research. 

History, P. Elie Cartan (1869-1951). M.A. 
Akivis, B.A. Rosenfeld. Transl. of Math. 
Mono., V. 123. AMS, 1993, xii + 317 pp, 
$153. [ISBN 0-8218-4587-X] A scientific bi- 
ography. Describes and evaluates Cartan’s most 
important discoveries. AO 


Logic, P. Uncountably Categorical Theories. 
Boris Zilber. Transl. of Math. Mono., V. 117. 
AMS, 1993, vi + 122 pp, $97. [ISBN 0-8218- 
4586-1] 

Logic, P. Many-Valued Logics, Volume 1: 
Theoretical Foundations. Leonard Bolc, Piotr 
Borowik. Springer-Verlag, 1992, xii + 292 pp, 
$69. [ISBN 0-387-55926-4] 


Graph Theory, P. Graph Structure Theory. 
Eds: Neil Robertson, Paul Seymour. Contemp. 
Math., V. 147. AMS, 1993, xiv + 688 pp, 
$81 (P). [ISBN 0-8218-5160-8] Proceedings 
of the 1991 AMS-IMS-SIAM Joint Summer 
Research Conference on Graph Minors held at 
the University of Washington. 


Graph Theory, T(16-17: 1), P, L. The Petersen 
Graph. D.A. Holton, J. Sheehan. Australian 
Math. Soc. Lect. Ser., V. 7. Cambridge 
Univ Pr, 1993, 353 pp, $39.95 (P). [ISBN 0- 
521-43594-3] Studies areas of graph theory 
where the Petersen graph plays an important 
role (often as a counterexample). Covers basic 
definitions, coloring problems, snarks (a certain 
4-edge-connected cubic graph), matchings, 
cages, hypohamiltonian graphs. Usable as a 
second course in graph theory for seniors (ex- 
ercises included) or as reference for researchers 
(unsolved problems mentioned). LC 


Discrete Mathematics, P. Proceedings of 
the Fourth Annual ACM-SIAM Symposium on 
Discrete Algorithms. ACM and SIAM, 1993, 
506 pp, (P). [ISBN 0-89871-313-7] 54 papers 
from the 1993 symposium in Austin, Texas. 

Number Theory, P. Number Theory With 
an Emphasis on the Markoff Spectrum. Eds: 
Andrew D. Pollington, William Moran. Lect. 
Notes in Pure & Appl Math., V. 147. Marcel 
Dekker, 1993, viii + 321 pp, $125 (P). [ISBN 0- 
8247-8902-4] 28 papers from the 1991 Con- 
ference on the Markoff Spectrum and Diophan- 
tine Approximation and Analytic Number The- 
ory at Brigham Young University. 
Number Theory, T**(17). A Course in Computational 
Algebraic Number Theory. Henri 
Cohen. Grad. Texts in Math., V. 138. Springer- 
Verlag, 1993, xxi + 534 pp, $49. [ISBN 0- 
387-55640-0] Computer algorithms for cal- 
culating number-theoretic quantities. First six 
chapters cover fundamental ideas: Euclid’s al- 
gorithm, solving equations modulo a prime, 
polynomial factoring, algebraic number field 
techniques, quadratic field techniques, comput- 
ing Galois groups, etc. Chapter 7 introduces 
elliptic curves (including the Taniyama, Birch, 
and Swinnerton-Dyer Conjectures) and many 
important algorithms (e.g., for computing the 
rational points and the L-function of a curve). 
Chapters 8-10 treat integer factoring and pri- 
mality testing. An impressive text—probably 
the standard for some time to come. MPR 


Number Theory, P. Local Fields and Their 
Extensions: A Constructive Approach. I.B. Fesenko, 
S.V. Vostokov. Transl. of Math. Mono., 
V. 121. AMS, 1993, xv + 283 pp, $118. [ISBN 
0-8218-4613-2] 


Number Theory, P. Collected Papers of Kustaa 
Inkeri. Eds: Tauno Metsänkylä, Paulo 
Ribenboim. Papers in Pure & Appl. Math., 
V. 91. Queen’s Univ, 1992, xxxi + 566 pp, (P). 
[ISBN 0-88911-632-6] 


Number Theory, P. The Stickelberger Ideal 
in the Spirit of Kummer with Application to the 
First Case of Fermat’s Last Theorem. Vijay 
Jha. Queen’s Papers in Pure & Appl. Math., 
No. 93. Queen’s Univ, 1993, xiv + 181 pp, (P). 


Group Theory, P. Group Representations, Vol- 
ume 2. Gregory Karpilovsky. Math. Stud., 
V. 177. North-Holland (US Distr: Elsevier Sci- 
ence), 1993, xv + 902 pp, $185.75. [ISBN 
0-444-88726-1] 


Group Theory, P. The Admissible Dual of 
GL(N) via Compact Open Subgroups. Colin J. 
Bushnell, Philip C. Kutzko. Annals of Math. 
Stud., No. 129. Princeton Univ Pr, 1993, ix 
+ 313 pp, $59.50; $24.95 (P). [ISBN 0-691- 
03256-4; 0-691-02114-7] 


Group Theory, P. Abelian Groups. Eds: László 
Fuchs, Rüdiger Göbel. Lect. Notes in Pure 
& Appl. Math., V. 146. Marcel Dekker, 1993, 
xii + 260 pp, $99.75 (P). [ISBN 0-8247-8901-6] 
Proceedings of the International Conference on 
Torsion-free Abelian Groups (Curaçao, 1991). 
4 survey articles and 17 research articles. 


Group Theory, P. Arithmetic of Probability 
Distributions, and Characterization Problems 
on Abelian Groups. G.M. Fel’dman. Transl. of 
Math. Mono., V. 116. AMS, 1993, 223 pp, 
$127. [ISBN 0-8218-4593-4] 
Group Theory, P. Subgroups of Teichmüller 
Modular Groups. Nikolai V. Ivanov. Transl. of 
Math. Mono., V. 115. AMS, 1992, xii + 127 pp, 
$107. [ISBN 0-8218-4594-2] 


Algebra, P. Selected Papers in K-Theory. 
Transl. Ser. 2, V. 154. AMS, 1992, ix + 195 pp, 
$83. [ISBN 0-8218-7504-3] 


Algebra, P. Boolean Constructions in Univer- 
sal Algebras. A.G. Pinus. Math. & Its Applic., 
V. 242. Kluwer Academic, 1993, vii + 350 pp, 
$149. [ISBN 0-7923-2117-0] 


Algebra, P. Ordered Algebraic Structures: The 
1991 Conrad Conference. Eds: J. Martinez, 
C. Holland. Kluwer Academic, 1993, xiii + 
258 pp, $110.50. [ISBN 0-7923-2258-4] 14 
papers from a conference held at the University 
of Florida to honor Paul F. Conrad. 


Algebra, P. Kac Algebras and Duality of Lo- 
cally Compact Groups. Michel Enock, Jean- 
Marie Schwartz. Springer-Verlag, 1992, x + 
257 pp, $98. [ISBN 0-387-54745-2] 


Algebra, P. Theory of Commutative Fields. 
Masayoshi Nagata. Transl: Masayoshi Nagata. 
Transl. of Math. Mono., V. 125. AMS, 1993, 
xv + 249 pp, $125. [ISBN 0-8218-4572-1] 


Algebra, P. Algebras and Orders. Eds: Ivo G. 
Rosenberg, Gert Sabidussi. NATO ASI Ser. C, 
V. 389. Kluwer Academic, 1993, xvii + 553 pp, 
$220. [ISBN 0-7923-2143-X] 12 papers by 
invited speakers at the 1991 NATO Advanced 
Study Institute in Montréal. 


Algebra, P. Noncommutative Distributions: 
Unitary Representation of Gauge Groups and 
Algebras. Sergio A. Albeverio, et al. Pure & 
Appl. Math., V. 175. Marcel Dekker, 1993, viii 
+ 190 pp, $99.75. [ISBN 0-8247-9131-2] 


Calculus, S?(13). Calculus Laboratories with 
Mathematica, Volume 1. Michael G. Kerck- 
hove, Van C. Nall. McGraw-Hill, 1993, 
40 pp, (P). [ISBN 0-07-034220-2] 13 com- 
puter-based exercises (notebooks) with mini- 
mal Mathematica documentation; supplements 
standard Calculus I (some exercises refer to 
Calculus and Analytic Geometry, Fifth Edition, 
by Stein and Barcellos). JNC 


Calculus, T(13: 3). Calculus, Fifth Edition. 
Stanley I. Grossman. Saunders College, 1992, 
xx + 1142 pp, $68. [ISBN 0-03-096420- 
2] Multi-colored tome includes initial review 
chapter, early treatment of trigonometric func- 
tions; features more calculator use, more fig- 
ures, ten new historical essays. (Third Edition, 
TR, January 1985.) JNC 


Complex Analysis, P. Theory of Entire and 
Meromorphic Functions: Deficient and Asymptotic 
Values and Singular Directions. Zhang 
Guan-Hou. Transl. of Math. Mono., V. 122. 
AMS, 1993, xi + 375 pp, $182. [ISBN 0-8218- 
4589-6] 


Complex Analysis, P. Carleman’s Formulas 
in Complex Analysis: Theory and Applications. 
Lev Aizenberg. Math. & Its Applic., V. 244. 
Kluwer Academic, 1993, xx + 299 pp, $137. 
[ISBN 0-7923-2121-9] 


Complex Analysis, P. Entire and Subhar- 
monic Functions. Ed: B. Ya. Levin. Adv. in 
Soviet Math., V. 11. AMS, 1992, vii + 275 pp, 
$147. [ISBN 0-8218-4110-6] 15 papers from 
the Research Seminar on the Theory of Func- 
tions at Kharkov University. 


Complex Analysis, P. The Cauchy Method of 
Residues, Volume 2: Theory and Applications. 
Dragoslav S. Mitrinović, Jovan D. Kečkić. 
Math. & Its Applic., V. 259. Kluwer Academic, 
1993, x + 191 pp, $86.50. [ISBN 0-7923-2311- 
4] 

Differential Equations, P. Averaging in Sta- 
bility Theory: A Study of Resonance Multi- 
Frequency Systems. M.M. Hapaev. Math. & 
Its Applic., V. 79. Kluwer Academic, 1993, xiii 
+ 279 pp, $145. [ISBN 0-7923-1581-2] 


Differential Equations, P. Differential In- 
clusions in Nonsmooth Mechanical Problems: 
Shocks and Dry Friction. Manuel D.P. Mon- 
teiro Marques. Progress in Nonlinear Diff. Eqts. 
& Their Applic., V. 9. Birkhäuser, 1993, x + 
179 pp, $69. [ISBN 0-8176-2900-9] 


Differential Equations, S*. Differential and 
Integral Equations through Practical Problems 
and Exercises. Gheorghe Micula, Paraschiva 
Pavel. Texts in Math. Sci., V. 7. Kluwer 
Academic, 1992, ix + 395 pp, $139. [ISBN 
0-7923-1890-0] Hundreds of problems (with 
solutions) on first-order ODE’s and PDE’s, lin- 
ear ODE’s, Laplace transforms, integral equa- 
tions, numerical methods. Nice resource. SK 


Differential Equations, P. Ordinary and Par- 
tial Differential Equations, Volume IV. Eds: 
B.D. Sleeman, R.J. Jarvis. Pitman Res. Notes 
in Math. Ser., V. 289. Longman Scientific & 
Technical (US Distr: Wiley), 1993, 292 pp, 
$44.95 (P). [ISBN 0-582-09137-3] Proceedings 
of 1992 University of Dundee conference. 


Differential Equations, P. Applications of Li- 
apunov Methods in Stability. A. Halanay, V. 
Rasvan. Math. & Its Applic., V. 245. Kluwer 
Academic, 1993, xi + 237 pp, $115. [ISBN 
0-7923-2120-0] 

Differential Equations, P. Asymptotic Prop- 
erties of Solutions of Nonautonomous Ordinary 
Differential Equations. I.T. Kiguradze, T.A. 
Chanturia. Math. & Its Applic., V. 89. Kluwer 
Academic, 1993, xiv + 331 pp, $154. [ISBN 
0-7923-2059-X] 


Differential Equations, P. Introduction to 
the General Theory of Singular Perturbations. 
S.A. Lomov. Transl. of Math. Mono., V. 112. 
AMS, 1992, xviii + 375 pp, $201. 
[ISBN 0-8218-4569-1] 


Differential Equations, P. Nevanlinna Theory 
and Complex Differential Equations. Ilpo 
Laine. Stud. in Math., V. 15. Walter de Gruyter, 
1993, viii + 341 pp, DM 154. [ISBN 3-11- 
013422-5] 


Partial Differential Equations, P. Hypo- 
Analytic Structures: Local Theory. François 
Treves. Math. Ser., V. 40. Princeton Univ Pr, 
1992, xvii + 497 pp, $65. [ISBN 0-691-08744- 
X] 


Partial Differential Equations, P. Nonlin- 
ear Stochastic Evolution Problems in Applied 
Sciences. N. Bellomo, Z. Brzezniak, L.M. de 
Socio. Math. & Its Applic., V. 82. Kluwer 
Academic, 1992, xiv + 219 pp, $93.50. [ISBN 
0-7923-2042-5] 


Partial Differential Equations, P. Typical Sin- 
gularities of Differential 1-Forms and Pfaffian 
Equations. Michail Zhitomirskiĭ. Transl. of 
Math. Mono., V. 113. AMS, 1992, xi + 176 pp, 
$116. [ISBN 0-8218-4567-5] 


Partial Differential Equations, P. Develop- 
ments in Partial Differential Equations and Ap- 
plications to Mathematical Physics. Eds: G. 
Buttazzo, G.P. Galdi, L. Zanghirati. Plenum 
Pr, 1992, viii + 246 pp, $69.50. [ISBN 0-306- 
44311-2] Proceedings of a 1991 conference 
in Ferrara, Italy. 


Partial Differential Equations, P. Diffusion 
Equations. Seizô Itô. Transl: Seizô Itô. Transl. 
of Math. Mono., V. 114. AMS, 1992, x + 
225 pp, $93. [ISBN 0-8218-4570-5] 


Partial Differential Equations, P. Systems 
of Evolution Equations with Periodic and 
Quasiperiodic Coefficients. Yu. A. Mitropol- 
sky, A.M. Samoilenko, D.I. Martinyuk. Math. 
& Its Applic., V. 87. Kluwer Academic, 1993, 
xiv + 280 pp, $126. [ISBN 0-7923-2054-9]


Partial Differential Equations, P. Nonlinear 
Potential Theory of Degenerate Elliptic Equations.
Juha Heinonen, Tero Kilpeläinen, Olli
Martio. Oxford Math. Mono. Clarendon Pr, 
1993, v + 363 pp, $70. [ISBN 0-19-853669-0] 


Partial Differential Equations, P. Second In- 
ternational Conference on Mathematical and 
Numerical Aspects of Wave Propagation. Eds: 
Ralph Kleinman, et al. SIAM, 1993, xi + 
473 pp, $69.50 (P). [ISBN 0-89871-318-8] 
Proceedings of a 1993 conference at the University
of Delaware.


Partial Differential Equations, P. Floquet 
Theory for Partial Differential Equations. Pe- 
ter Kuchment. Oper. Theory: Adv. & Applic., 
V. 60. Birkhäuser, 1993, xiv + 350 pp, $108.50.
[ISBN 0-8176-2901-7] 


Partial Differential Equations, P. The 
Method of Newton's Polyhedron in the Theory 
of Partial Differential Equations. S. Gindikin, 
L.R. Volevich. Math. & Its Applic., V. 86. 
Kluwer Academic, 1992, x + 266 pp, $126. 
[ISBN 0-7923-2037-9] 


Partial Differential Equations, P. Algorithms 
for Elliptic Problems: Efficient Sequential and 
Parallel Solvers. Marián Vajteršic. Math. & Its
Applic., V. 58. Kluwer Academic, 1993, xviii 
+ 292 pp, $152. [ISBN 0-7923-1918-4] 


Partial Differential Equations, P. The So- 
lution of a One-Dimensional Stefan Problem. 
C. Vuik. CWI Tract, V. 90. Centrum voor 
Wiskunde en Informatica, 1993, 134 pp, Dfl. 
40 (P). [ISBN 90-6196-419-9] 


Partial Differential Equations, P. Degenerate 
Elliptic Equations. Serge Levendorskii. Math. 
& Its Applic., V. 258. Kluwer Academic, 1993, 
xi + 431 pp, $186.50. [ISBN 0-7923-2305-X] 


Partial Differential Equations, P. Elliptic
Boundary Problems for Dirac Operators.
Bernhelm Booß-Bavnbek, Krzysztof P.
Wojciechowski. Math.: Theory & Applic.
Birkhäuser, 1993, xviii + 307 pp, $49.50.
[ISBN 0-8176-3681-1] 


Dynamical Systems, P. Continuum Theory 
and Dynamical Systems. Ed: Thelma West. 
Lect. Notes in Pure & Appl. Math., V. 149. 
Marcel Dekker, 1993, viii + 296 pp, $115 (P).
[ISBN 0-8247-9072-3] 19 papers, some ex- 
pository, from a conference-workshop at the 
University of Southwestern Louisiana. 


Dynamical Systems, P. Analytic D-Modules 
and Applications. Jan-Erik Björk. Math. & Its
Applic., V. 247. Kluwer Academic, 1993, xiii 
+ 581 pp, $245. [ISBN 0-7923-2114-6] 


Dynamical Systems, P. Hamiltonian Me- 
chanical Systems and Geometric Quantization. 
Mircea Puta. Math. & Its Applic., V. 260. 
Kluwer Academic, 1993, viii + 278 pp, $110. 
[ISBN 0-7923-2306-8] 


Dynamical Systems, S(15-16), P, L. The 
Dynamics of Ambiguity. Giuseppe Caglioti. 
Springer-Verlag, 1992, xxi + 170 pp, $59. 
[ISBN 0-387-52020-1] A slick, coffee-table- 
style book on symmetry and symmetry breaking 
in art and science. Translated from Italian into 
occasionally cumbersome English. BC 
Numerical Analysis, P. Knot Insertion and
Deletion Algorithms for B-Spline Curves and 
Surfaces. Eds: Ronald N. Goldman, Tom Ly- 
che. SIAM, 1993, xiii + 197 pp, $43.50 (P). 
[ISBN 0-89871-306-4] 7 papers providing 
modern perspectives on knot insertion and dele- 
tion algorithms for B-splines. AO 


Numerical Analysis, P. Acta Numerica 1993. 
Cambridge Univ Pr, 1993, 326 pp, $44.95. 
[ISBN 0-521-44356-3] 6 survey papers on path
following, multivariate piecewise polynomials, 
numerical linear algebra, fluid dynamics, and 
multigrid methods. 


Numerical Analysis, T*(14-15), L. Elemen- 
tary Numerical Computing with Mathematica. 
Robert D. Skeel, Jerry B. Keiper. McGraw- 
Hill, 1993, xiv + 434 pp, $60.16. [ISBN 0- 
07-057820-6] Interesting alternative to stan- 
dard treatments. First two chapters treat error 
analysis and floating-point arithmetic. Remain- 
der covers rootfinding, linear systems, interpo- 
lation, least-squares, numerical integration and 
differentiation, and ordinary differential equa- 
tions. Appendices introduce Mathematica, sur- 
vey basic mathematical topics. Authors con- 
tend that using Mathematica skirts program- 
ming problems, so reveals more mathematics. 
Worth a look. MPR 


Numerical Analysis, P. Guaranteed Accuracy 
in Numerical Linear Algebra. S.K. Godunov, 
et al. Math. & Its Applic., V. 252. Kluwer 
Academic, 1993, xi + 535 pp, $242. [ISBN 
0-7923-2352-1] 

Numerical Analysis, P. Convergence of Itera- 
tions for Linear Equations. Olavi Nevanlinna. 
Lect. in Math. Birkhäuser, 1993, vii + 177 pp,
$29 (P). [ISBN 0-8176-2865-7] 

Operator Theory, P. Generalized Vertex Alge- 
bras and Relative Vertex Operators. Chongy- 
ing Dong, James Lepowsky. Progress in Math., 
V. 112. Birkhäuser, 1993, ix + 202 pp, $44.50.
[ISBN 0-8176-3721-4] 


Operator Theory, P. Continuous and Discrete 
Fourier Transforms, Extension Problems and 
Wiener-Hopf Equations. Ed: I. Gohberg. Oper.
Theory, V. 58. Birkhäuser, 1992, viii + 214 pp,
$73. [ISBN 0-8176-2809-6] 7 papers, 4 based 
on talks from a September 1991 workshop at the 
University of Maryland. 


Functional Analysis, P. White Noise: An In- 
finite Dimensional Calculus. Takeyuki Hida, 
et al. Math. & Its Applic., V. 253. Kluwer 
Academic, 1993, xiii + 516 pp, $198. [ISBN 
0-7923-2233-9] 

Functional Analysis, P. Positive Operators 
and Semigroups on Banach Lattices. Eds: C.B. 
Huijsmans, W.A.J. Luxemburg. Kluwer Academic,
1992, vii + 152 pp, $92. [ISBN 0-
7923-1964-8] Proceedings of the Caribbean 
Mathematics Foundation’s 1990 conference. 
Reprinted from Acta Applicandae Mathematicae,
Vol. 27, Nos. 1-2 (1992).

Functional Analysis, P. Applied Theory of 
Functional Differential Equations. V. Kol- 
manovskii, A. Myshkis. Math. & Its Applic., 
V. 85. Kluwer Academic, 1992, xv + 234 pp, 
$99. [ISBN 0-7923-2013-1] 

Functional Analysis, P. Fourier Integrals 
in Classical Analysis. Christopher D. Sogge. 
Tracts in Math., V. 105. Cambridge Univ Pr, 
1993, x + 237 pp, $39.95. [ISBN 0-521-43464- 
5] 

Functional Analysis, P. Functional Inte- 
grals: Approximate Evaluation and Applica- 
tions. A.D. Egorov, P.I. Sobolevsky, L.A. 
Yanovich. Math. & Its Applic., V. 249. Kluwer 
Academic, 1993, x + 418 pp, $172. [ISBN 
0-7923-2193-6] 

Functional Analysis, P. Invariant Func- 
tion Spaces on Homogeneous Manifolds of Lie 
Groups and Applications. M.L. Agranovskii. 
Transl. of Math. Mono., V. 126. AMS, 1993, x 
+ 131 pp, $71. [ISBN 0-8218-4604-3] 


Functional Analysis, P. Index Theory and 
Operator Algebras. Eds: Jeffrey Fox, Peter 
Haskell. Contemp. Math., V. 148. AMS, 1993, 
vii + 190 pp, $41 (P). [ISBN 0-8218-5152- 
7] Proceedings of the 1991 CBMS Regional 
Conference at the University of Colorado. 


Analysis, P. Lectures on Hermite and Laguerre 
Expansions. Sundaram Thangavelu. Math. 
Notes, V. 42. Princeton Univ Pr, 1993, xv + 
195 pp, $22.50 (P). [ISBN 0-691-00048-4] 

Analysis, P. Difference Equations and 
Their Applications. A.N. Sharkovsky, Yu. L. 
Maistrenko, E. Yu. Romanenko. Transl: D.V. 
Malyshev, P.V. Malyshev, Y.M. Pestryakov. 
Math. & Its Applic., V. 250. Kluwer Aca- 
demic, 1993, xii + 358 pp, $152. [ISBN 0- 
7923-2194-4] Modern theory, with many new 
results. Four parts: one-dimensional dynamical
systems; difference equations with continuous
time; differential-difference equations;
boundary-value problems for hyperbolic systems
of PDE’s. AO


Analysis, P. Commensurabilities among Lat- 
tices in PU(1,n). Pierre Deligne, G. Daniel 
Mostow. Annals of Math. Stud., No. 132. 
Princeton Univ Pr, 1993, 183 pp, $19.95 (P); 
$49.95. [ISBN 0-691-00096-4; 0-691-03385- 
4] 

Combinatorial Geometry, P. New Trends in 
Discrete and Computational Geometry. Ed: 
János Pach. Algorithms & Combinatorics,
V. 10. Springer-Verlag, 1993, xi + 339 pp, $89.
[ISBN 0-387-55713-X] 12 survey papers. 


Algebraic Geometry, P. Cyclic Homology. 
Jean-Louis Loday. Grund. der math. Wissen- 
schaften, B. 301. Springer-Verlag, 1992, xvii + 
454 pp, $149. [ISBN 0-387-53339-7] 
Algebraic Geometry, P, L. Mathematical 
Methods for CAD. J.J. Risler. Cambridge Univ 
Pr, 1992, 196 pp, $69.95; $29.95 (P). [ISBN 0-521-43100-X;
0-521-43691-5] Introduction to B-splines
for curves and surfaces and their applications.
Uses tensor products, triangular Bézier
patches; Bézier curves are treated as special
cases of B-splines. Also treats algebraic prob- 
lems, including properties of polynomials via 
formal computation. No exercises. MPR 


Algebraic Geometry, P. Mapping Class 
Groups and Moduli Spaces of Riemann Surfaces.
Eds: Carl-Friedrich Bödigheimer,
Richard M. Hain. Contemp. Math., V. 150. 
AMS, 1993, xx + 372 pp, $51 (P). [ISBN 
0-8218-5167-5] Joint proceedings of work- 
shops held in 1991 at the University of 
Göttingen and the University of Washington.


Algebraic Geometry, P. Complex Analy- 
sis and Geometry. Eds: Vincenzo Ancona, 
Alessandro Silva. Univ. Ser. in Math. Plenum 
Pr, 1993, xvi + 412 pp, $85. [ISBN 0-306- 
44179-9] 16 papers reviewing recent research 
on the role of complex function theory in alge- 
braic geometry, analytic geometry of manifolds 
and spaces, and differential geometry. Also in- 
cludes a list of open problems. 

Algebraic Geometry, P. Algebraic Functions. 
Kenkichi Iwasawa. Transl: Goro Kato. Transl. 
of Math. Mono., V. 118. AMS, 1993, xxii + 
287 pp, $131. [ISBN 0-8218-4595-0] 
Algebraic Geometry, P. Nilpotence and Pe- 
riodicity in Stable Homotopy Theory. Dou- 
glas C. Ravenel. Annals of Math. Stud., 
No. 128. Princeton Univ Pr, 1992, xiv + 209 pp, 
$24.95 (P); $69.50. [ISBN 0-691-02572-X; 0- 
691-08792-X] 


Algebraic Geometry, P. Introduction to Toric 
Varieties. William Fulton. Annals of Math. 
Stud., No. 131. Princeton Univ Pr, 1993, xi 
+ 157 pp, $16.95 (P); $32.50. [ISBN 0-691-
00049-2; 0-691-03332-3]

Algebraic Geometry, P. Tangents and Secants 
of Algebraic Varieties. F.L. Zak. Transl. of 
Math. Mono., V. 127. AMS, 1993, vii + 164 pp, 
$96. [ISBN 0-8218-4585-3] 

Differential Geometry, P. Einstein Metrics 
and Yang-Mills Connections. Eds: Toshiki 
Mabuchi, Shigeru Mukai. Lect. Notes in Pure & 
Appl. Math., V. 145. Marcel Dekker, 1993, viii 
+ 224 pp, $99.75 (P). [ISBN 0-8247-9069-3] 
Proceedings of the 27th Taniguchi International
Symposium (Sanda, Japan, 1990). 
Differential Geometry, P. Constrained Me- 
chanics and Lie Theory. Robert Hermann. 
Interdisc. Math., V. 27. Math Sci Pr, 1992, 
288 pp, $95. [ISBN 0-915692-43-0] Nice 
introduction to modern differential geometric 
tools for constrained mechanical and variational 
systems. Basic tools are the theory of algebraic 
structure of vector fields defined by the Jacobi- 
Lie bracket and the structure of affine connec- 
tions in subbundles of the tangent bundle, all de- 
veloped using modern language and ideas. Also 
presents a modern treatment of Lie’s exploita- 
tion of the symmetries of differential equations. 
Of interest to mathematicians working in me- 
chanics, robotics, biomechanics. SP 


Differential Geometry, P. Minimal Surfaces. 
Ed: A.T. Fomenko. Adv. in Soviet Math., V. 15. 
AMS, 1993, ix + 342 pp. [ISBN 0-8218-4116- 
5] 10 papers from the seminar on modern geo- 
metrical methods at Moscow State University. 


Differential Geometry, P. Harmonic Maps 
and Minimal Immersions with Symmetries: 
Methods of Ordinary Differential Equations Ap- 
plied to Elliptic Variational Problems. James 
Eells, Andrea Ratto. Annals of Math. Stud., 
V. 130. Princeton Univ Pr, 1993, 228 pp, 
$19.95 (P); $49.50. [ISBN 0-691-10249-X; 0- 
691-03321-8] 


Differential Geometry, P. Projective Differen- 
tial Geometry of Submanifolds. M.A. Akivis, 
V.V. Goldberg. Math. Lib., V. 49. North- 
Holland (US Distr: Elsevier Science), 1993, xi 
+ 362 pp, $128.50. [ISBN 0-444-89771-2] 
Differential Geometry, P. Differential Geometry.
Eds: Robert Greene, S.T. Yau. Proc.
of Symposia in Pure Math., V. 54, Parts 1-3. 
AMS, 1993 [ISBN 0-8218-1493-1] set. Par- 
tial Differential Equations on Manifolds, Part 
1, xxii + 560 pp, $89 [ISBN 0-8218-1494-X]; 
Geometry in Mathematical Physics and Related 
Topics, Part 2, xxii + 655 pp, $96 [ISBN 0- 
8218-1495-8]; Riemannian Geometry, Part 3, 
xxii + 710 pp, $103. [ISBN 0-8218-1496-6] 
Proceedings of a 1990 AMS Summer Research 
Institute. 


Differential Geometry, P. Nonlinear Poisson 
Brackets: Geometry and Quantization. M.V. 
Karasev, V.P. Maslov. Transl. of Math. Mono., 
V. 119. AMS, 1993, xi + 366 pp, $170. [ISBN 
0-8218-4596-9]

Algebraic Topology, P. Algebraic Topology: 
Oaxtepec 1991. Ed: Martin C. Tangora. Contemp.
Math., V. 146. AMS, 1993, xviii +
481 pp, $71 (P). [ISBN 0-8218-5162-4] Proceedings
of the July 1991 conference.
Algebraic Topology, P. Algebraic L-Theory
and Topological Manifolds. A.A. Ranicki. 
Tracts in Math., V. 102. Cambridge Univ Pr, 
1992, 358 pp, $69.95. [ISBN 0-521-42024-5] 
Algebraic L-theory is the algebraic K-theory of
quadratic forms; it relates a manifold topology 
to its homotopy type (dimensions > 5). SK 


Algebraic Topology, P. Differential Algebras 
in Topology. David Anick. Res. Notes in Math., 
V. 3. AK Peters, 1993, xxv + 274 pp, $49.50. 
[ISBN 1-56881-001-6] 


Topology, P. Embeddings and Immersions. 
Masahisa Adachi. Transl: Kiki Hudson. Transl. 
of Math. Mono., V. 124. AMS, 1993, x + 
183 pp, $103. [ISBN 0-8218-4612-4] 


Topology, P. The Selected Works of J. Frank 
Adams, Volumes I-II. Eds: J.P. May, C.B. 
Thomas. Cambridge Univ Pr, 1992, $74.95 
each. Volume I, xvi + 536 pp [ISBN 0-521-
41063-0]; Volume II, xvi + 529 pp. [ISBN 0-
521-41065-7] 


Mathematical Modeling, P. Fisheries: Con- 
trol and Management via Application of Man- 
agement Science (with a Comprehensive An- 
notated Bibliography of Applications). Eds: 
Bruce L. Golden, Edward A. Wasil. Amer. J. 
of Math. & Management Sci., V. 12, Nos. 2 
& 3. American Sciences Pr, 1992, 150 pp, 
$98.75 (P). [ISBN 0-935950-34-6] 5 papers 
on recent work. 

Mathematical Modeling, P. Forecasting the 
Health of Elderly Populations. Eds: Kenneth 
G. Manton, Burton H. Singer, Richard M. Suz- 
man. Ser. in Stat. Springer-Verlag, 1993, x 
+ 371 pp, $59. [ISBN 0-387-97953-0] 15 
survey papers on methodological issues, fore- 
casting techniques, effects of interventions on 
health care costs, and longitudinal research. 


Control Theory, P. Identification and Control
in Systems Governed by Partial Differential
Equations. Eds: H.T. Banks, R.H. Fabiano,
K. Ito. SIAM, 1993, ix + 234 pp,
$48.50 (P). [ISBN 0-89871-317-X] Proceed- 
ings of a 1992 AMS-IMS-SIAM Joint Summer 
Research Conference at Mt. Holyoke College. 


Control Theory, P. Lecture Notes in Con- 
trol and Information Sciences-187: RoManSy 9. 
Eds: A. Morecki, G. Bianchi, K. Jaworek. 
Springer-Verlag, 1993, xxxi + 438 pp, $74 (P). 
[ISBN 0-387-19834-2] Proceedings of the 9th 
CISM-IFToMM Symposium on Theory and 
Practice of Robots and Manipulators (Udine, 
Italy, 1992). 

Stochastic Processes, P. Schrödinger Equations
and Diffusion Theory. Masao Nagasawa.
Mono. in Math., V. 86. Birkhäuser, 1993,
319 pp, $99. [ISBN 0-8176-2875-4] 
Stochastic Processes, P. Some Aspects of
Brownian Motion, Part I: Some Special Functionals.
Marc Yor. Lect. in Math. Birkhäuser,
1992, 136 pp, $24.50 (P). [ISBN 0-8176-2807- 
X] 

Stochastic Processes, P. Sojourns and Ex- 
tremes of Stochastic Processes. Simeon M. 
Berman. Stat. & Prob. Ser. Wadsworth, 1992, 
xiv + 300 pp, $59.95. [ISBN 0-534-13932-9] 


Stochastic Processes, P. Doeblin and Mod- 
ern Probability. Ed: Harry Cohn. Contemp. 
Math., V. 149. AMS, 1993, xii + 347 pp, 
$49 (P). [ISBN 0-8218-5149-7] Proceedings 
of the 1991 conference “50 Years after Doeblin: 
Developments in the Theory of Markov Chains, 
Markov Processes, and Sums of Random Vari- 
ables” held at Blaubeuren, Germany. 


Stochastic Processes, P. Gaussian Processes. 
Takeyuki Hida, Masuyuki Hitsuda. Transl. of 
Math. Mono., V. 120. AMS, 1993, xv + 183 pp, 
$99. [ISBN 0-8218-4568-3] 


Stochastic Processes, P. Contributions to 
Stochastics. Ed: N. Venugopal. Wiley East- 
ern, 1992, xvi + 216 pp, Rs. 350. [ISBN 81- 
224-0417-0] 15 papers honoring K. Nagab- 
hushanam. 


Statistical Methods, P. Drug Safety Assess- 
ment in Clinical Trials. Ed: Gene Sogliero- 
Gilbert. Stat.: Textbooks & Mono., V. 138. Mar- 
cel Dekker, 1993, x + 437 pp, $135. [ISBN 0- 
8247-8893-1] 15 papers on interpretation and 
analysis of safety data. 


Statistical Methods, P. Nonparametric Meth- 
ods in Change-Point Problems. B.E. Brodsky, 
B.S. Darkhovsky. Math. & Its Applic., V. 243. 
Kluwer Academic, 1993, xi + 209 pp, $90. 
[ISBN 0-7923-2122-7]

Statistics, T(13: 1), C. Beginning Statistics: 
A to Z. William Mendenhall. Duxbury Pr, 
1993, xii + 525 pp, disk included [ISBN 0-534- 
19122-3]; Flash Cards. [ISBN 0-534-19122-3] 
Introductory text stresses concepts, statistical 
analyses, and their interpretation; downplays 
probability and hand calculations. Many examples
and exercises stem from newspapers and
research journals. Three appendices: data sets
(available on disk), statistical tables, answers
to selected exercises. Includes flash cards on
statistical terminology and concepts. KB


Computer Organization, P. The Cache Mem- 
ory Book. Jim Handy. Academic Pr, 1993, xviii 
+ 269 pp. [ISBN 0-12-322985-5] 

Computer Systems, P. MS-DOS Command 
Summary for Versions 5 & 6. SSC, 1993, 32 pp, 
(P). [ISBN 0-916151-64-6] 

Computer Systems, P. Motif Reference Manual,
Volume Six B. Paula M. Ferguson, David
Brennan. O’Reilly & Assoc, 1993, xii + 908 pp,
$34.95 (P). [ISBN 1-56592-038-4] 


Computer Systems, P. Learning the Korn 
Shell. Bill Rosenblatt. O’Reilly & Assoc, 1993,
xxii + 338 pp, $27.95 (P). [ISBN 1-56592-054- 
6] 


Computer Systems, P. PEXlib Programming
Manual: 3D Programming in X. Tom Gaskins.
O’Reilly & Assoc, 1992, xlv + 1105 pp,
$44.95 (P). [ISBN 1-56592-028-7]


Computer Systems, P. X Toolkit Intrinsics 
Programming Manual for Version 11, Volume 
Four. Adrian Nye, Tim O’Reilly. O’Reilly 
& Assoc, 1992, xxxvi + 567 pp, $34.95 (P). 
[ISBN 1-56592-003-1] 


Computer Systems, P. Software Portability 
with imake. Paul DuBois. O’Reilly & Assoc, 
1993, xxiii + 365 pp, $27.95 (P). [ISBN 1- 
56592-055-4] 


Computer Systems, P. !%@:: A Directory
of Electronic Mail Addressing and Networks. 
Donnalyn Frey, Rick Adams. O’Reilly & As- 
soc, xvi + 443 pp, $24.95 (P). [ISBN 1-56592- 
031-7] 


Computer Systems, P. X Window System 
User's Guide, Volume Three: OSF/Motif 1.2 
Edition. Valerie Quercia, Tim O’Reilly. 
O’Reilly & Assoc, 1993, xxxii + 924 pp, 
$34.95 (P). [ISBN 1-56592-015-5] 


Computer Systems, P. Connecting to the 
Internet: A Buyer’s Guide. Susan Estrada.
O’Reilly & Assoc, 1993, xv + 170 pp, 
$15.95 (P). [ISBN 1-56592-061-9] 


Computer Systems, P. Learning the UNIX 
Operating System. Grace Todino, John Strang, 
Jerry Peek. O’Reilly & Assoc, 1993, xv + 92 pp,
$9.95 (P). [ISBN 1-56592-060-0] 


Computer Systems, P. PEXlib Reference 
Manual: 3D Programming in X. Ed: Steve Tal- 
bott. O’Reilly & Assoc, 1992, xxv + 551 pp, 
$34.95 (P). [ISBN 1-56592-029-5] 


Computer Graphics, S*(16-17), P, L. Ra- 
diosity and Realistic Image Synthesis. Michael 
F. Cohen, John R. Wallace. Academic Pr, 1993, 
xvi + 382 pp, $49.95. [ISBN 0-12-178270-0]
A complete presentation of tools for modern 
image synthesis. Radiosity improves the older 
technique of ray tracing. Ray tracing is viewer 
(and pixel) dependent, but radiosity is deter- 
mined by an overall energy balance equation 
and so is viewer independent. In effect, ra- 
diosity takes into account indirect illumination 
of the scene. Requires substantial mathematics 
(e.g., differential geometry, numerical analysis, 
measure theory), but much is developed in context.
54 color plates. MPR


Computer Graphics, S, C. Modern Image 
Processing: Warping, Morphing, and Classi- 
cal Techniques. Christopher D. Watkins, Al- 
berto Sadun, Stephen Marenka. Academic Pr, 
1993, xix + 234 pp, $49.95 (P), disk included. 
[ISBN 0-12-737860-X] Elementary introduc- 
tion to filtering, image enhancement, and geo- 
metric transformations. Rather brief on mathe- 
matics behind these techniques, but surveys the 
subject. Includes a disk of code fragments (in 
C), and a short chapter on applications. MPR 


Theory of Computation, P. Formal Specifica- 
tion and Design. L.M.G. Feijs, H.B.M. Jonkers. 
Tracts in Theoret. Comput. Sci., V. 35. Cam- 
bridge Univ Pr, 1992, xvi + 335 pp, $44.95. 
[ISBN 0-521-43457-2] 


Artificial Intelligence, P. Computational
Learning & Cognition: Proceedings of the
Third NEC Research Symposium. Ed: Eric
B. Baum. SIAM, 1993, xi + 276 pp, $42.50. 
[ISBN 0-89871-311-0] 


Computer Science, P. Computer Benchmarks. 
Eds: Jack J. Dongarra, Wolfgang Gentzsch. 
Adv. in Parallel Comput., V. 8. North-Holland 
(US Distr: Elsevier Science), 1993, xiv + 
349 pp, $125.75. [ISBN 0-444-81518-X] 21 
papers on performance prediction and mea- 
surement, compiler and database benchmarks, 
and techniques for benchmarking shared- and 
distributed-memory parallel computers. 


Computer Science, P. Virtual Reality: Appli- 
cations and Explorations. Ed: Alan Wexelblat. 
Academic Pr, 1993, xviii + 245 pp, $39.95. 
[ISBN 0-12-745045-9] 10 essays on virtual 
reality in computer science, the arts, etc. 

Computer Science, P. Lectures on Parallel 
Computation. Eds: Alan Gibbons, Paul Spi- 
rakis. Intern. Ser. on Parallel Comput., V. 4. 
Cambridge Univ Pr, 1993, 437 pp, $49.95. 
[ISBN 0-521-41556-X] 15 papers on parallel 
computation, especially efficiency issues. 


Computer Science, P. Proceedings of the 
Sixth SIAM Conference on Parallel Processing 
for Scientific Computing, Volumes I-II. Eds: 
Richard F. Sincovec, et al. SIAM, 1993, $95 (P)
[ISBN 0-89871-315-3]; Volume I, xix + 504 pp;
Volume II, xix + 543 pp. 


Computer Science, P. Internet: Mailing Lists, 
1993 Edition. Eds: Edward T.L. Hardie, Vivian 
Neou. SRI Internet Inform. Ser. Prentice Hall, 
1993, x + 356 pp, $26 (P). [ISBN 0-13-327941- 
3] 

Computer Science, P. Proof Theory. Eds: 
Peter Aczel, Stanley S. Wainer. Cambridge 
Univ Pr, 1992, x + 306 pp, $49.95. [ISBN 0-
521-41413-X] 3 expository, 7 research papers
from the 1990 Summer School and Conference 
on Proof Theory at Leeds University. 


Applications (Biological Science), P. Ad- 
vances in Computer Methods for Systematic Bi- 
ology: Artificial Intelligence, Databases, Com- 
puter Vision. Ed: Renaud Fortuner. Johns Hop- 
kins Univ Pr, 1993, xiv + 560 pp, $65. [ISBN 0- 
8018-4492-4] Proceedings of the ARTISYST 
workshop, an interdisciplinary conference on 
the application of modern computer methods to 
research in systematic biology, held at Napa, 
California, in 1990. 


Applications (Fluid Dynamics), P. Transonic
Aerodynamics: Problems in Asymptotic
Theory. Ed: L. Pamela Cook. Frontiers in 
Appl. Math., V. 12. SIAM, 1993, x + 90 pp, 
$26.50 (P). [ISBN 0-89871-310-2] 5 papers 
originally presented at a minisymposium at the 
second ICIAM meeting held in July 1991 in 
Washington, DC. 

Applications (Fluid Dynamics), T(17-18: 1), 
P. Numerical Modeling in Combustion. Ed: 
T.J. Chung. Ser. in Comput. & Physical Pro- 
cesses in Mech. & Thermal Sci. Taylor & 
Francis, 1993, xxiv + 549 pp, $195. [ISBN 0- 
89116-822-2] Numerical methods for react- 
ing flow problems, especially combustion prob- 
lems. Sections on laminar flows, turbulent 
flows, spray combustion. AO 

Applications (Fluid Dynamics), S(16-18), 
P. Bénard Cells and Taylor Vortices. E.L. 
Koschmieder. Mono. on Mech. & Appl. Math. 
Cambridge Univ Pr, 1993, x + 337 pp, $64.95. 
[ISBN 0-521-40204-2] Research on problems
of Bénard convection, related areas of hydro- 
dynamics. Graphs and photographs are almost 
as common as mathematical expressions. The 
bulk of the text is a very readable, informal, 
descriptive discussion. MU 


Applications (Fluid Dynamics), P. Theory 
of Laminar Film Condensation. Tetsu Fujii. 
Springer-Verlag, 1991, xviii + 213 pp, $69. 
[ISBN 0-387-97541-1] 

Applications (Physics), P. Quantum Field 
Theory. Lowell S. Brown. Cambridge Univ 
Pr, 1992, xiv + 543 pp, $100. [ISBN 0-521- 
406066] 


Reviewers 


KB: Karla Ballman, Macalester; JNC: Judith N. Ceder- 
berg, St. Olaf; LC: Laura Chihara, St. Olaf; BC: Barry 
Cipra, St. Olaf; JD-B: John Dyer-Bennet, Carleton; SG: 
Steven Galovich, Carleton; DH: Deanna Haunsperger, 
St. Olaf; SK: Steve Kennedy, St. Olaf; AO: Arnold Os- 
tebee, St. Olaf; SP: Samuel Patterson, Carleton; MPR: 
Matthew P. Richey, St. Olaf; MU: Milton Ulmer, Carleton; 
MW: Martha Wallace, St. Olaf. 
Essential Mathematics from Cambridge


Numerical Mathematics 


—A Laboratory Approach 


The late S. Breuer and G. Zwas 
1993 280pp. 44040-8 Hardcover $49.95 


The Higher Arithmetic 
Sixth Edition 


H. Davenport 
1992 217pp. 41998-0 Hardcover $44.95 
42227-2 Paper $19.95 


Primes and 
Programming
Computers and Number Theory 


P. J. Giblin 
1993 245pp. 40182-8 Hardcover $44.95 
40988-8 Paper $19.95 


A Course of Pure 
Mathematics 
Tenth Edition 

G. H. Hardy 


Cambridge Mathematical Library 
1993 522 pp. 09227-2 Paper $22.95 


Elementary Theory of 
L-functions and 
Eisenstein Series 


Haruzo Hida 

London Mathematical Society 

Student Texts 26 

1993 398pp. 43411-4 Hardcover $69.95 
43569-2 Paper $29.95 


Hyperbolic Geometry 


Birger Iversen 

London Mathematical Society 

Student Texts 25 

1993 312pp. 43508-0 Hardcover $54.95 
43528-5 Paper $22.95 


Available in bookstores or from 


CAMBRIDGE 


UNIVERSITY PRESS 


Representations and 
Characters of Groups 
Gordon James 


and Martin Liebeck 
1993 429pp. 44024-6 Hardcover $69.95 
44590-6 Paper $29.95 


Exercises in 
Fourier Analysis 
T. W. Körner


1993 395pp. 43276-6 Hardcover $54.95 
43849-7 Paper $22.95


Computational 
Geometry in C 


Joseph O’Rourke 
1994 320pp. 44034-3 Hardcover $59.95 
44592-2 Paper $24.95 


Hilbert Space 
Compact Operators and 
the Trace Theorem 
J. R. Retherford 


London Mathematical Society 

Student Texts 27 

1993 143pp. 41884-4 Hardcover $44.95 
42933-1 Paper $19.95 


Elementary Probability 


David Stirzaker 
1994 500pp. 42028-8 Hardcover $64.95 
42183-7 Paper $24.95 


How To Prove It 
A Structured Approach 


Daniel J. Velleman 
1994 304pp. 44116-1 Hardcover $49.95 
44663-5 Paper $19.95 


40 West 20th St., N.Y., NY 10011-4211 
Call toll-free 800-872-7423 
MasterCard/VISA accepted. 

Prices subject to change. 


Proofs Without Words 
Exercises in Visual Thinking 


Roger B. Nelsen 


Just what are “proofs without words?” First of all, 
most mathematicians would agree that they cer- 
tainly are not “proofs” in the formal sense. Indeed, 
the question does not have a simple answer. Proofs
without words are generally pictures or diagrams 
that help the reader see why a particular mathematical
statement may be true, and how one could
begin to go about proving it. While in some proofs 
without words an equation or two may appear to 
help guide that process, the emphasis is clearly on 
providing visual clues to stimulate mathematical 
thought. Proofs without words bear witness to the 
observation that often in the English language to 
see means to understand, as in “to see the point of an
argument.” 


Proofs without words have a long history. In this 
collection you will find modern renditions of proofs 
from ancient China, classical Greece, twelfth-cen- 
tury India—even one based on a published proof 
by a former President of the United States! How- 
ever, most of the proofs are more recent creations, 
and many are taken from the pages of MAA jour- 
nals. 


The proofs in this collection are arranged by topic
into six chapters: Geometry and Algebra; Trigonometry,
Calculus and Analytic Geometry; Inequalities;
Integer Sums; Sequences and Series; and
Miscellaneous. Teachers will find that many of the
proofs in this collection are well suited for classroom
discussion and for helping students to think
visually in mathematics.

The readers of this collection will find enjoyment
in discovering or rediscovering some elegant visual
demonstrations of certain mathematical ideas that
teachers will want to share with their students.
Readers may even be encouraged to create new
“proofs without words.”

160 pp., Paperbound, 1993
ISBN 0-88385-700-6
List: $27.50 MAA Member: $22.00
Catalog Number PWW

ORDER FROM:
The Mathematical Association of America
1529 Eighteenth Street, NW
Washington, DC 20036
1-(800) 331-1622 Fax (202) 265-2384

Name
Address
City
State Zip Code
Total $
Payment: [ ] Check [ ] VISA [ ] MASTERCARD
Credit Card No.
Signature
Exp. Date


Was This The Last Time You Bought Insurance?

Face it, it’s been a long time. A lot has changed since
then. Your family. Maybe your job. And more than likely, the
amount and types of coverage you need from your insurance
program. That’s why you need insurance that can easily adapt
to the way your life changes: MAA Group Insurance Program.

We Understand You.
Finding an insurance program that’s right for you isn’t easy.
But as a member of MAA, you don’t have to go through the
difficult and time consuming task of looking for the right
plans; we’ve done that work for you. What’s more, you can be
sure the program is constantly being evaluated to better meet
the needs of our members.

We’re Flexible.
Updating your insurance doesn’t have to be a hassle. With our
plans, as your needs change, so can your coverage. Insurance
through your association is designed to grow with you; it even
moves with you when you change jobs.

We’re Affordable.
What good would all these benefits be if no one could afford
them? That’s why we offer members the additional benefit of
reasonable rates, negotiated using our group purchasing power.
Call 1 800 424-9883 (in Washington, D.C., (202) 457-6820)
between 8:30 a.m. and 5:30 p.m. Eastern Time for more
information about these insurance plans offered through MAA:

Term Life • Disability Income Protection • Comprehensive
HealthCare • Excess Major Medical • In Hospital • High-Limit
Accident

MAA Insurance
Designed for the way you live today. And tomorrow.


This plan is administered by Seabury & Smith. 


Essays in Humanistic 
Mathematics 


Alvin White, Editor 


A dazzling array of essayists reveal humanistic 
mathematics in this volume, and in so doing go 
beyond the facts, formulas, and algorithms that most 
students associate with mathematics to a presenta- 
tion of mathematics as an intellectual discipline with 
a human perspective and a significant history. Hu- 
manistic mathematics challenges dogmatic teach- 
ing styles that expect students to parrot the lecturer. 
It demands creativity from both the teacher and 
student. 

Teaching mathematics humanistically seeks to 
place the student more centrally in the position of 
inquirer than is generally the case, while at the same 
time acknowledging the emotional climate of the 
activity of learning mathematics. This type of 
teaching encourages students to learn from each 
other and to better understand mathematics as so- 
cially constructed knowledge, rather than as an 
arbitrary discipline. 

Teaching humanistic mathematics brings the fo- 
cus less upon the nature of the teaching and learning 
environment and more upon the need to reconstruct 
the curriculum and the discipline of mathematics 
itself. This reconstruction relates mathematical 
discoveries to personal courage, discovery to verifi-
cation, mathematics to science, truth to utility, and
mathematics to the culture in which it is embedded.

The humanistic mathematics movement, which 
began as the personal vision of a few, has now 
become a major part of mathematical culture. What 
was viewed with skepticism is now accepted and 
expected. Humanistic mathematics is not a new 
discovery. It is a recent rediscovery of ideas that go
back to Plato. It has provided a vocabulary for 
previously unarticulated concepts and approaches. 

The essays in this volume illustrate and help to 
define humanistic mathematics. The variety and 
scope indicate the richness and fruitfulness of the 
concept. Although each essay is independent, a 
sense of unity emerges. A glimpse at the table of 
contents will give you an idea of the excitement and 
range of the ideas presented. 


212 pp., Paperbound, 1993 
ISBN 0-88385-089-3 

List: $24.00 

Catalog Number NTE-32 


Graphing Calculator Workshops and Short Courses for College Faculty
Organized by Frank Demana and Bert Waits of The Ohio State University 


We are seeking host sites for graphing 
calculator workshops or short courses 
to be given in the spring, summer, and 
fall of 1994 and 1995. We have a 
network of dynamic college instructors 
available to present one, one and one 
half, or two-day workshops or three-day
to one-week “short courses” on using 
Texas Instruments graphing calcula- 
tors to enhance the teaching and learn- 
ing of college mathematics. Topics 
may be selected from developmental 
mathematics through calculus and dif- 
ferential equations. Participants will 
receive extensive “hands-on” instruc- 
tion from experienced colleagues and 
will receive many useful self study ma- 
terials. We have five summer 1994 
short courses scheduled. 




A SOURCE BOOK FOR 
COLLEGE MATHEMATICS 
TEACHING 


Alan Schoenfeld, Editor. 
Prepared by the Committee on the 
Undergraduate Teaching of Mathematics 


Do you want a broader, deeper, more suc-


COMMENTS


“I have been invited to discuss further the subject which you had under consideration last
year, namely, the problem of turning Doctors of Philosophy into efficient teachers. There are 
some of you, I see by the printed reports of your earlier meeting, who regard the ability to 
teach as a natural gift. There are apparently many of you who are persuaded that courses in 
pedagogy cannot contribute materially to the improvement of a graduate of a university. In 
the face of such settled views, backed up by the success that many of you have obtained in 
the teaching profession, it seems to be a bold and from some point of view a useless 
undertaking to come before you as I must with the assertion that ability to teach is not a 
natural gift and that every Doctor of Philosophy would be improved by a careful considera- 
tion in a scientific and historical way of the problems of education... . 

“Until very recently university and college organizations have been based on the opinion 
expressed by some of you that teaching cannot be made a subject of special study and 
instruction. Gradually, however, a change is being worked out before our eyes. In spite of 
opposition and indifference, courses in education are being organized even in the most 
conservative institutions... . 

“The first fact which I wish to point out is that there is a great deal of poor teaching 
within our universities and colleges, and this fact can be traced to a neglect upon the part of 
academic men and women of the form in which they arrange their material... . 

“It is widely recognized that such a neglect of form as I have indicated appears in much 
of our university lecturing. The assumption of the ordinary university lecturer is that if he 
presents a certain body of material so organized that it seems to him to be fairly coherent 
and logical in its character, it is a matter of small moment whether it appeals to the students 
because of its literary form or whether it is easily intelligible to them because of its careful
adjustment to their present stage of development... . 

“The elective system has in part overcome this attitude and there is a very much more 
general conviction on the part of college instructors now than there was a generation ago 
that the demands of the student that the material shall be presented in clear and coherent 
form should be met. An instructor who is in competition with the other members of his own 
department for students in his course is likely to recognize the importance of preparing his 
material as clearly as possible for presentation to his class... . 

“As matters now stand, whatever the ideal held by the university world in regard to the
high calling of the doctor in the field of pure research, the fact is that the great majority of 
the doctors now turned out in this country are nominated and pushed by their respective 
departments, for teaching positions, which they must fill successfully or be counted as 
failures by all who measure the ratio of results accomplished to tasks undertaken... . 
Success in teaching is well-nigh indispensable.”


Professor Charles Hubbard Judd 


Director, University of Chicago Education School 
as reported in The November Monthly, 16 (1909)


Pizza Slicing, Phi’s, and the Riemann
Hypothesis 


Edward A. Bender, Oren Patashnik, and Howard Rumsey, Jr. 


1. INTRODUCTION. Place a unit-square pizza so that its lower-left corner is at 
the origin and its upper-right corner is at the point (1, 1). Run a pizza knife along 
all lines having an integer y-intercept and a positive-integer slope that doesn’t 
exceed N. How many slices S(N) are formed, and how many sides might a slice
have? 




Figure 1. The 2, 4, and 8 slices formed when N = 1, 2, and 3. 


FIGURE 1 shows that when N = 1 the only relevant line is y =x, and two 
triangular slices are formed. When N = 2 there are two additional lines, y = 2x 
and y = 2x — 1, and there are now four triangular slices (henceforth called simply 
“triangles”). When N = 3 there are eight slices—six triangles and two quadrilaterals.
FIGURE 2 shows the 22 triangles and 14 quadrilaterals for N = 6.


Figure 2. The 36 slices (22 triangles, 14 quadrilaterals) when N = 6.

David desJardins, who posed the general problem, also asked in particular
whether the slices are always triangles or quadrilaterals. This paper answers “yes”
to that question, giving formulas for the number of each that involve a double sum
of Euler’s $\varphi$-function, and it shows that asymptotically there are $N^3/\pi^2$ slices in
all (half triangles, half quadrilaterals), with an error term that involves the
Riemann hypothesis. Along the way, the paper gives a fairly complete history of
the somewhat controversial asymptotics of $\sum_{1 \le k \le N} \varphi(k)$.


2. ESTIMATING SLICES. First some terminology. From now on a slice refers to 
a region of the unit square, rather than the corresponding part of the pizza. A 
segment is the part of one of our lines lying in the unit square; we’ll talk about a 
segment as if it were of the form y = Ax — B, even though technically it is the 
corresponding line that’s of that form. And $\mathcal{S}$ is the set of segments having
nonzero length. So $S(N)$ is just the number of slices into which the set $\mathcal{S}$
partitions the unit square.

To get a crude upper bound on $S(N)$ we start with an easy question: How many
segments are in $\mathcal{S}$? One of slope 1, two of slope 2, and so on, hence
$\sum_{k=1}^{N} k = N(N + 1)/2 = O(N^2)$ segments total. Now it’s well known that $l$ straight
lines partition the plane into at most $1 + l(l + 1)/2 = O(l^2)$ regions. (There are exactly
that many regions, it turns out, if the lines are in general position; that is, if no two
lines coincide or are parallel, and if no three lines intersect at a point.) To deduce
the $O(l^2)$ bound, notice that when we add a $k$th (distinct) line to a collection of
$k - 1$ lines in the plane, the number of additional regions formed is one more than
the number of distinct intersection points the new line makes with the old lines;
since there are at most $k - 1$ such points, there are at most $k$ additional regions,
hence after we’ve added $l$ lines there are at most $1 + \sum_{k=1}^{l} k$ regions, as claimed.
(Jacob Steiner [14] proved this formula in the early 19th century; Graham et al.
[4, Chapter 1] discuss it, along with some generalizations.) Therefore, since the
number of $l$-line regions in the plane is an upper bound for the number of
$l$-segment slices in the unit square, the $O(N^2)$ segments of $\mathcal{S}$ yield $O(N^4)$ slices.

But $O(N^4)$ is not a very good bound, because the segments of $\mathcal{S}$ aren’t even
close to being in general position—some segments (all $k$ of slope $k$) are parallel,
and many can have a mutual intersection point (there are $\lceil N/2 \rceil$ segments passing
through the point $(\tfrac12, \tfrac12)$, for example).

We can improve the bound on S(N) by a factor of N, using one more 
observation: A segment of slope k intersects at most one of the segments of slope 
j, for each j < k, because the projection onto the x axis of the segment of slope k 
is properly contained in the projection of a segment of slope j that it intersects, 
and because the projections of the j segments of slope j are pairwise disjoint. So 
when we start with the empty square and add the segment of slope 1, then the 
segments of slope 2, then slope 3, and so on, a segment of slope k intersects at 
most k — 1 other segments and thus increases the number of slices by at most k. 
Hence collectively the segments of slope k add at most $k^2$ slices, giving at most
$\sum_{k=1}^{N} k^2 = N^3/3 + O(N^2)$ slices total. (As noted in the introduction, $S(N)$’s
actual leading term has $\pi^2$ in place of 3 in the denominator.)
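
To get a feel for how far apart the two bounds of this section are, here is a minimal sketch that simply evaluates them. The function names are ours, and the sketch counts the empty square as one initial slice, so its improved bound is one more than the sum of squares.

```python
def plane_region_bound(lines):
    """Steiner's bound: l lines cut the plane into at most 1 + l(l+1)/2 regions."""
    return 1 + lines * (lines + 1) // 2


def crude_slice_bound(n):
    """Treat all N(N+1)/2 segments as if they were lines in general position."""
    segments = n * (n + 1) // 2
    return plane_region_bound(segments)


def improved_slice_bound(n):
    """One initial slice, plus at most k^2 new slices for the segments of slope k."""
    return 1 + sum(k * k for k in range(1, n + 1))


for n in (1, 2, 3, 6):
    print(n, crude_slice_bound(n), improved_slice_bound(n))
```

For N = 6 the two bounds are 232 and 92, against the 36 slices of Figure 2.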


3. COUNTING SLICES. We now derive an exact expression for $S(N)$. Let $\varphi(k)$
be Euler’s totient function—the number of integers in the interval $[0..k)$ that are
prime to $k$—and let $\Phi$ be the sum of $\varphi$:

$$\Phi(x) = \sum_{1 \le k \le x} \varphi(k). \tag{1}$$

The exact expression for $S(N)$ involves the sum of $\Phi$:

$$S(N) = 1 + \sum_{1 \le k \le N} \Phi(k), \quad \text{for } N \ge 0. \tag{2}$$

To derive this expression, we start as before with the empty square, which 
comprises a single slice, and we add the segments in order of increasing slope. 
We'll see that the k segments of slope k collectively increase the slice count by 
$\Phi(k)$, from which Equation (2) follows.
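
Equation (2) is easy to evaluate directly. The following is a minimal sketch (the function names are ours, and the totient is computed naively from its definition rather than by any fast method) that reproduces the slice counts quoted for Figures 1 and 2.

```python
from math import gcd


def phi(k):
    """Euler's totient: how many integers in [0, k) are prime to k."""
    return sum(1 for j in range(k) if gcd(j, k) == 1)


def big_phi(x):
    """Phi(x): the sum of phi(k) over 1 <= k <= x (x need not be an integer)."""
    return sum(phi(k) for k in range(1, int(x) + 1))


def slices(n):
    """S(N) = 1 + sum of Phi(k) over 1 <= k <= N, as in Equation (2)."""
    return 1 + sum(big_phi(k) for k in range(1, n + 1))


# The figures quote 2, 4, and 8 slices for N = 1, 2, 3, and 36 slices for N = 6.
print([slices(n) for n in (1, 2, 3, 6)])
```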

The lines-in-the-plane argument above shows that when we add a single 
segment to the unit square, the slice count increases by one more than the number 
of points of intersection, excluding points on the boundary, that the segment 
makes with previous segments. Equivalently, if we define our square to be $[0..1)^2$
and define the segments of $\mathcal{S}$ accordingly, and if we consider the segment of
slope 0 along the x axis to be in $\mathcal{S}$ (and to be present in the square before others
are added), then the slice count increases by exactly the number of points of
intersection with previous segments. Henceforth we adopt this more precise
definition of $\mathcal{S}$ and its segments.

We'll make use of the following fact, about the existence in $\mathcal{S}$ of a certain
segment that has smaller slope than a given segment.


Useful Fact A. If a segment $y = Ax - B$ in $\mathcal{S}$ passes through a point $p$ whose
x coordinate is a rational number $q/r$, where $r \le A$, then the segment
$y = (A - r)x - (B - q)$ is in $\mathcal{S}$ and passes through $p$.


Proof: Equating y’s and solving for x shows that the two lines in question meet at
a point whose x coordinate is $q/r$, and therefore at $p$ itself. That
$y = (A - r)x - (B - q)$ is a segment in $\mathcal{S}$ follows, because $0 \le A - r < A$ (and
because the segment passes through the square). ∎
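
The "equating y’s" step can be written out in one line; the numerical instance that follows is our own choice of segment and point, picked only for illustration.

```latex
\begin{aligned}
Ax - B &= (A - r)x - (B - q) \\
\iff\quad rx &= q \\
\iff\quad x &= q/r .
\end{aligned}
```

For example, the segment y = 3x − 1 passes through p = (1/2, 1/2), whose x coordinate is q/r = 1/2 with r = 2 smaller than A = 3. The fact then produces y = (3 − 2)x − (1 − 1) = x, which indeed passes through p.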


There’s an analogous fact for the existence in $\mathcal{S}$ of a segment of larger slope.


Useful Fact B. If a segment $y = Ax - B$ in $\mathcal{S}$ passes through a point $p$ whose
x coordinate is a rational number $q/r$, where $r \le N - A$, then the segment
$y = (A + r)x - (B + q)$ is in $\mathcal{S}$ and passes through $p$.


Proof: Again, the two lines in question meet at $p$. And the segment
$y = (A + r)x - (B + q)$ is in $\mathcal{S}$, because $0 < A + r \le N$. ∎


So look at what happens to the slice count when we add the $A$ segments of
slope $A$, each of the form $y = Ax - B$, for $0 \le B < A$. We need to show that
these new segments collectively form $\Phi(A)$ intersection points with the old
segments, which have smaller slope, and we do this by examining possible
x coordinates. Let $q/r$ be the x coordinate, in reduced form, of the intersection of
a new segment, say $y = Ax - B$, with an old segment, say $y = ax - b$. (There may
be more than one such old segment.) Since

$$\frac{q}{r} = \frac{B - b}{A - a},$$

we have $1 \le r \le A$. Furthermore, corresponding to this x coordinate $q/r$ there is
among the segments of slope $A$ a unique y coordinate (it is $Aq/r \bmod 1$) and a
unique $B$ (it is $\lfloor Aq/r \rfloor$), and therefore just one such intersection point. Conversely,
for any reduced fraction $q/r$ in the interval $[0..1)$, with $r \le A$, there exists an old
segment whose intersection point with the uniquely determined new segment has
x coordinate $q/r$, by Useful Fact A. So there’s a one-to-one correspondence
between intersection points and reduced fractions $q/r$ in $[0..1)$ having $r \le A$, of
which there are $\sum_{r=1}^{A} \varphi(r) = \Phi(A)$, completing the proof of Equation (2). We defer
the asymptotics until Section 7. Incidentally, that set of reduced fractions, together
with 1/1, constitutes what is called the Farey series of order $A$; Hardy and Wright
[5] devote a full chapter to Farey series.
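
The one-to-one correspondence can also be checked by brute force for small slopes. The sketch below is only an illustration (the helper names are ours): it uses exact rational arithmetic to intersect every segment of slope A with every segment of smaller slope inside $[0..1)^2$, with the slope-0 segment along the x axis included, and compares the number of distinct points with $\Phi(A)$.

```python
from fractions import Fraction
from math import gcd


def big_phi(n):
    """Phi(n): the number of reduced fractions j/k in [0..1) with k <= n."""
    return sum(1 for k in range(1, n + 1) for j in range(k) if gcd(j, k) == 1)


def intercepts(slope):
    """Values of B for which y = slope*x - B has a segment in the square [0..1)^2."""
    return [0] if slope == 0 else list(range(slope))


def new_intersection_points(a_new):
    """Distinct points where segments of slope a_new meet segments of smaller slope."""
    points = set()
    for b_new in intercepts(a_new):
        for a_old in range(a_new):
            for b_old in intercepts(a_old):
                x = Fraction(b_new - b_old, a_new - a_old)
                y = a_new * x - b_new
                if 0 <= x < 1 and 0 <= y < 1:  # the point lies in the half-open square
                    points.add((x, y))
    return points


for a in range(1, 9):
    print(a, len(new_intersection_points(a)), big_phi(a))
```

The x coordinates of the points found for slope A are exactly the reduced fractions in $[0..1)$ with denominator at most A, that is, the Farey series of order A without 1/1.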


4. SLICE SHAPES. This section proves several properties of the slices, among 
them, that only triangles and quadrilaterals occur, and that a quadrilateral’s four 
vertices have at most three distinct x coordinates. We start with the first of the 
two. 

Let L be the lower-left vertex of a slice (all slopes are nonnegative, hence every
slice has a vertex that’s both lowest and leftmost) and let R be its upper-right
vertex, as shown in FIGURE 3. Proceed counterclockwise from L to R around the
lower boundary of the slice, letting the intermediate vertices encountered be
$L_1, \ldots, L_k$, and letting the segments encountered (between consecutive vertices)
be $y = A_i x - B_i$, for $i = 0, \ldots, k$. Notice that the maximum slope of a segment
through $L_i$ is $A_i$, and the minimum slope is $A_{i-1}$. We’ll see that the sequence of
vertices encountered is either $L, R$ or $L, L_1, R$, so that there are at most two sides
on the lower boundary. Moreover, since a rotation by 180° is a symmetry of the
sliced-up square, the upper boundary, too, must have at most two sides, hence the
whole slice has at most four.


Figure 3. A general slice.

First we handle the slice whose lower boundary includes a side along the line 
x = 1, which isn’t of the form y = Ax — B. But that slice is easy—it has exactly 
two sides on its lower boundary (when N = 0 the slice is the entire square, 
otherwise it’s a triangle). 

For any other slice, focus on an intermediate vertex $L_i$. Its x coordinate is
$q_i/r_i$, not necessarily in reduced form, where $q_i = B_i - B_{i-1}$ and $r_i = A_i - A_{i-1}$.
Now suppose that the smaller slope is pretty big: $A_{i-1} \ge N/2$. Then
$r_i \le N - N/2 = N/2$, hence $r_i \le A_{i-1}$. But by Useful Fact A there’s a segment of slope
$A_{i-1} - r_i$ through $L_i$, contradicting the minimality of $A_{i-1}$. Thus $A_{i-1} < N/2$,
giving an upper bound on the smaller slope. Similarly, suppose that the larger
slope is pretty small: $A_i \le N/2$. Then $r_i \le N/2$, hence $r_i \le N - A_i$, and by Useful
Fact B there’s a segment of slope $A_i + r_i$ through $L_i$, contradicting the maximality
of $A_i$. Thus $A_i > N/2$, giving a lower bound on the larger slope. Therefore $L_2$
can’t exist, otherwise we’d have both $A_1 < N/2$ and $A_1 > N/2$. So the lower
boundary has at most two sides and the whole slice has at most four, as claimed.

That proof also points out a few other properties of every quadrilateral: Each of 
the four vertices meets one side of small slope—less than N/2—and one side of 
large slope—greater than N/2; each small-slope side is clockwise from an inter- 
mediate vertex; each large-slope side is counterclockwise; the two sides at an 
intermediate vertex differ in slope by more than N/3 (otherwise Useful Fact A or 
B would produce a segment passing through the quadrilateral), and the larger 
slope is more than twice the smaller; and there is no side in the quadrilateral of 
slope exactly N/2.

Finally, here’s a proof that the vertices of a quadrilateral have at most three 
distinct x coordinates. Define an aberrant slice to be one having a vertical side. 
For the aberrant quadrilateral, which occurs for N = 0, there are just two distinct 
x coordinates. 

For a non-aberrant quadrilateral, as in FIGURE 4, let the upper-boundary slopes
be $a_0 > a_1$, let the lower-boundary slopes be $A_0 < A_1$, let the upper-boundary
intermediate vertex have x coordinate $q/r$, where $r = a_0 - a_1$, and let the
lower-boundary vertices, counterclockwise, have x coordinates $x_0$, $x_1$, and $x_2$.
Rotating by 180° if necessary, we may assume that $a_0 - a_1 \le A_1 - A_0$. Now if
$x_1 < q/r < x_2$, as in FIGURE 4, then by Useful Fact A there is a segment of slope
$A_1 - r$, where $A_0 \le A_1 - r < A_1$, passing through a point $p$ on the side having
slope $A_1$; but that would endow our quadrilateral with a new side. Similarly, if
$x_0 < q/r < x_1$ then by Useful Fact B there is a segment of slope $A_0 + r$, where
$A_0 < A_0 + r \le A_1$, passing through the side having slope $A_0$, again giving a
contradiction. Thus $q/r = x_1$, and the quadrilateral has just three x coordinates,
as claimed.


Figure 4. A non-aberrant quadrilateral.


5. COUNTING TRIANGLES. This section counts the number of triangles $T(N)$,
showing that

$$T(N) = 2 + 2 \sum_{1 \le k \le N} \Phi(k/2), \quad \text{for } N \ge 1. \tag{3}$$

(Notice that by definition (1), the value $\Phi(k/2)$ is perfectly well defined, even for
odd $k$.)


How can the addition of a segment of largest slope divide a slice? 


• If the slice is a triangle, the segment either passes through two sides,
producing a triangle and a quadrilateral (FIGURE 5A), or through a vertex and
a side, producing two triangles (FIGURE 5B).

• If the slice is the aberrant quadrilateral (the entire square), the segment
produces two triangles (FIGURE 5C).

• If the slice is a non-aberrant quadrilateral, the three-x-coordinate property
severely restricts the segment: It must pass through one of the vertices with
repeated x coordinate, as well as one of the sides, producing a triangle and a
quadrilateral (FIGURE 5D). Passing through two vertices is impossible from its
slope, and passing through two sides would produce either a pentagon or a
quadrilateral with four distinct x coordinates.


Figure 5. How a largest-slope segment (dotted) can divide a slice.


So the triangle-count increase for the segments of slope N equals the number of
times one of these new segments passes through a vertex of an old slice that it
splits. How many such vertex/slice occurrences are there? Consider the set of
previous intersection points (between two or more old segments) through which
the segments of slope N pass. Notice that each such point in the interior of the
square accounts for two vertex/slice occurrences—one each for the old slices
below and to the left, and above and to the right. The same accounting, however,
applies to the previous intersection points on the square’s boundary (they are,
using our convention that $[0..1)^2$ is the unit square, all on the line y = 0):
Corresponding to a previous intersection point (B/N, 0), where $1 \le B < N$, is an
old slice above and to the right, which is split by the segment y = Nx − B, as well
as an old slice below and to the left of the point (B/N, 1)—a point that’s not
counted, according to our convention—split by the segment y = Nx − (B − 1).
And corresponding to the previous intersection point (0, 0), for $N \ge 2$, are two
aberrant old slices—one with a side on the y-axis that’s above and to the right of
(0, 0), and one with a side on the line x = 1 that’s below and to the left of (1, 1),
another point not counted, according to our definition. Thus for $N \ge 2$ the
triangle-count increase is twice $P(N)$, where $P(N)$ is the number of previous
intersection points through which the segments of slope N pass. And for N = 1
there are no previous intersection points, so P(1) = 0, but the triangle-count
increase is two (the two aberrant triangles formed by the line y = x). Thus the
total number of triangles satisfies

$$T(N) = 2 + 2 \sum_{1 \le k \le N} P(k), \quad \text{for } N \ge 1, \tag{4}$$

and we’re left with determining $P(N)$.

When is a point on a new segment a previous intersection point? Suppose that a
new segment $y = Nx - B$ and two old segments $y = a_i x - b_i$, for $i = 1, 2$, have a
mutual intersection point. Its x coordinate is rational, say $q/r$ in reduced form.
Now $c_i r = N - a_i$, for some positive integer $c_i$, since $q/r$ is just $(B - b_i)/(N - a_i)$
in reduced form. Furthermore $c_1 \ne c_2$, since $a_1 \ne a_2$. So $\max(c_1, c_2) \ge 2$, hence
$r \le N/2$. Conversely, if segment $y = Nx - B$ passes through a point $p$ whose
x coordinate is a rational $q/r$, with $r \le N/2$, then both $N - r$ and $N - 2r$ are
nonnegative. So by Useful Fact A the segments $y = (N - r)x - (B - q)$ and
$y = (N - 2r)x - (B - 2q)$ are also in $\mathcal{S}$ and pass through $p$, hence $p$ is a
previous intersection point. Thus we have the answer: A point on a new segment is
a previous intersection point if and only if its x coordinate is a rational number
that, in reduced form, is $q/r$ with $r \le N/2$. Since there are $\Phi(N/2)$ such points,
$P(N) = \Phi(N/2)$, and equation (4) implies formula (3) for the number of triangles
$T(N)$. That in turn determines the number of quadrilaterals $Q(N)$, which is
$S(N) - T(N)$:

$$Q(N) = -1 + \sum_{1 \le k \le N} \bigl(\Phi(k) - 2\Phi(k/2)\bigr), \quad \text{for } N \ge 1. \tag{5}$$
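
Formulas (2), (3), and (5) can be checked against the counts quoted for the figures. The following is a minimal sketch with function names of our own choosing and a naive totient; it also searches for the first N at which quadrilaterals outnumber triangles.

```python
from math import gcd


def phi(k):
    """Euler's totient: how many integers in [0, k) are prime to k."""
    return sum(1 for j in range(k) if gcd(j, k) == 1)


def big_phi(x):
    """Phi(x): the sum of phi(k) over 1 <= k <= x (x may be a half-integer)."""
    return sum(phi(k) for k in range(1, int(x) + 1))


def slices(n):
    """S(N), Equation (2)."""
    return 1 + sum(big_phi(k) for k in range(1, n + 1))


def triangles(n):
    """T(N), Equation (3)."""
    return 2 + 2 * sum(big_phi(k / 2) for k in range(1, n + 1))


def quadrilaterals(n):
    """Q(N) = S(N) - T(N), Equation (5)."""
    return slices(n) - triangles(n)


def first_quadrilateral_majority(limit=100):
    """Smallest N <= limit with Q(N) > T(N), or None if there is none."""
    for n in range(1, limit + 1):
        if quadrilaterals(n) > triangles(n):
            return n
    return None


print(triangles(3), quadrilaterals(3))   # Section 1 quotes 6 triangles, 2 quadrilaterals
print(triangles(6), quadrilaterals(6))   # Figure 2 quotes 22 triangles, 14 quadrilaterals
print(first_quadrilateral_majority())    # Section 7 reports N = 33
```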


6. THE SURFACE OF CHOICE—A TORUS. Previous discussion was somewhat 
messy, with all the aberrations and boundaries and boundary points, suggesting 
that things would be cleaner without the boundaries. We eliminate them by using a 
torus, as outlined below. 

Curl the square into a cylinder, so that the square’s top edge, along the line 
y = 1, and its bottom edge, along the line y = 0, coincide, and so that the square’s 
left and right edges, along the lines x = 0 and x = 1, form circles at the ends of 
the cylinder. Now wrap the cylinder around, joining ends, so that those two circles 
coincide. The resulting surface is our torus. 

The two aberrant triangles of our square, for $N \ge 1$, have now become a single
quadrilateral; but that’s the only difference in slice counts and “shapes” between
the two surfaces. Thus for $N \ge 1$ the additive constants in the expressions for the
number of triangles (3), quadrilaterals (5), and total slices (2) disappear.

Moreover, looking at the problem on the torus reveals an interesting phenomenon.
Suppose that, before wrapping the cylinder, we first give one end $N_0$
full twists ($2\pi N_0$ radians). Now when we join ends it’s as if we had used lines with
slopes between $N_0$ and $N_0 + N$ instead of between 0 and N—we get the same
slice counts in both cases. (The analogue in the plane is to use a parallelogram of
unit area whose left edge is still the interval $[0..1)$ along the y axis but whose right
edge is now the interval $[N_0..N_0 + 1)$ along the line x = 1.)

Mathematically the torus is a more natural surface for our problem than the 
square; it is a more difficult surface conceptually, however, which is why our work 
has been in the square. 


7. ASYMPTOTICS. Finally we take a peek at the asymptotics of the quantities
involved. A good estimate for $S(N)$ follows easily from one for $\Phi(N)$, whose long
but somewhat confused and controversial history is summarized in the next four
paragraphs.

In 1849 Dirichlet [2] determined the leading term, showing that
$\Phi(N) = 3N^2/\pi^2 + O(N^{c+\epsilon})$, where, in terms of the inverse of Riemann’s zeta function,
$c$ is $\zeta^{-1}(2) \approx 1.73$, and where $\epsilon$ is any positive constant. (The literature sometimes
misstates Dirichlet’s error term.) In 1874 Franz Mertens [10] improved the error
term, defined by

$$E_\Phi(N) = \Phi(N) - 3N^2/\pi^2, \tag{6}$$

from Dirichlet’s $O(N^{1.73\ldots})$ to $O(N \log N)$; the proof is pretty easy and appears
in Hardy and Wright [5, page 268] and Graham et al. [4, Section 9.3, Problem 6].

Those two 19th-century estimates gave upper bounds on the error. S. S. Pillai
and S. D. Chowla [12] in 1930 gave a super-linear lower bound, showing that
$E_\Phi(N)$ is not $o(N \log\log\log N)$; that is, there exists a positive constant $c$ such
that $|E_\Phi(N)| > cN \log\log\log N$ for infinitely many values of N. Along similar
lines, Paul Erdős and Harold N. Shapiro [3] in 1951 showed that there exists a
positive constant $c$ such that both $E_\Phi(N) > cN \log\log\log\log N$ for infinitely
many values of N and $E_\Phi(N) < -cN \log\log\log\log N$ for infinitely many values
of N. (The error $E_\Phi$ was once conjectured to be positive for all N.)

So the question, in essence, became how many logs to throw onto the error. In
1953 Arnold Walfisz [19] lowered the upper bound, showing that
$E_\Phi(N) = O(N (\log N)^{3/4} (\log\log N)^2)$. He improved that bound further, reducing the pair of
exponents from (4,2) to (2, 3) in a 1958 paper [20], and then to (4, =) in a 1962 
book [21, Chapter 4]. But—and this is the controversy—the book neglected to 
mention that in 1958 N.-M. Korobov [7] improved on Walfisz’s 1958 result, 
obtaining the exponents (2, y), for some positive constant y; or that in 1960 
A. I. Saltykov [13] obtained ($\tfrac{2}{3}$, $1 + \epsilon$), showing that

$$\Phi(N) = \frac{3N^2}{\pi^2} + O\bigl(N (\log N)^{2/3} (\log\log N)^{1+\epsilon}\bigr), \quad \text{all constants } \epsilon > 0. \tag{7}$$

We may never know why Walfisz, who died a month after he finished his book, 
omitted the Korobov and Saltykov results; perhaps he was unaware of them. But 
there was, at the time, a full-fledged dispute on a related topic. It was between, on 
the one hand, Korobov [8] and I. M. Vinogradov [17], who claimed without proof 
that a certain region of the $\zeta$ function was zero free, and on the other hand,
Walfisz, who said in the notes to Chapter 5 of his book that neither he nor 
Hans-Egon Richert could verify the Korobov/Vinogradov claim. And nobody else 
has been able to verify it. Despite requests like A. E. Ingham’s [6] for Korobov and 
Vinogradov to either prove or retract their claim, they apparently have not done 
so. Korobov’s recent book [9] doesn’t address the issue, and Vinogradov’s book [18] 
repeats without proof the unsubstantiated claim. Perhaps this related dispute led 
Walfisz to omit mention of the Korobov and/or Saltykov bounds on $E_\Phi(N)$;
neither bound, however, used Korobov’s unproven claim (even though Saltykov, 
who seems to have been a student of Korobov, did use proved results from 
Korobov’s paper [8]). Hence notwithstanding the prevalence in the literature of 
Walfisz’s 1962 result, Saltykov’s upper bound (7) on $E_\Phi$ is undisputed and is the
best to date. 

The best lower bound is due to Hugh L. Montgomery [11], who in 1987 replaced
Erdős and Shapiro’s $\log\log\log\log N$ factors with $(\log\log N)^{1/2}$. Thus there
remains a gap between these two best bounds, which in essence are $N (\log\log N)^{1/2}$
and $N (\log N)^{2/3} (\log\log N)^{1+\epsilon}$. And that’s a brief history of $\Phi$.

For our purposes here, any of the upper bounds mentioned above will do.
Summing Mertens’s version, $\Phi(N) = 3N^2/\pi^2 + O(N \log N)$, gives

$$\begin{aligned}
S(N) &= 1 + \sum_{1 \le k \le N} \Phi(k) \\
     &= \frac{3}{\pi^2} \sum_{1 \le k \le N} k^2 + O(N^2 \log N) \\
     &= \frac{N^3}{\pi^2} + O(N^2 \log N).
\end{aligned} \tag{8}$$


So the number of slices is asymptotic to $N^3/\pi^2$, as claimed. Furthermore
Equations (3) and (8) show that the number of triangles is

$$\begin{aligned}
T(N) &= 2 + 2 \sum_{1 \le k \le N} \Phi(k/2) \\
     &= 4 \sum_{1 \le k \le N/2} \Phi(k) + O(\Phi(N/2)) \\
     &= \frac{N^3}{2\pi^2} + O(N^2 \log N).
\end{aligned}$$

Thus asymptotically the number of triangles, and therefore the number of
quadrilaterals, is half the number of slices. That is,

$$S(N) \sim \frac{N^3}{\pi^2}, \qquad T(N) \sim \tfrac12 S(N), \qquad Q(N) \sim \tfrac12 S(N). \tag{9}$$

Incidentally, for small positive N there are more triangles $T(N)$ than quadrilaterals
$Q(N)$, and we might conjecture that that’s true for all positive N. But it’s not;
$Q(N)$ first exceeds $T(N)$ at N = 33.
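
The asymptotic statements in (9) can be watched numerically. This is only a sketch under our own naming (a naive totient table and prefix sums), printing the ratio of $S(N)$ to $N^3/\pi^2$ and the fraction of slices that are triangles.

```python
from math import gcd, pi


def big_phi_table(n):
    """Return a list whose entry k is Phi(k), for 0 <= k <= n."""
    table = [0] * (n + 1)
    for k in range(1, n + 1):
        totient = sum(1 for j in range(k) if gcd(j, k) == 1)
        table[k] = table[k - 1] + totient
    return table


def counts(n):
    """Return (S(N), T(N), Q(N)) from Equations (2), (3), and (5)."""
    big = big_phi_table(n)
    s = 1 + sum(big[1 : n + 1])
    t = 2 + 2 * sum(big[k // 2] for k in range(1, n + 1))  # Phi(k/2) = Phi(floor(k/2))
    return s, t, s - t


for n in (10, 50, 200):
    s, t, q = counts(n)
    print(n, s, s / (n**3 / pi**2), t / s)
```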

What’s curious about the number of slices, asymptotically, is the accuracy of the
approximation $N^3/\pi^2$. We won’t do it here, but it’s not hard to remove the log
factor from estimate (8) to give

$$S(N) = \frac{N^3}{\pi^2} + O(N^2). \tag{10}$$


Things get really interesting, however, if we restate our problem so that the
slopes are strictly less than N, rather than less than or equal to N. While we’re at
it we'll eliminate the ‘1 +’ of Equation (2) by using the torus, so that in this new
version the number of regions $R(N)$, which is $S(N - 1) - 1$, becomes

$$R(N) = \sum_{1 \le k < N} \Phi(k), \quad \text{for } N \ge 2, \tag{11}$$

when expressed in terms of $\Phi$. Since this equals $S(N) - \Phi(N) - 1$, and since
$\Phi(N) = O(N^2)$, estimate (10) implies $R(N) = N^3/\pi^2 + O(N^2)$. In fact the Prime
Number Theorem affords a slightly better error term:

$$R(N) = \frac{N^3}{\pi^2} + o(N^2). \tag{12}$$

That is, the error $E_R(N)$ defined by

$$E_R(N) = R(N) - \frac{N^3}{\pi^2} \tag{13}$$

is $o(N^2)$. The sum $R(N)$ itself seems not to have appeared in the literature, but its
error $E_R(N)$ has appeared, in the study of the average size of $\Phi$’s error $E_\Phi$, in the
form $\sum_{k=1}^{N} E_\Phi(k)$. An easy manipulation, summing equation (6) and using
equations (11) and (13), shows that

$$E_R(N) = \sum_{1 \le k < N} E_\Phi(k) - \frac{3N^2}{2\pi^2} + \frac{N}{2\pi^2}. \tag{14}$$

The bound $E_R(N) = o(N^2)$ then follows from Pillai and Chowla’s result [12] that
$\sum_{k=1}^{N} E_\Phi(k)$ is asymptotic to $3N^2/(2\pi^2)$. A slightly improved bound,
$E_R(N) = O(N^2/\exp(A (\ln N)^{3/5} (\ln\ln N)^{-1/5}))$ for some positive constant A, follows
similarly from D. Suryanarayana and R. Sitaramachandra Rao [16], whose estimate
comes from Richert and Walfisz’s estimate [21, page 191], due primarily to
Richert, of a sum involving the Möbius function. And if the Korobov/Vinogradov
claim were verified, the improved bound on $E_R$ (along with an analogous bound on
the error term in the Prime Number Theorem) would be sharpened by the
elimination of the $(\ln\ln N)^{-1/5}$ factor. Even the sharper bound, though, would be
quite close to $N^2$.

Numerical evidence, on the other hand, suggests that $E_R(N)$ is much smaller
than $N^2$. For example, when $N = 100$ the number of regions $R(N)$ is 101315, and
$N^3/\pi^2 \approx 101321$, so $E_R(N)$ is about $-6$; this is somewhat surprising, since $N^2$
here is 10,000, and the last term added to the sum, $\Phi(99)$, is 3004. The evidence
suggests that $E_R(N)$, which is sometimes positive and sometimes negative, is
bounded by something closer to $N^{3/2}$. In fact Suryanarayana [15] shows that,
assuming the Riemann hypothesis, $\sum_k E_\Phi(k) = 3N^2/(2\pi^2) + O(N^{3/2+\varepsilon})$ for all
positive constants $\varepsilon$, and Paolo Codeca [1] proves that this bound holds if and only
if the Riemann hypothesis is true. Equation (14), along with even Mertens's
$O(N \log N)$ bound on $E_\Phi(N)$, therefore implies that $E_R(N)$ is $O(N^{3/2+\varepsilon})$, which
is to say that

\[ R(N) = \frac{N^3}{\pi^2} + O(N^{3/2+\varepsilon}), \quad \text{for all constants } \varepsilon > 0, \]

if and only if the Riemann hypothesis is true. So, despite the fairly large gap
between essentially $N^2$ and essentially $N^{3/2}$, further improvements in the error
bound await progress on the Riemann hypothesis.
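
The two numbers quoted in the example above can be rechecked with a few lines of Python. This is only a sketch: it assumes that the garbled symbol in the text is the summatory totient $\Phi(n) = \varphi(1) + \cdots + \varphi(n)$, and the helper names `phi` and `Phi` are ours, not the article's.

```python
from math import gcd, pi

def phi(n):
    """Euler's totient, by a direct gcd count (adequate for small n)."""
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)

def Phi(n):
    """Summatory totient Phi(n) = phi(1) + ... + phi(n) (assumed meaning)."""
    return sum(phi(k) for k in range(1, n + 1))

N = 100
last_term = Phi(N - 1)        # the last term quoted in the text
main_term = N**3 / pi**2      # the main term N^3 / pi^2
print(last_term, round(main_term))
```

Both values agree with the figures 3004 and 101321 given in the text.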
Rational Periodic Points of the
Quadratic Function $Q_c(x) = x^2 + c$


Ralph Walde and Paula Russo 


1. INTRODUCTION. The family of quadratic maps, $Q_c(x) = x^2 + c$, plays a
fundamental role in the understanding of one-dimensional dynamics over both the
real and complex numbers. One reason is that virtually every sort of periodic and
chaotic behavior is exhibited by at least one member of this 1-parameter family of
maps. Also, the fixed points and period two points of $Q_c$ can be explicitly computed
in terms of the parameter $c$, making these functions a good source of examples for
introductory texts on the subject. Beyond period two, however, explicit calculations
become more difficult, and so numerical methods are used to find approximate
values of attracting periodic points of $Q_c$. For students being introduced to the
material for the first time, examples with only approximate values for periodic
points are less satisfying than those which display the precise values.

In this paper we examine some of the periodic orbits of $Q_c$ that can be precisely
computed, those which consist entirely of rational numbers. We obtain results that
describe explicitly which of the $Q_c$ have both rational fixed points and rational
period two points as well as those which have rational period three orbits. These
results are particularly appealing as they can be readily used to generate numerous
explicit examples. We also show that if $c$ is a rational number with an odd
denominator then $Q_c$ has no rational orbits of period greater than 2.

Our theorems have elementary proofs using results from a variety of areas of 
algebra. Specifically, in addition to the algebraic manipulation of polynomials, we 
make use of a classical result in number theory on Pythagorean triples and use 
some elementary results involving p-adic numbers. 


2. BACKGROUND FROM DYNAMICS. We present in this section some neces- 
sary notation and terminology from dynamical systems. For a more complete 
introduction to dynamical systems see [3]. 

For a given function $f(x)$ and an integer $n > 0$, we denote by $f^n(x)$ the
composition of $f$ with itself $n$ times. A point $A$ is said to be a fixed point of $f$ if
$f(A) = A$ and it is said to be a period $n$ point of $f$ if $n$ is the smallest positive
integer for which $f^n(A) = A$. In this case the set of points

\[ \{A, f(A), f^2(A), f^3(A), \ldots, f^{n-1}(A)\} \]

is said to be a period $n$ orbit of $f(x)$. When all of the $f^i(A)$ are real we refer to the
orbit as a real orbit. Similarly, if all of the $f^i(A)$ are rational, we refer to the orbit
as a rational orbit. We say that $A$ is a periodic point of $f$ if it is a period $n$ point
for some positive integer $n$. Note that authors often use the term prime period $n$
instead of period $n$ for the above property and simply call $A$ a period $n$ point of $f$
if $f^n(A) = A$ whether $n$ is the smallest such integer or not.
Let $A$ be a period $n$ point of $f(x)$. We say that $A$ is attracting if

\[ |(f^n)'(A)| < 1. \tag{1} \]

In this case there is a neighborhood $U$ about $A$ such that if $x \in U$ and $x \ne A$ then
$|f^n(x) - A| < |x - A|$. Analogously, $A$ is said to be repelling if

\[ |(f^n)'(A)| > 1. \tag{2} \]

We shall restrict our attention to the quadratic family of maps defined by
$Q_c(x) = x^2 + c$, where $c$ is a complex parameter. For some of our results we will
restrict the parameter $c$ to be either a real or a rational number. Note that all
rational numbers are assumed to be in lowest terms unless otherwise stated.
Occasionally we will consider a function with a fixed value of $c$ and will then
simply use the notation $Q(x)$ to refer to this function. Since our main interest is in
the periodic points of $Q_c$, it is helpful to rewrite the definitions of attracting and
repelling for this family of functions. Using $Q_c'(x) = 2x$, equations (1) and (2)
above become

\[ |2^n A_0 A_1 \cdots A_{n-1}| < 1 \quad\text{and}\quad |2^n A_0 A_1 \cdots A_{n-1}| > 1, \tag{3} \]

where $A_0 = A$ and $A_k = Q_c^k(A)$ for $k = 1, 2, \ldots, n-1$.


3. RATIONAL FIXED POINTS AND RATIONAL PERIOD TWO POINTS OF $Q_c$.
We begin with a description of the complex, real and rational points of periods one
and two for $Q_c(x) = x^2 + c$. The characterization of these points is well known
and follows directly by solving the formulas defining the periodic points. However,
the usual approach is to fix a value of the parameter $c$, and then find a formula for
the fixed and period two points. Our point of view is to begin with a particular
number $\lambda$ and determine those values of $c$ for which $Q_c$ has $\lambda$ as a fixed point or
period two point. This will allow us to parametrize those quadratic functions that
have real or rational points of period one or two.

If $\lambda$ is a fixed point of $Q_c$ then $Q_c(\lambda) = \lambda^2 + c = \lambda$ implies that $c = \lambda - \lambda^2$ and
so $1 - \lambda$ is also a fixed point of $Q_c$. Clearly, these fixed points are rational
numbers if and only if $\lambda$ itself is a rational number. If $\mu$ is a period two point of
$Q_c$, then

\[ \frac{Q_c(Q_c(\mu)) - \mu}{Q_c(\mu) - \mu} = \mu^2 + \mu + c + 1 = 0 \]

implies $c = -\mu^2 - \mu - 1$ and so $Q_c(\mu) = -1 - \mu$ is the other period two point.
These period two points will both be rational if and only if $\mu$ is a rational number.
We summarize these results in the following theorem.



Theorem 1. 1. For each complex number $\lambda$, the quadratic map $Q_c$ has fixed points
$\lambda$ and $1 - \lambda$ if and only if $c = \lambda - \lambda^2$. These will be distinct if $\lambda \ne \frac{1}{2}$. The map from
$\lambda$ to the function $Q_{\lambda - \lambda^2}$ provides a one-to-one correspondence between the real
numbers $\lambda \ge \frac{1}{2}$ and those functions $Q_c$ that have real fixed points. Furthermore,
these fixed points are rational if and only if $\lambda$ is rational.

2. For each complex number $\mu \ne -\frac{1}{2}$, the map $Q_c$ has the period two orbit
$\{\mu, -1 - \mu\}$ if and only if $c = -1 - \mu - \mu^2$. The map from $\mu$ to the function
$Q_{-1-\mu-\mu^2}$ provides a one-to-one correspondence between the real numbers $\mu > -\frac{1}{2}$
and those $Q_c$ that have a period two orbit consisting of two distinct real numbers.
Furthermore, these period two points are rational if and only if $\mu$ is rational.
Proof: We need only mention that the restrictions $\lambda \ge \frac{1}{2}$ and $\mu > -\frac{1}{2}$ are
needed so that the correspondences between $\lambda$ and $\mu$ and the values for the $c$'s
are single valued. The value $\mu = -\frac{1}{2}$ is excluded since in this case $\mu = -1 - \mu$
and hence is fixed and so there is no $c$ for which $-\frac{1}{2}$ is a point of period 2. □
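
Theorem 1 is easy to test with exact rational arithmetic. The sketch below is illustrative only; the helper names `Q`, `c_from_fixed_point` and `c_from_period_two_point` are ours, and the sample values of $\lambda$ and $\mu$ are arbitrary choices.

```python
from fractions import Fraction

def Q(c, x):
    """The quadratic map Q_c(x) = x^2 + c."""
    return x * x + c

def c_from_fixed_point(lam):
    """Parameter c for which lam (and 1 - lam) is a fixed point."""
    return lam - lam * lam

def c_from_period_two_point(mu):
    """Parameter c for which {mu, -1 - mu} is a period two orbit."""
    return -1 - mu - mu * mu

lam = Fraction(3, 2)
c1 = c_from_fixed_point(lam)          # c = 3/2 - 9/4 = -3/4
print(c1, Q(c1, lam), Q(c1, 1 - lam))

mu = Fraction(1, 3)
c2 = c_from_period_two_point(mu)      # c = -1 - 1/3 - 1/9 = -13/9
print(c2, Q(c2, mu), Q(c2, Q(c2, mu)))
```

Because `Fraction` arithmetic is exact, the equalities $Q_c(\lambda) = \lambda$ and $Q_c(Q_c(\mu)) = \mu$ hold without rounding error.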


If one is interested in explicit examples of rational periodic points for quadratic
maps then it is very natural to ask the question: "Which quadratic maps have both
rational fixed points and rational period two points?" The following theorem
completely characterizes such functions.


Theorem 2. The function $Q_c$ has both two rational fixed points and a rational period
two orbit if and only if it is possible to write $c = -\frac{3}{4} - (X/Y)^2$ where $X$ and $Y$ are
members of a Pythagorean triple $\{X, Y, Z\}$. That is, $X$ and $Y$ are positive integers and
there exists a positive integer $Z$ with $X^2 + Y^2 = Z^2$. In this case the fixed points are
$\frac{1}{2} \pm (Z/Y)$ and the period two points are $-\frac{1}{2} \pm (X/Y)$.


Proof: If $Q_c$ has rational fixed and period two points then the formulas in
Theorem 1 imply that $c = -\lambda^2 + \lambda = -\mu^2 - \mu - 1$ for rational numbers $\lambda$ and
$\mu$. Completing the squares, we obtain

\[ -\left(\lambda - \tfrac{1}{2}\right)^2 + \tfrac{1}{4} = -\left(\mu + \tfrac{1}{2}\right)^2 - \tfrac{3}{4} \tag{4} \]

or equivalently,

\[ \left(\mu + \tfrac{1}{2}\right)^2 + 1 = \left(\lambda - \tfrac{1}{2}\right)^2. \]

These equations imply that the rational numbers $\mu + \frac{1}{2}$ and $\lambda - \frac{1}{2}$ have the same
denominator so we can write

\[ \mu + \tfrac{1}{2} = \frac{X}{Y} \quad\text{and}\quad \lambda - \tfrac{1}{2} = \frac{Z}{Y} \]

for integers $X$, $Y$ and $Z$. Substituting these values into the previous equation we
get $(X/Y)^2 + 1 = (Z/Y)^2$, or equivalently, $X^2 + Y^2 = Z^2$, which is the usual
formula for Pythagorean triples. Substituting for $\mu$ and $\lambda$ into our formulas for $c$
we find that $c = -\frac{3}{4} - (X/Y)^2$ and $Q_c$ possesses fixed points $\lambda = \frac{1}{2} \pm (Z/Y)$
and period two points $\mu = -\frac{1}{2} \pm (X/Y)$ as desired. Notice that we can assume
that $X$, $Y$ and $Z$ are all positive. □


The following corollary gives a somewhat technical characterization of those $Q_c$
possessing both rational fixed and period two points. It can be easily used to
generate explicit examples.


Corollary 1. A complete listing of all examples of $Q_c$ with both rational fixed points
and rational period two points is obtained from Theorem 2 and integers $m > n > 0$
with $m$ and $n$ relatively prime and either $m$ or $n$ even by setting either

\[ X = m^2 - n^2, \quad Y = 2mn, \quad\text{and}\quad Z = m^2 + n^2 \tag{5} \]

or

\[ X = 2mn, \quad Y = m^2 - n^2, \quad\text{and}\quad Z = m^2 + n^2. \tag{6} \]
Proof: From formula (4) in the proof of Theorem 2 we have

\[ c = -\left(\frac{X}{Y}\right)^2 - \frac{3}{4} = -\left(\frac{Z}{Y}\right)^2 + \frac{1}{4}. \]

This shows that it suffices to obtain solutions in which $X$, $Y$ and $Z$ are positive
integers that are pairwise relatively prime, that is, those which are referred to in
number theory as primitive Pythagorean triples. The classical solution to the
problem of finding such triples is given by equation (5) above when $Y$ is even, and
by equation (6) when $X$ is even and $Y$ is odd. (See [6], page 394, for example.) □


Table 1 lists some examples of functions obtained from this corollary with both
rational fixed points and rational period two points.


TABLE 1. Some examples of $Q_c$ with both rational fixed and period two points using small
values of $m$ and $n$ in the formulas (5) and (6) in Corollary 1.
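
Examples of the kind described in Corollary 1 can be generated mechanically. The following sketch is one way to do so; the function name `examples` and the row layout are our own choices, not the authors'.

```python
from fractions import Fraction
from math import gcd

def Q(c, x):
    return x * x + c

def examples(max_m):
    """Rows (m, n, c, fixed points, period two points) from formulas (5) and (6)."""
    rows = []
    half = Fraction(1, 2)
    for m in range(2, max_m + 1):
        for n in range(1, m):
            # m, n relatively prime with exactly one of them even
            if gcd(m, n) != 1 or (m - n) % 2 == 0:
                continue
            a, b, z = m * m - n * n, 2 * m * n, m * m + n * n
            for X, Y in ((a, b), (b, a)):      # formula (5), then formula (6)
                c = Fraction(-3, 4) - Fraction(X, Y) ** 2
                fixed = (half + Fraction(z, Y), half - Fraction(z, Y))
                per2 = (-half + Fraction(X, Y), -half - Fraction(X, Y))
                rows.append((m, n, c, fixed, per2))
    return rows

for row in examples(3):
    print(row)
```

For $m = 2$, $n = 1$ formula (5) gives the triple $(3, 4, 5)$ and $c = -21/16$, with fixed points $7/4$, $-3/4$ and period two points $1/4$, $-5/4$.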
4. RATIONAL PERIOD THREE ORBITS OF $Q_c$. We now wish to consider the
existence of rational period three orbits of $Q_c$. If we were to try to generalize the
computations that we used to examine period two points we would note that if $\zeta$ is
a point of period three of $Q_c$ then

\[ Q_c(Q_c(Q_c(\zeta))) = \zeta \quad\text{and}\quad Q_c(\zeta) \ne \zeta. \]

Thus $\zeta$ satisfies the equation

\[ \frac{Q_c(Q_c(Q_c(\zeta))) - \zeta}{Q_c(\zeta) - \zeta} = 0, \]

which has the quotient

\[ \zeta^6 + \zeta^5 + (3c+1)\zeta^4 + (2c+1)\zeta^3 + (3c^2+3c+1)\zeta^2 + (c+1)^2\zeta + (c^3+2c^2+c+1) = 0. \]

Unfortunately this sixth degree equation will not be very useful for finding rational
period three orbits. It is a cubic equation in $c$, which prevents us from writing $c$
directly in terms of $\zeta$ as we did in Theorem 1. Instead, we give a parametrization
of both those $c$ with a rational periodic orbit and the period 3 orbit itself in terms
of a complex parameter $\tau$. Although the parametrization is a bit unwieldy, it allows
us to generate explicit examples of rational period 3 orbits as desired.


Theorem 3. For any complex number $\tau$ with $\tau \ne 0$, $\tau \ne -1$, and $\tau^2 + \tau + 1 \ne 0$,
let

\[ c = -\frac{\tau^6 + 2\tau^5 + 4\tau^4 + 8\tau^3 + 9\tau^2 + 4\tau + 1}{4\tau^2(\tau+1)^2} \tag{7} \]

\[ x_1 = \frac{\tau^3 + 2\tau^2 + \tau + 1}{2\tau(\tau+1)} \tag{8} \]

\[ x_2 = \frac{\tau^3 - \tau - 1}{2\tau(\tau+1)} \tag{9} \]

\[ x_3 = -\frac{\tau^3 + 2\tau^2 + 3\tau + 1}{2\tau(\tau+1)} \tag{10} \]

It then follows that

1. $Q_c$ has the period three orbit $\{x_1, x_2, x_3\}$. That is, $x_2 = Q_c(x_1)$, $x_3 = Q_c(x_2)$,
$x_1 = Q_c(x_3)$, and the three complex numbers $x_1$, $x_2$, and $x_3$ are distinct.
Furthermore, every period three orbit of the function $Q_c$ can be described in this
manner.

2. The formulas for $c$, $x_1$, $x_2$, and $x_3$ give a one-to-one correspondence between
the real numbers $\tau > 0$ and the real period 3 orbits of the functions $Q_c$.

3. The three points in the orbit are three distinct rational numbers if and only if $\tau$
is itself a rational number.


Proof: Given a complex number $\tau$, subject to the restrictions in the theorem, one
can verify that $Q_c(x_1) = x_2$, $Q_c(x_2) = x_3$ and $Q_c(x_3) = x_1$ by straightforward
computations using the above formulas for $c$ and the $x_i$'s. Note that the condition
$\tau^2 + \tau + 1 \ne 0$ is needed to ensure that the $x_i$'s are distinct. Now, assume $\zeta$ is a
complex number with $Q_c(Q_c(Q_c(\zeta))) = \zeta$ and assume that $\omega = Q_c(\zeta) = \zeta^2 + c$ is
not equal to $\zeta$. We must derive the formulas for $c$, $x_1$, $x_2$, and $x_3$ for some
complex $\tau$. We have $c = \omega - \zeta^2$, so that we can write

\[ Q_c(x) = x^2 + \omega - \zeta^2. \]

We can now compute

\[ \zeta = Q_c(Q_c(\omega)) = Q_c(\omega^2 + \omega - \zeta^2) = \omega^4 + 2\omega^3 + \omega^2 - 2\omega^2\zeta^2 - 2\omega\zeta^2 + \zeta^4 + \omega - \zeta^2. \]

Transposing the $\zeta$ and factoring we find

\[ (\omega - \zeta)(\omega^3 + \omega^2\zeta - \omega\zeta^2 - \zeta^3 + 2\omega^2 + 2\omega\zeta + \omega + \zeta + 1) = 0. \]

Dividing by $\omega - \zeta$ we have an expression equivalent to

\[ (\omega + \zeta)^3 + (2 - 2\zeta)(\omega + \zeta)^2 + (1 - 2\zeta)(\omega + \zeta) + 1 = 0. \]

Letting $\tau = \omega + \zeta$, and then solving for $\zeta$ we find

\[ \zeta = \frac{\tau^3 + 2\tau^2 + \tau + 1}{2\tau(\tau+1)}, \]

which is the formula for $x_1$. Now one can derive the other formulas using the
equations

\[ x_2 = \omega = \tau - \zeta = \tau - x_1, \]
\[ c = \omega - \zeta^2 = x_2 - x_1^2, \]
\[ x_3 = Q_c(x_2). \]
To prove statement 2 in the theorem we observe that if $\tau$ is real then the
formulas guarantee that $c$, $x_1$, $x_2$ and $x_3$ are real. Conversely, if $c$, $x_1$, $x_2$ and $x_3$
are all real then $\tau = x_1 + x_2$ must also be real. To justify making the assumption
that $\tau > 0$, use the formulas for the $x_i$'s to verify that

\[ x_1 + x_2 = \tau, \quad x_2 + x_3 = -\frac{\tau + 1}{\tau}, \quad\text{and}\quad x_3 + x_1 = -\frac{1}{\tau + 1}. \]

Using these formulas, it is easy to check that exactly one of these three numbers is
positive. In fact, $x_1 + x_2$ is positive if and only if $\tau > 0$, $-1 < \tau < 0$ guarantees
that $x_2 + x_3$ is positive, and lastly $x_3 + x_1$ is positive precisely when $\tau < -1$.
Thus, by a suitable cyclic renumbering of the $x_i$'s we can assume that $x_1 + x_2 > 0$.
Note that two distinct positive real numbers $\tau_1$ and $\tau_2$ must correspond to distinct
period three orbits because of the relationship $\tau = x_1 + x_2$ even though $\tau_1$ and $\tau_2$
might correspond to the same value of $c$.

Lastly, to prove statement 3 of the theorem, if $\tau$ is rational then the formulas
guarantee that $c$ and the $x_i$'s are rational. Conversely, if $c$ and the $x_i$'s are all
rational, then $\tau = x_1 + x_2$ must also be rational. □


The formulas in Theorem 3 for those $\tau$ with small numerators and denominators
give us examples of rational period 3 orbits with small denominators. Some
examples of these are listed in Table 2.


TABLE 2. The period 3 orbit $\{x_1, x_2, x_3\}$ of the functions
$Q_c$, using values of $\tau$ with small numerators and
denominators in the formulas of Theorem 3.
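
The formulas (7) through (10) translate directly into exact rational arithmetic. The sketch below is illustrative; the names `period_three_orbit` and `multiplier` are ours, and the multiplier $2^n x_1 x_2 \cdots x_n$ is the quantity from equation (3) used to decide whether an orbit is attracting.

```python
from fractions import Fraction

def period_three_orbit(tau):
    """Return (c, (x1, x2, x3)) from formulas (7)-(10) for rational tau."""
    tau = Fraction(tau)
    if tau in (0, -1):
        # tau^2 + tau + 1 has no rational roots, so only these are excluded
        raise ValueError("tau must not be 0 or -1")
    d = 2 * tau * (tau + 1)
    c = -(tau**6 + 2*tau**5 + 4*tau**4 + 8*tau**3 + 9*tau**2 + 4*tau + 1) / d**2
    x1 = (tau**3 + 2*tau**2 + tau + 1) / d
    x2 = (tau**3 - tau - 1) / d
    x3 = -(tau**3 + 2*tau**2 + 3*tau + 1) / d
    return c, (x1, x2, x3)

def multiplier(orbit):
    """2^n times the product of the orbit points, as in equation (3)."""
    prod = Fraction(2) ** len(orbit)
    for x in orbit:
        prod *= x
    return prod

c, orbit = period_three_orbit(1)
print(c, orbit, multiplier(orbit))
```

For $\tau = 1$ this gives $c = -29/16$ and the orbit $5/4 \to -1/4 \to -7/4 \to 5/4$, whose multiplier $35/8$ exceeds 1 in absolute value, so this particular orbit is repelling.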
It is of interest to look more carefully at the relationship between $c$ and the
values of the points of periods one, two and three. A graph of this relationship can
be found in Figure 1. It is apparent from this graph just when points of different
periods come into existence. Note that this graph displays all periodic points of
these orders, both attracting and repelling. This is in contrast to the bifurcation
diagram for the family, which shows only the attracting periodic orbits. A bifurcation
diagram drawn to the same scale as the graph in Figure 1 can be found in
Figure 2.


[Figure 1. The relationship between $c$ and the values of the points of period 1, 2 and 3.
Horizontal axis: $c$, marked at $-2.0$, $-1.75$, $-1.25$, $-0.75$ and $0.25$.]


[Figure 2. The bifurcation diagram for the family $Q_c$.
Horizontal axis: $c$, marked at $-2.0$, $-1.75$, $-1.25$, $-0.75$ and $0.25$.]

Example 1. In Table 2, the last entry is of particular interest. As $|8x_1x_2x_3| < 1$,
this period 3 orbit is attracting. Our computations indicate that this is the
attracting rational period 3 orbit having the smallest possible denominator.


Corollary 2. The function $Q_c$ has two period three orbits each with three distinct real
numbers if and only if $c < -\frac{7}{4}$. If $c = -\frac{7}{4}$, then $Q_c$ has a single period three orbit
consisting of the three distinct irrational roots of $x^3 + \frac{1}{2}x^2 - \frac{9}{4}x - \frac{1}{8} = 0$.
Proof: If one differentiates formula (7) in Theorem 3 and then factors the result,
one finds that

\[ \frac{dc}{d\tau} = -\frac{(\tau^3 + \tau^2 - 2\tau - 1)(\tau^2 + \tau + 1)^2}{2\tau^3(\tau+1)^3} \]

so that $c$ possesses a single extremum for $\tau > 0$, a maximum, which occurs at the
positive root of $\tau^3 + \tau^2 - 2\tau - 1 = 0$. To find the maximum value of $c$, one
makes repeated use of the substitution $\tau^3 = -\tau^2 + 2\tau + 1$ in the formula for $c$
and discovers that $c = -\frac{7}{4}$. For any value of $c < -\frac{7}{4}$, there are two different
positive values of $\tau$ yielding the $c$ and thus two different period three orbits of real
numbers. If $c = -\frac{7}{4}$ then any complex period three point must be a solution of
the equation

\[ 0 = \frac{Q_c(Q_c(Q_c(x))) - x}{Q_c(x) - x} = \left(x^3 + \tfrac{1}{2}x^2 - \tfrac{9}{4}x - \tfrac{1}{8}\right)^2 \]

and hence there is a single orbit of period 3. □
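
Both claims in this proof can be checked by machine. The sketch below, with helper names of our own choosing, verifies exactly that the sixth degree polynomial of Section 4 becomes a perfect square at $c = -7/4$, and checks numerically (by bisection, an illustrative choice) that the maximum of formula (7) over $\tau > 0$ is $-7/4$.

```python
from fractions import Fraction

def sextic_coeffs(c):
    """Coefficients, highest degree first, of (Q^3(x) - x)/(Q(x) - x)."""
    return [1, 1, 3*c + 1, 2*c + 1, 3*c*c + 3*c + 1,
            (c + 1)**2, c**3 + 2*c*c + c + 1]

def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def c_of_tau(tau):
    """Formula (7)."""
    num = tau**6 + 2*tau**5 + 4*tau**4 + 8*tau**3 + 9*tau**2 + 4*tau + 1
    return -num / (4 * tau**2 * (tau + 1)**2)

# Exact check: at c = -7/4 the sextic is the square of the stated cubic.
cubic = [Fraction(1), Fraction(1, 2), Fraction(-9, 4), Fraction(-1, 8)]
is_square = poly_mul(cubic, cubic) == sextic_coeffs(Fraction(-7, 4))

# Numerical check: bisection for the positive root of tau^3 + tau^2 - 2 tau - 1.
lo, hi = 1.0, 2.0
for _ in range(60):
    mid = (lo + hi) / 2
    if mid**3 + mid**2 - 2*mid - 1 < 0:
        lo = mid
    else:
        hi = mid
tau_star = (lo + hi) / 2
c_max = c_of_tau(tau_star)
print(is_square, tau_star, c_max)
```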


Corollary 3. The sets of $c$ for which $Q_c$ has a rational orbit of period 1, 2 and 3 are
dense in the sets $c \le \frac{1}{4}$, $c < -\frac{3}{4}$, and $c \le -\frac{7}{4}$, respectively.


Proof: This follows directly from the continuity of the functions that describe the
parameter $c$ in Theorems 1 and 3. □


One should note at this point that Theorems 1 and 3 as well as the above 
corollary have an algebraic geometry flavor. That is, the parameterization of a 
curve provides a description of the rational points of the curve and shows that they 
are dense in the curve. See [5] for information. 


5. BACKGROUND FROM P-ADIC NUMBERS. Up to this point we have focused
on giving some partial answers to the question of which $Q_c$ have rational periodic
orbits. We now turn to the consideration of which rational values of $c$ give rise to
$Q_c$ which do not have rational orbits of any period as well as those that have very
few. For these results a different algebraic approach is necessary. The main tools
used in the proofs are results about the p-adic numbers. We give here some of the
necessary background. A full treatment of p-adic numbers can be found in the text
[1] or briefer comments concerning them can be found in the recent article [2].
We begin with some of the basic definitions. For any prime number $p$, the
p-adic valuation or norm, $|x|_p$, on the set of rational numbers, can be defined by

\[ |0|_p = 0, \quad\text{and}\quad |x|_p = p^{-k} \ \text{ if } \ x = p^k \frac{m}{n}, \]

where $k$ is the unique integer such that the non-zero integers $m$ and $n$ are not
divisible by $p$.

This definition can be extended to a larger set of numbers called the p-adic
numbers. For any prime number $p$, the set of p-adic numbers consists of all formal
series of the form

\[ p^k(a_0 + a_1 p + a_2 p^2 + a_3 p^3 + \cdots) \]

with $k$ any integer and each $a_i$ an integer with $0 \le a_i \le p - 1$. The p-adic
valuation $|x|_p$ is then defined on this set of numbers by

\[ |0|_p = 0, \quad\text{and}\quad |x|_p = p^{-k} \ \text{ if } \ x = p^k(a_0 + a_1 p + a_2 p^2 + \cdots), \]

where $k$ is chosen so that $a_0$ is positive. Addition, subtraction, multiplication, and
division can be defined on the p-adic numbers by using the usual power series
rules and then adjusting the coefficients to satisfy $0 \le a_i \le p - 1$. With these
definitions, the p-adic numbers form a field which is complete with respect to the
metric defined by

\[ d(x, y) = |x - y|_p. \]


For any prime $p$, the set of p-adic numbers contains all positive and negative
rational numbers expressed as eventually repeating series. For example, in the set
of 2-adic numbers,

\[ -1 = 1 + 2^1 + 2^2 + 2^3 + 2^4 + \cdots \]

because if we add 1 to this series we get the series for 0. Similarly,

\[ -\tfrac{1}{5} = 1 + 2^1 + 2^4 + 2^5 + 2^8 + 2^9 + \cdots \]

because multiplying this series by $5 = 1 + 2^2$ we get the series for $-1$. Those
p-adic series that are not eventually repeating are not rational numbers and, in
general, may not correspond to either real or complex numbers. The p-adic
valuation on the p-adic numbers has several useful properties which we state in
the theorem below.


Theorem 4. If $x$ and $y$ are p-adic numbers then


1. $|x|_p \ge 0$ with $|x|_p = 0$ if and only if $x = 0$.
2. $|xy|_p = |x|_p |y|_p$.
3. $|x + y|_p \le \max(|x|_p, |y|_p)$, with equality holding if $|x|_p \ne |y|_p$.
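
On rational numbers the valuation is simple to compute, which makes the properties in Theorem 4 and the series for $-\frac{1}{5}$ easy to spot-check. The function name `p_adic_norm` below is our own; the sketch covers rational inputs only, not general p-adic series.

```python
from fractions import Fraction

def p_adic_norm(x, p):
    """|x|_p for a rational x: p^(-k) where x = p^k * m/n with p dividing neither m nor n."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, k = x.numerator, x.denominator, 0
    while num % p == 0:      # powers of p in the numerator raise k
        num //= p
        k += 1
    while den % p == 0:      # powers of p in the denominator lower k
        den //= p
        k -= 1
    return Fraction(1, p) ** k

# Partial sum of the 2-adic series 1 + 2 + 2^4 + 2^5 + 2^8 + 2^9 for -1/5
partial = sum(2**e for e in (0, 1, 4, 5, 8, 9))
print(partial, p_adic_norm(Fraction(-1, 5) - partial, 2))
```

The partial sum 819 differs from $-\frac{1}{5}$ by $-4096/5$, whose 2-adic norm is $2^{-12}$, so the partial sums approach $-\frac{1}{5}$ in the 2-adic metric.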


Some of our results depend on the following special result which can be
deduced directly from Theorem 4.4 on page 60 of [1].


Theorem 5. A non-zero integer $b$ has a square root in the 2-adic numbers if and only
if $b = 2^{2n}a$, where $a$ is congruent to 1 modulo 8 ($a \equiv 1 \bmod 8$). In this case $b$ has
two distinct 2-adic square roots.
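
The criterion of Theorem 5 amounts to stripping factors of 4 and testing the remaining factor modulo 8. A minimal sketch follows; the name `has_two_adic_sqrt` is ours.

```python
def has_two_adic_sqrt(b):
    """Theorem 5 criterion: b = 2^(2n) * a with a congruent to 1 modulo 8."""
    if b == 0:
        raise ValueError("Theorem 5 is stated for non-zero integers")
    while b % 4 == 0:        # remove the factor 2^(2n)
        b //= 4
    return b % 8 == 1        # Python's % returns a value in 0..7, also for b < 0

for b in (1, 17, 68, -7, 2, 3, -1, 8):
    print(b, has_two_adic_sqrt(b))
```

For instance $-7 \equiv 1 \pmod 8$, so $-7$ has a 2-adic square root, while $-1$ does not.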


Substituting the p-adic valuation for the real or complex absolute value in
equation (3) we obtain a definition of attracting and repelling periodic points for
the quadratic function thought of as a map from the p-adic numbers to themselves.
Thus, for any p-adic number $c$, a p-adic period $n$ point $A$ of $Q_c$ is attracting if

\[ |2^n A_0 A_1 \cdots A_{n-1}|_p < 1 \]

and it is repelling if

\[ |2^n A_0 A_1 \cdots A_{n-1}|_p > 1 \]

where the $A_k$ are defined as in Section 2.


6. PERIODIC POINTS OF $Q_c$ OVER THE p-ADIC NUMBERS. Although we are
mainly interested in the dynamic behavior of $Q_c$ as a function over the rational
numbers, it will be helpful to establish some results about the periodic points of
the quadratic functions over the p-adic numbers. Our first theorem gives a useful
relationship between the parameter $c$ and the periodic points of $Q_c$.
Theorem 6. If $A$ is a periodic point of $Q_c$ over the field of p-adic numbers with
$|A|_p > 1$ then

\[ |A|_p^2 = |c|_p \quad\text{and}\quad |Q_c(A)|_p = |A|_p. \]


Proof: Assume $A$ is a periodic point of $Q_c$ with $|A|_p > 1$. We will consider three
cases. First, let us assume that $|A|_p^2 > |c|_p$. Then,

\[ |Q_c(A)|_p = |A^2 + c|_p = \max(|A|_p^2, |c|_p) = |A|_p^2 > |A|_p. \]

Applying the same argument to $Q_c(A)$ we obtain the increasing sequence

\[ |A|_p < |Q_c(A)|_p < |Q_c(Q_c(A))|_p < |Q_c(Q_c(Q_c(A)))|_p < \cdots. \]

This contradicts the fact that $A$ is periodic.
Second, we assume that $|A|_p^2 < |c|_p$. Then

\[ |Q_c(A)|_p = |A^2 + c|_p = \max(|A|_p^2, |c|_p) = |c|_p > |A|_p^2 > |A|_p. \]

Now $\mu = Q_c(A)$ has the property that

\[ |\mu|_p^2 = |c|_p^2 > |c|_p \]

so $\mu = Q_c(A)$ can be treated as in the first case. Thus, once again

\[ |A|_p < |Q_c(A)|_p < |Q_c(Q_c(A))|_p < |Q_c(Q_c(Q_c(A)))|_p < \cdots, \]

contradicting the fact that $A$ is periodic.

For the last case we assume that $|A|_p^2 = |c|_p$. Let $\mu = Q_c(A)$, which is also a
periodic point of $Q_c$. By using the first two cases above we can deduce that
$|\mu|_p^2 = |c|_p$ so that $|A|_p^2 = |\mu|_p^2$. Since we know that $|A|_p \ne 0$ and $|\mu|_p \ne 0$ we can
conclude that $|A|_p = |\mu|_p = |Q_c(A)|_p$ as required. □


It is clear that if $A$ is a rational number, then it is a periodic point of $Q_c$ viewed
as a function over the p-adic numbers if and only if it is periodic over the real
numbers. Thus, we can use Theorem 6 to obtain information about the rational
periodic points of $Q_c$.


Corollary 4. If $j/k$ is a rational periodic point of $Q_c$ with $c = m/n$ and both $j/k$
and $m/n$ are rational numbers expressed in lowest terms with $k$ and $n$ positive integers
then $n = k^2$.


Proof: For any prime $p$, Theorem 6 implies that if $|j/k|_p > 1$ then

\[ \left|\frac{j}{k}\right|_p^2 = \left|\frac{m}{n}\right|_p. \]

Thus if $p^a$ with $a > 0$ is the largest power of $p$ that divides $k$ then $p^{2a}$ is the
largest power of $p$ that divides $n$ and so $p$ divides $k$ if and only if it divides $n$.
Since the largest power of $p$ dividing $n$ is the square of the largest power dividing
$k$ for any prime $p$, we can conclude that $n = k^2$. □


Corollary 5. If $c$ is rational, then $Q_c$ has at most finitely many rational periodic
points.


Proof: By the previous corollary we can assume that $c = m/k^2$ and each
rational periodic point has the form $j/k$. Thus all of the rational periodic points
have the same denominator. In addition, for any $x$ with $|x| > |c| + 1$, the triangle
inequality implies that

\[ |Q_c(x)| \ge |x|^2 - |c| > |x|^2 - |x| + 1 = (|x| - 1)^2 + |x| \ge |x|. \]

Thus $x$ cannot be periodic. This implies that the set of all rational periodic points
of $Q_c$ is a subset of the finite set

\[ \left\{ \frac{j}{k} : \left|\frac{j}{k}\right| \le |c| + 1 = \frac{|m|}{k^2} + 1 \right\}. \] □
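
Corollaries 4 and 5 together reduce the search for rational periodic points to a finite check. The sketch below is one possible implementation, not the authors' procedure; the function name `rational_periodic_points` is ours. It tries every candidate $j/k$ with $k^2 = n$ and $|j/k| \le |c| + 1$, and abandons an orbit as soon as it leaves the candidate set.

```python
from fractions import Fraction
from math import isqrt

def rational_periodic_points(c):
    """Map each rational periodic point of Q_c to its (prime) period."""
    c = Fraction(c)
    n = c.denominator
    k = isqrt(n)
    if k * k != n:                 # Corollary 4: the denominator of c must be k^2
        return {}
    bound = abs(c) + 1             # Corollary 5: periodic points satisfy |x| <= |c| + 1
    jmax = int(bound * k)
    found = {}
    for j in range(-jmax, jmax + 1):
        start = Fraction(j, k)
        if start.denominator != k:     # j/k must already be in lowest terms
            continue
        x = start
        for period in range(1, 2 * jmax + 2):
            x = x * x + c
            if x == start:
                found[start] = period
                break
            if x.denominator != k or abs(x) > bound:
                break                  # the orbit left the finite candidate set
    return found

print(rational_periodic_points(Fraction(-21, 16)))
print(rational_periodic_points(-2))
```

For $c = -21/16$ the search returns the fixed points $7/4$, $-3/4$ and the period two points $1/4$, $-5/4$ found from the triple $(3, 4, 5)$; for $c = -2$ it returns only the fixed points $2$ and $-1$.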


The following theorem gives a sufficient condition for $Q_c$ to have no 2-adic
periodic points other than fixed points.


Theorem 7. If $c$ is a rational number with an even numerator, then $Q_c$ has two
2-adic fixed points and no other 2-adic periodic points. In fact, one of these fixed
points attracts all 2-adic numbers with norms less than 1 and the other fixed point
attracts all those with norms equal to 1. (The 2-adic numbers of norm greater than 1
will be shown to have unbounded orbits.)


Proof: The function $Q_c(x) = x^2 + (2m/n)$ has a fixed point whenever the equation
$x^2 + 2m/n = x$ has a 2-adic solution. Transposing $x$ to the left side of this
equation and completing the square gives us the equation

\[ \left(x - \frac{1}{2}\right)^2 = \frac{1}{4} - \frac{2m}{n} = \frac{n^2 - 8mn}{4n^2}. \]

Thus $Q_c$ has fixed points if and only if $n^2 - 8mn$ is a square in the 2-adic field. By
Theorem 5 it suffices to show that $n^2 - 8mn \equiv 1 \bmod 8$. This is equivalent to
showing that $n^2 \equiv 1 \bmod 8$ as it is clear that $8mn \equiv 0 \bmod 8$. Since $n$ is odd we
have that

\[ n^2 = (2k + 1)^2 = 4k^2 + 4k + 1 = 4k(k + 1) + 1 \]

and since either $k$ or $k + 1$ must be even, $n^2 \equiv 1 \bmod 8$. Thus, $Q_c$ has a pair of
2-adic fixed points.

Let $\alpha$ denote one of these 2-adic fixed points. As we saw in Section 3, $1 - \alpha$ is
also a fixed point of $Q_c$ and $\alpha(1 - \alpha) = c = (2m/n)$. Now,

\[ |\alpha|_2 |1 - \alpha|_2 = |\alpha(1 - \alpha)|_2 = |c|_2 = \left|\frac{2m}{n}\right|_2 < 1, \]

where the last inequality follows from the definition of the p-adic norm. Part 3 of
Theorem 4 implies that

\[ |\alpha|_2 < 1 \quad\text{and}\quad |1 - \alpha|_2 = \max(1, |\alpha|_2) = 1 \]

or

\[ |1 - \alpha|_2 < 1 \quad\text{and}\quad |\alpha|_2 = |1 - (1 - \alpha)|_2 = \max(1, |1 - \alpha|_2) = 1. \]

Since $\alpha$ could denote either fixed point we may assume that $|\alpha|_2 < 1$ and
$|1 - \alpha|_2 = 1$. We will show that no 2-adic numbers other than $\alpha$ and $1 - \alpha$ can be
periodic points of $Q_c$.

Assume $a$ is a 2-adic number with $a \ne \alpha$. To show that $a$ cannot be a periodic
point of $Q_c$ we consider three cases, $|a|_2 < 1$, $|a|_2 = 1$, and $|a|_2 > 1$. We begin
with $|a|_2 < 1$. In this case we have

\[ |a - \alpha|_2 \le \max(|a|_2, |\alpha|_2) < 1. \]

We may write $|a - \alpha|_2 = 2^{-k}$ for some positive integer $k$ and so $a - \alpha = 2^k u$ for
some 2-adic number $u$ with $|u|_2 = 1$. Now,

\[ |Q_c(a) - \alpha|_2 = |Q_c(\alpha + 2^k u) - \alpha|_2 \]
\[ = |\alpha^2 + 2^{k+1}\alpha u + 2^{2k}u^2 + (\alpha - \alpha^2) - \alpha|_2 \]
\[ = |2^{k+1}(\alpha u + 2^{k-1}u^2)|_2 \le 2^{-(k+1)} < |a - \alpha|_2, \]

which is less than 1. We also have

\[ |Q_c(a)|_2 = |a^2 + \alpha(1 - \alpha)|_2 \le \max(|a^2|_2, |\alpha(1 - \alpha)|_2) < 1, \]

which allows the computation to be repeated to obtain the decreasing sequence

\[ |a - \alpha|_2 > |Q_c(a) - \alpha|_2 > |Q_c(Q_c(a)) - \alpha|_2 > |Q_c(Q_c(Q_c(a))) - \alpha|_2 > \cdots. \]

This shows that $a$ cannot be a periodic point of $Q_c$ because it is attracted to $\alpha$, as
each term in the above sequence must be a different power of $\frac{1}{2}$.

In a similar fashion, one can show that any $a$ with $|a|_2 = 1$ cannot be a periodic
point of $Q_c$ and is in fact attracted to $1 - \alpha$.

Finally, we assume that $a$ is a 2-adic number with $|a|_2 > 1$. We find that

\[ |Q_c(a)|_2 = \left|a^2 + \frac{2m}{n}\right|_2 = \max\left(|a^2|_2, \left|\frac{2m}{n}\right|_2\right) = |a|_2^2 > |a|_2. \]

Continuing the process, we obtain

\[ |a|_2 < |Q_c(a)|_2 < |Q_c(Q_c(a))|_2 < \cdots, \]

which shows that $a$ cannot be a periodic point in this case either. Thus, the two
fixed points are the only 2-adic numbers that can be periodic points. □


The following corollary restates this theorem for the real quadratic family.


Corollary 6. If $Q_c(x) = x^2 + 2m/n$ where $m$ and $n$ are integers and $n$ is odd then
$Q_c$ can have no rational periodic points other than fixed points.


Proof: As previously noted, we can consider $Q_c$ as a function over the 2-adic
numbers and so by Theorem 7, $Q_c$ has no periodic points other than fixed points.
But the 2-adic numbers contain all rational numbers so that $Q_c$ can have no
rational periodic points other than, possibly, fixed points. □


Example 2. The quadratic $Q_{-2}(x) = x^2 - 2$ is frequently used as an example of a
chaotic function over the real numbers. (See Section 4.4 of [3], for example.) For
any $n \ge 1$, $Q_{-2}(x)$ has $2^n$ real periodic points of a period dividing $n$ and all of
them are repelling. Thus $Q_{-2}(x)$ has an infinite number of real periodic points
and these periodic points are dense in the interval $[-2, 2]$. Corollary 6 is one way
to show that none of these periodic points are rational except the fixed points 2
and $-1$.
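
For $c = -2$ both fixed points are rational, with $|2|_2 < 1$ and $|-1|_2 = 1$, so the attraction described in Theorem 7 can be watched directly on integer starting values. The sketch below is illustrative; the names `two_adic_norm` and `distances` and the starting values 4 and 3 are our own choices.

```python
from fractions import Fraction

def two_adic_norm(x):
    """|x|_2 for a rational x."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, k = x.numerator, x.denominator, 0
    while num % 2 == 0:
        num //= 2
        k += 1
    while den % 2 == 0:
        den //= 2
        k -= 1
    return Fraction(1, 2) ** k

def distances(a, target, steps, c=-2):
    """2-adic distances from the iterates of a under Q_c to a fixed point."""
    out = []
    for _ in range(steps):
        out.append(two_adic_norm(a - target))
        a = a * a + c
    return out

print(distances(4, 2, 4))     # starting norm < 1: attracted to the fixed point 2
print(distances(3, -1, 3))    # starting norm = 1: attracted to the fixed point -1
```

The distances shrink strictly at every step, $\frac{1}{2}, \frac{1}{4}, \frac{1}{64}, \frac{1}{256}$ in the first case and $\frac{1}{4}, \frac{1}{8}, \frac{1}{16}$ in the second, even though the same iterates grow without bound in the usual absolute value.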


We now give a sufficient condition for $Q_c$ to have only 2-adic period two points.
Theorem 8. If $c = (m/n)$ with $m$ and $n$ both odd integers then $Q_c$, considered as a
function over the 2-adic numbers, has one period 2 orbit and has no other 2-adic
periodic points.


Proof: Let $m$ and $n$ be odd integers and assume that $Q_c(x) = x^2 + (m/n)$ has a
point of period 2. Then the equation

\[ \frac{Q_c(Q_c(x)) - x}{Q_c(x) - x} = x^2 + x + c + 1 = x^2 + x + \frac{m}{n} + 1 = 0 \]

must have a solution in the field of 2-adic numbers. Completing the square we
obtain

\[ \left(x + \frac{1}{2}\right)^2 = -\frac{3}{4} - \frac{m}{n} = \frac{-3n^2 - 4mn}{4n^2}, \]

implying that $Q_c$ has period 2 points if and only if $-3n^2 - 4mn$ is a square in
the 2-adic field. By Theorem 5, we need to show that $-3n^2 - 4mn \equiv 1 \bmod 8$. But
$m = 2s + 1$ and $n = 2r + 1$ are both odd and so

\[ -n(3n + 4m) = -(2r + 1)(6r + 8s + 7) \equiv -(2r + 1)(6r + 7) \]
\[ \equiv -r(12r + 12 + 8) - 7 \equiv -12r(r + 1) - 7 \equiv -7 \equiv 1 \pmod 8 \]

since $r(r + 1)$ is even. Thus, $Q_c$ has a 2-adic period 2 orbit.

Let $\beta$ denote one of the 2-adic period 2 points of $Q_c$. As was shown in Section
3, $\gamma = -1 - \beta$ is also a period 2 point of $Q_c$ and $c = -1 - \beta - \beta^2 = (m/n)$.
Note that

\[ |\beta|_2 |\gamma|_2 = |\beta(-1 - \beta)|_2 = |c + 1|_2 = \left|\frac{m + n}{n}\right|_2 < 1, \]

where the last inequality follows from the fact that $m + n$ is even. As in the proof
of Theorem 7 we may assume that $|\beta|_2 < 1$ and $|\gamma|_2 = 1$. We will show that no
2-adic numbers other than $\beta$ and $\gamma$ can be periodic points of $Q_c$.

We begin with a 2-adic number, $a$, with $|a|_2 < 1$ and $a \ne \beta$, so that
$|a - \beta|_2 \le \max(|a|_2, |\beta|_2) < 1$. Proceeding as in Theorem 7, one can show that

\[ |Q_c(a) - \gamma|_2 < |a - \beta|_2 < 1. \]

We may interpret the above inequality as saying that if $|a|_2 < 1$ then $Q_c(a)$ is
closer to $\gamma$ than $a$ is to $\beta$. In addition, $|\gamma|_2 = 1$ and $|Q_c(a) - \gamma|_2 < 1$ implies that
$|Q_c(a)|_2 = 1$. To continue the process, we need a similar computation for any
2-adic $a'$ having the properties that $|a'|_2 = 1$ and $a' \ne \gamma$. As above, one can show
that $Q_c(a')$ is closer to $\beta$ than $a'$ is to $\gamma$. Combining these results, we can conclude
that if $|a|_2 < 1$ then

\[ |a - \beta|_2 > |Q_c(Q_c(a)) - \beta|_2 > |Q_c(Q_c(Q_c(Q_c(a)))) - \beta|_2 > \cdots. \]

A similar inequality also holds in the case $|a|_2 = 1$. This shows that if $a$ is a 2-adic
number with $|a|_2 \le 1$ and $a \ne \beta$, $a \ne \gamma$ then $a$ cannot be a periodic point of $Q_c$.
The equations can also be interpreted as saying that the set of all 2-adic numbers $a$
with $|a|_2 \le 1$ is attracted to the period 2 orbit consisting of $\beta$ and $-1 - \beta$.
Finally, assume that $a$ is a 2-adic number with $|a|_2 > 1$. Then we find that

\[ |Q_c(a)|_2 = \left|a^2 + \frac{m}{n}\right|_2 = |a|_2^2 > |a|_2. \]




Now

\[ |a|_2 < |Q_c(a)|_2 < |Q_c(Q_c(a))|_2 < \cdots \]

shows that $a$ cannot be a periodic point in this case either. Thus the two period 2
points are the only 2-adic numbers that can be periodic points of $Q_c$. □


Corollary 7. If $Q_c(x) = x^2 + m/n$ where $m$ and $n$ are odd integers then $Q_c$ can
have no rational periodic points of any prime period other than 2.


Proof: The same observations as those in the proof of the corollary to Theorem 7
hold. □
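
The alternating attraction in the proof of Theorem 8 can be seen on a small rational example. Taking $c = -1$ (so $m = -1$, $n = 1$, both odd) gives the period two orbit $\{0, -1\}$, with $\beta = 0$ and $\gamma = -1$. The sketch below uses names of our own choosing (`two_adic_norm`, `alternating_distances`) and an arbitrary starting value $a = 2$.

```python
from fractions import Fraction

def two_adic_norm(x):
    """|x|_2 for a rational x."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, k = x.numerator, x.denominator, 0
    while num % 2 == 0:
        num //= 2
        k += 1
    while den % 2 == 0:
        den //= 2
        k -= 1
    return Fraction(1, 2) ** k

def alternating_distances(a, beta, gamma, c, steps):
    """Distances |a - beta|, |Q(a) - gamma|, |Q(Q(a)) - beta|, ... in the 2-adic norm."""
    targets = (beta, gamma)
    out = []
    for i in range(steps):
        out.append(two_adic_norm(a - targets[i % 2]))
        a = a * a + c
    return out

print(alternating_distances(2, 0, -1, -1, 5))
```

Starting from $a = 2$ the iterates are $2, 3, 8, 63, 3968$, and the distances $\frac{1}{2}, \frac{1}{4}, \frac{1}{8}, \frac{1}{64}, \frac{1}{128}$ decrease at each step, alternately measured to $\beta$ and to $\gamma$.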


There are many natural questions that arise from our results. While we cannot 
guarantee that these are open questions, we list below some questions whose 
answers are unknown to us. 


1. Are there any rational periodic orbits of a quadratic $Q_c$ of period greater
than 3? The results for periods 1, 2, and 3 would lead one to suspect that
there must be.

2. Over the complex numbers, the quadratic functions $Q_c$ have six period 3
points (except for $c = -\frac{7}{4}$). Theorem 3 shows that for certain rational
numbers $c$, three of these period 3 points are rational numbers. Are there
any cases when all six of the period 3 points are rational?

3. Are there any cases when $Q_c$ has a rational period three orbit and either
rational fixed points or rational period 2 points?


Note: After our paper was accepted for publication, we discovered three papers 
that contain some of our results. Corollary 5 follows from a more general result in 
[7], Theorem 5 appears in [8], and Theorem 3 and a negative answer to question 2 
above are given in [9]. 
Fréchet vs. Carathéodory


Ernesto Acosta G. and Cesar Delgado G. 


1. INTRODUCTION. When we read Kuhn's paper, The Derivative à la
Carathéodory [Kuh], we were very impressed by how Carathéodory's formulation of
the derivative simplifies the proofs of the basic differentiability theorems for
real-valued functions of one variable, in particular the proof of the Chain Rule. We saw
that the strength of Carathéodory's formulation relies on the concept of continuity
and that the proofs strongly use the properties of continuous functions. Another
advantage of this formulation is that it does not require the difference
quotient, which is the key to generalizing it to functions of several variables.

Let us recall the usual definition of differentiability. Let $f$ be a real-valued
function defined on $\mathbb{R}$ and $a$ be a real number. We say that $f$ is differentiable at $a$
if the limit

\[ \lim_{x \to a} \frac{f(x) - f(a)}{x - a} \]

exists. We can say this in another way. Let $\phi$ be defined by

\[ \phi(x) = \frac{f(x) - f(a)}{x - a}. \]

Then, $f$ is differentiable at $a$ if and only if $\phi$ has a removable discontinuity at $a$.
This is a motivation to state the following characterization of differentiability:

    $f$ is differentiable at $a$ if there exists a function $\phi$
    which is continuous at $a$ and such that

    \[ f(x) - f(a) = \phi(x)(x - a). \tag{1} \]

The latter is Carathéodory's characterization of differentiability [Car]. We will
extend it to functions $f: \mathbb{R}^n \to \mathbb{R}^m$ and explore some of the advantages it has over
Fréchet's characterization [Fre]. The reader can see without difficulty that all we
do in §2, §3, and §4 can be redone replacing $\mathbb{R}^n$ and $\mathbb{R}^m$ by two arbitrary Hilbert
spaces, and that Carathéodory's definition makes perfect sense in general linear
topological spaces [Aco, Del].


2. THE TWO FORMULATIONS. First we extend definition (1) to functions
$f: \mathbb{R}^n \to \mathbb{R}^m$ and second we show it is equivalent to the corresponding one in the
Fréchet sense.

If we read (1) thinking of $f$ as a function from $\mathbb{R}^n$ to $\mathbb{R}^m$ we have to think of
$x - a$ and $f(x) - f(a)$ as points (or vectors) in $\mathbb{R}^n$ and $\mathbb{R}^m$ respectively. The
question is: what is $\phi(x)$? When we multiply $x - a \in \mathbb{R}^n$ by $\phi(x)$ we get
$f(x) - f(a) \in \mathbb{R}^m$. Thus we can think of $\phi(x)$ as an $m \times n$ matrix, which gives us
a good interpretation of (1). So, we consider $\phi$ as a function on $\mathbb{R}^n$ taking values in
the space $M_{m \times n}$ of real matrices.
Now we can give the following definition: 


    Let $f: \mathbb{R}^n \to \mathbb{R}^m$ and $a \in \mathbb{R}^n$. We say that $f$ is
    differentiable at $a$ if there exists a function
    $\phi: \mathbb{R}^n \to M_{m \times n}$ which is continuous at $a$ and satisfies

    \[ f(x) - f(a) = \phi(x)(x - a). \tag{2} \]

We call such a function $\phi$ a slope function for $f$ at $a$.
With respect to the second, let us recall Fréchet's definition [Spi, page 14]:

    $f$ is differentiable at $a$ if there exists a linear
    transformation $A: \mathbb{R}^n \to \mathbb{R}^m$ such that

    \[ \lim_{x \to a} \frac{\|f(x) - f(a) - A(x - a)\|}{\|x - a\|} = 0. \tag{3} \]

If $f$ satisfies (2) we will say that $f$ is Carathéodory differentiable at $a$, and if it
satisfies (3) that $f$ is Fréchet differentiable at $a$. However, we have the following
theorem, which states the equivalence of (2) and (3).


Theorem 1. Every Fréchet differentiable function is Carathéodory differentiable and 
vice versa. 


Proof: We can see without difficulty that (2) implies (3). In fact, if we assume the
existence of $\phi$, we have

\[ \frac{\|f(x) - f(a) - \phi(a)(x - a)\|}{\|x - a\|} = \frac{\|(\phi(x) - \phi(a))(x - a)\|}{\|x - a\|} \le \|\phi(x) - \phi(a)\|. \]

Due to the continuity of $\phi$ at $a$ we get (3).
Let us show now that (3) implies (2). Assume that $A$ exists and define $\phi$ by¹

\[ \phi(x) = \begin{cases} \dfrac{1}{\|x - a\|^2} \{ (f(x) - f(a) - A(x - a)) \otimes (x - a) \} + A, & x \ne a, \\ A, & x = a. \end{cases} \]

We can see immediately from the definition of $\phi$ that $\phi(x)(x - a) = f(x) - f(a)$.
We have to prove the continuity of $\phi$ at $a$. But

\[ \|\phi(x) - \phi(a)\| \le \frac{\|f(x) - f(a) - A(x - a)\|}{\|x - a\|} \]

and since $f$ satisfies (3) we get (2).
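
The construction of the slope function from the Fréchet derivative can be tried on a concrete map. The sketch below is only an illustration under our own assumptions: the test map `f`, its Jacobian `Df`, and the helper names `matvec` and `slope` are hypothetical and do not come from the paper. It builds $\phi(x)$ from the tensor product formula and checks that $\phi(x)(x - a) = f(x) - f(a)$ and that $\phi(x)$ is close to $A$ near $a$.

```python
def f(p):
    """Hypothetical test map f: R^2 -> R^2."""
    x, y = p
    return [x * y, x + y * y]

def Df(p):
    """Jacobian matrix of f at p (the Frechet derivative A)."""
    x, y = p
    return [[y, x], [1.0, 2.0 * y]]

def matvec(M, v):
    """Matrix times vector."""
    return [sum(m * t for m, t in zip(row, v)) for row in M]

def slope(x, a):
    """phi(x) = (r (x) h) / |h|^2 + A, with h = x - a, r = f(x) - f(a) - A h."""
    A = Df(a)
    h = [xi - ai for xi, ai in zip(x, a)]
    n2 = sum(t * t for t in h)
    if n2 == 0.0:
        return A                      # phi(a) = A by definition
    Ah = matvec(A, h)
    r = [fx - fa - t for fx, fa, t in zip(f(x), f(a), Ah)]
    # the tensor product r (x) h is the matrix with entries r_i * h_j
    return [[A[i][j] + r[i] * h[j] / n2 for j in range(len(h))]
            for i in range(len(r))]

a = [1.0, 2.0]
x = [1.3, 1.6]
h = [x[0] - a[0], x[1] - a[1]]
lhs = matvec(slope(x, a), h)                    # phi(x)(x - a)
rhs = [u - v for u, v in zip(f(x), f(a))]       # f(x) - f(a)

near = slope([1.0 + 1e-6, 2.0 - 1e-6], a)       # phi at a point close to a
gap = max(abs(near[i][j] - Df(a)[i][j]) for i in range(2) for j in range(2))
```

Here `lhs` and `rhs` agree up to rounding, and `gap` is of the order of the distance to `a`, which is the continuity of $\phi$ at $a$ in numerical form.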


¹ $\otimes$ represents tensor product, i.e., if $u \in \mathbb{R}^m$ and $v, w \in \mathbb{R}^n$ then $u \otimes v \in M_{m \times n}$
is defined by $(u \otimes v)w = (v \cdot w)u$, where $v \cdot w$ is the inner product of $v$ and $w$.


3. A UNICITY THEOREM. We know that if $A$ exists in (3), it is unique [Spi,
Theorem 2-1]. $A$ is called the derivative of $f$ at $a$ and is denoted by $Df(a)$.
Unfortunately this uniqueness is not true for $\phi$ in (2), as we can see in the
following example. Let $f: \mathbb{R}^2 \to \mathbb{R}$ be defined by $f(x, y) = xy$, and pick a point
$(a, b) \in \mathbb{R}^2$. Then

\[ f(x, y) - f(a, b) = (b, x)(x - a, y - b) = (y, a)(x - a, y - b). \]

Therefore $\phi(x, y) = (b, x)$ and $\psi(x, y) = (y, a)$ are two different slope functions
for $f$ at $(a, b)$. However, observe that $\phi(a, b) = \psi(a, b)$ and we can ask if this is
true in general. The answer is yes!


Theorem 2. If $\phi$ and $\psi$ are two slope functions for $f$ at $a$ then $\phi(a) = \psi(a)$.

Proof: Assume that $\phi$ and $\psi$ are two slope functions for $f$ at $a$ and let
$\eta(x) = \phi(x) - \psi(x)$. Then

\[ \eta(x)(x - a) = 0, \]

and so

\[ \|\eta(a)(x - a)\| = \|(\eta(a) - \eta(x))(x - a)\| \le \|\eta(a) - \eta(x)\| \, \|x - a\|. \]

Therefore we have that

\[ \left\| \eta(a) \left( \frac{x - a}{\|x - a\|} \right) \right\| \le \|\eta(a) - \eta(x)\|. \]

Since $\eta$ is continuous at $a$, we conclude that $\eta(a) = 0$ and then that $\phi(a) = \psi(a)$.
We can then talk about the derivative of $f$ at $a$ following Carathéodory, which is
precisely $\phi(a)$. Evidently we have $\phi(a) = Df(a)$.
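
The example $f(x, y) = xy$ from the start of this section makes the statement easy to test. This is a small sketch with a hypothetical base point $(a, b)$ of our choosing. It confirms that both slope functions satisfy (2), that they differ away from $(a, b)$, and that they coincide at $(a, b)$ with the gradient $(b, a)$.

```python
def f(x, y):
    return x * y

a, b = 2.0, -3.0                  # hypothetical base point (a, b)

def phi(x, y):
    """First slope function of the example: the row vector (b, x)."""
    return (b, x)

def psi(x, y):
    """Second slope function of the example: the row vector (y, a)."""
    return (y, a)

def apply(row, x, y):
    """Row vector times the increment (x - a, y - b)."""
    return row[0] * (x - a) + row[1] * (y - b)

points = [(0.5, 1.0), (2.0, 7.0), (-1.25, -3.0)]
residuals = [
    (f(x, y) - f(a, b) - apply(phi(x, y), x, y),
     f(x, y) - f(a, b) - apply(psi(x, y), x, y))
    for x, y in points
]
```

All residuals vanish, while `phi(0.5, 1.0)` and `psi(0.5, 1.0)` are different row vectors, as Theorem 2 allows.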


4. BASIC DIFFERENTIATION THEOREMS IN $\mathbb{R}^n$. The three basic differentiation
theorems in $\mathbb{R}^n$ are essentially the one of linearity, the one of the chain rule
and the one of critical points. Here definition (2) starts showing its virtues.


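Theorem 3. If $f, g: \mathbb{R}^n \to \mathbb{R}^m$ are differentiable at $a \in \mathbb{R}^n$ then $\alpha f + \beta g$, $\alpha, \beta \in \mathbb{R}$,
is also differentiable at $a$ and

\[ D(\alpha f + \beta g)(a) = \alpha Df(a) + \beta Dg(a). \]


Proof: Let $f, g: \mathbb{R}^n \to \mathbb{R}^m$ be differentiable at $a \in \mathbb{R}^n$ and $\alpha, \beta \in \mathbb{R}$. Then

\[ (\alpha f + \beta g)(x) - (\alpha f + \beta g)(a) = \alpha (f(x) - f(a)) + \beta (g(x) - g(a)) = (\alpha\phi + \beta\psi)(x)(x - a), \]

where $\phi$ and $\psi$ are slope functions for $f$ and $g$ respectively. Since $\phi$ and $\psi$ are
continuous at $a$, $\alpha\phi + \beta\psi$ is also continuous at $a$. Therefore $\alpha f + \beta g$ is
differentiable at $a$ and

\[ D(\alpha f + \beta g)(a) = \alpha\phi(a) + \beta\psi(a) = \alpha Df(a) + \beta Dg(a). \]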
More impressive is the proof of the chain rule. 
Theorem 4. If $f$ is differentiable at $a$ and $g$ is differentiable at $f(a)$ then $g \circ f$ is
differentiable at $a$ and

\[ D(g \circ f)(a) = Dg(f(a)) \, Df(a). \]

Proof: Let $f: \mathbb{R}^n \to \mathbb{R}^m$ be differentiable at $a \in \mathbb{R}^n$ and let $g: \mathbb{R}^m \to \mathbb{R}^p$ be
differentiable at $f(a) \in \mathbb{R}^m$. Then

\[ g(f(x)) - g(f(a)) = \psi(f(x))(f(x) - f(a)) = \psi(f(x))\phi(x)(x - a), \]

where $\phi$ is a slope function for $f$ at $a$ and $\psi$ is a slope function for $g$ at $f(a)$.
Since $\phi$ and $f$ are continuous² at $a$ and $\psi$ is continuous at $f(a)$ we have that
$g \circ f$ is differentiable at $a$. Even more,

\[ D(g \circ f)(a) = \psi(f(a))\phi(a) = Dg(f(a)) \, Df(a). \]


To finish this paragraph let us prove the critical point theorem. 


Theorem 5. Let $f$ be a real-valued function defined on an open set $U \subset \mathbb{R}^n$. If $f$ is
differentiable at $x_0 \in U$ and $f(x_0)$ is an extreme value then $Df(x_0) = 0$.


Proof: Suppose that $x_0 \in U$ is such that $f(x_0) \le f(x)$ for all $x \in U$ and that $f$ is
differentiable at $x_0$. Then

\[ 0 \le f(x) - f(x_0) = \phi(x)(x - x_0) \tag{4} \]

for all $x \in U$, where $\phi$ is a slope function for $f$ at $x_0$. Let $h$ be a fixed point in $\mathbb{R}^n$,
and $\varepsilon$ small enough so that $x_0 + th \in U$ for all $t \in (-\varepsilon, \varepsilon)$. By (4) we have that

\[ t \, \phi(x_0 + th) h \ge 0 \]

for all $t \in (-\varepsilon, \varepsilon)$. Hence

\[ \phi(x_0 + th) h \ge 0, \quad t > 0, \]

and

\[ \phi(x_0 + th) h \le 0, \quad t < 0. \]

Since $\phi$ is continuous at $x_0$ we get $\phi(x_0) h = 0$. But $h$ was chosen arbitrarily, so
$\phi(x_0) = 0$, i.e., $Df(x_0) = 0$.

Observe that formally there is little difference between the proofs of Theorems
3, 4 and 5 and those in [Kuh]. Compare them also with the proofs of the same in
[Spi].

To finish this section we invite the reader to compute with this new
formulation the derivative of a constant function and the derivative of a linear function,
and to prove the Leibniz rule.


5. THE INVERSE FUNCTION THEOREM. We do not claim that our proof of the
inverse function theorem is easy, but we believe it is more direct than most
of the proofs we have seen (see for example [Spi], [Apo]), and that the
reader can follow it without getting lost in too many details.

We start by extending the concept of differentiability at a point to differentiability
in an open set. Let $U$ be an open set in $\mathbb{R}^n$ and $f: U \to \mathbb{R}^m$. We say that $f$ is
differentiable in $U$ if for each $y \in U$ there is a slope function $\phi_y$ for $f$ at $y$. In
such a case we write $\phi(x, y) = \phi_y(x)$. Observe that $\phi(x, x) = Df(x)$. In the
following we denote $Df(x)$ by $A_x$.


² Continuity of $f$ follows immediately from (2).

Let us define now continuous differentiability:

    $f$ is continuously differentiable in $U$ if there exists a
    continuous function $\psi: U \times U \to M_{m \times n}$ such that

    \[ f(x) - f(y) = \psi(x, y)(x - y). \tag{5} \]

Observe that our definition is not the same as the conventional one:

    $f$ is continuously differentiable in $U$ if $f$ is
    differentiable in $U$ and $Df$ is continuous in $U$. (6)


One sees clearly that (5) implies (6). Let us show that (6) implies (5). Suppose that
$f$ satisfies (6) and define $\psi: U \times U \to M_{m \times n}$ by

\[ \psi(x, y) = \begin{cases} \dfrac{1}{\|x - y\|^2} \{ (f(x) - f(y) - A_y(x - y)) \otimes (x - y) \} + A_y, & x \ne y, \\ A_y, & x = y. \end{cases} \]

It is clear that $\psi(x, y)(x - y) = f(x) - f(y)$ and that $\psi|_{U \times U - \Delta}$ and $\psi|_{\Delta}$ are
continuous, where $\Delta$ is the diagonal of $U \times U$. What is left to prove is that $\psi$ is
continuous on $\Delta$. If $a \in U$,

\[ \|\psi(x, y) - \psi(a, a)\| \le 2\|A_y - A_a\| + \frac{\|f(x) - f(y) - A_a(x - y)\|}{\|x - y\|} \le 2\|A_y - A_a\| + \|A_\xi - A_a\|, \]

where $\xi$ is in the segment $[x, y]$. The last inequality is obtained by the mean value
theorem. We get then, from the continuity of $A$, the continuity of $\psi$. Therefore $f$
satisfies (5).

With definition (5) we can now prove the inverse function theorem. 


Theorem 6. Let $f: U \to \mathbb{R}^n$ be defined in an open set $U \subset \mathbb{R}^n$. If $f$ is continuously
differentiable in $U$ and $\det \psi(a, a) \ne 0$ for some point $a \in U$, then there exist
neighborhoods $V$ of $a$ and $W$ of $f(a)$ such that $f^{-1}: W \to V$ exists, is continuously
differentiable and

\[ (Df^{-1})(f(a)) = [Df(a)]^{-1}. \]


Proof: We give the proof in three steps: (A) $f$ is injective in some neighborhood
of $a$, (B) $f$ is onto some neighborhood of $f(a)$, and (C) $f^{-1}$ is continuously
differentiable in some neighborhood of $f(a)$.

Let $\psi$ be as in (5) and assume that $\det \psi(a, a) \ne 0$, $a \in U$.

A) By continuity of $\psi$ there is a neighborhood $V$ of $a$ such that $\det \psi(x, y) \ne 0$,
$\psi(x, y)$ is bounded and $[\psi(x, y)]^{-1}$ is bounded for all $x, y \in V$. By (5) we
conclude immediately that $f$ is injective in $V$.

B)³ Let $B_a$ be an open ball centered at $a$ whose closure $\bar{B}_a$ is contained in $V$. By continuity of
$f$, $f(\partial B_a)$⁴ is compact. By A, $f(a) \notin f(\partial B_a)$. Let $d$ be the distance from $f(a)$ to
$f(\partial B_a)$ and let $B_d$ be the open ball centered at $f(a)$ with radius $d/2$. Let us fix
$y \in B_d$ and define the function $g$ on $\bar{B}_a$ by

\[ g(x) = \|y - f(x)\|^2. \]

Then $g$ reaches its minimum at some point $x_0 \in \bar{B}_a$ since it is continuous and $\bar{B}_a$
is compact. By the definition of $B_d$, $x_0$ is not in $\partial B_a$. Now, $g$ is differentiable at $x_0$
since

\[ g(x) - g(x_0) = \|y - f(x)\|^2 - \|y - f(x_0)\|^2 = -(2y - f(x) - f(x_0)) \, \psi(x, x_0)(x - x_0) \]

and $\phi(x) = -(2y - f(x) - f(x_0)) \, \psi(x, x_0)$ is continuous at $x_0$. By Theorem 5

\[ 0 = \phi(x_0) = -2(y - f(x_0)) \, \psi(x_0, x_0) \]

and therefore $y = f(x_0)$ (see A). Thus $f$ maps $B_a$ onto $B_d$.

C) Restrict $f$ to $W = f^{-1}(B_d) \cap B_a$. By A and B, $f: W \to B_d$ is bijective and
then we can define $f^{-1}: B_d \to W$. By (5), for $z, w \in B_d$ we have

\[ f^{-1}(z) - f^{-1}(w) = [\psi(f^{-1}(z), f^{-1}(w))]^{-1}(z - w). \tag{7} \]

Now, since $[\psi(f^{-1}(z), f^{-1}(w))]^{-1}$ is bounded (see A), $f^{-1}$ is continuous and
therefore continuously differentiable in $B_d$ by (7). Also by (7)

\[ (Df^{-1})(f(a)) = [Df(a)]^{-1}. \]

³ This step is taken from [Spi].
⁴ $\partial B_a$ denotes the border of $B_a$.


We invite the reader to compare Spivak’s proof of this theorem with our proof. 
One can see that the proof of step (B) is the same in both, but the proofs of parts 
(A) and (C), especially part (C), clearly show the advantages of Carathéodory’s 
formulation. 


CONCLUSIONS. From a pedagogical point of view Carathéodory's characterization
is interesting. In $\mathbb{R}$, it enhances the geometrical sense of the derivative as the
continuous approximation to the tangent line by means of secant lines to the graph
of a function, showing that continuity is an essential element for differentiability.
This formulation allows us to prove, with economy and elegance, the elementary
facts of differentiability, not only of real functions of a real variable, but also of
functions from $\mathbb{R}^n$ to $\mathbb{R}^m$, where the arguments are the same ones used for
functions of one variable.

$2^{859433} - 1$ is Prime

David Slowinski and Paul Gage have verified yet another Mersenne
prime. Using a Cray C916 at the Cray Research computing facility,
they verified that $2^{859433} - 1$ is prime. The run time on 16 processors
was about 30 minutes.

To verify that a random integer of this size is prime would be well
beyond the range of present computers (as well as those for years to
come). Mersenne primes, however, are relatively easy to test using the
Lucas-Lehmer test. To test $N = 2^p - 1$, start with 4 and repeatedly
square and subtract 2. Then $N$ is prime precisely when, after $p - 2$
repetitions, the result is 0 modulo $N$.
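
The test as described fits in a few lines. This is a minimal sketch for small exponents only, assuming $p$ is an odd prime. It uses Python's built-in big integers and makes no attempt at the fast multiplication needed for record-size exponents. The function name `lucas_lehmer` is ours.

```python
def lucas_lehmer(p):
    """Return True if N = 2**p - 1 is prime, for an odd prime exponent p."""
    n = (1 << p) - 1
    s = 4
    for _ in range(p - 2):
        s = (s * s - 2) % n      # square, subtract 2, reduce modulo N
    return s == 0

# Odd prime exponents up to 31 for which 2**p - 1 passes the test.
exponents = [p for p in (3, 5, 7, 11, 13, 17, 19, 23, 29, 31) if lucas_lehmer(p)]
print(exponents)
```

Reducing modulo $N$ after every squaring keeps the intermediate numbers below $N^2$, which is why the squaring step dominates the running time.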

Sounds easy. The problem is squaring (and reducing) numbers with
thousands of digits. Performing multiprecision multiplication quickly
requires some sophistication, and Slowinski and Gage use an
algorithm of Schönhage and Strassen, implemented using the Fast Fourier
Transform.

Why do they continue the search? Running this particular set of 
programs seems to be a good test of reliability for both hardware and 
software on new machines. In addition, of course, there is a 
fascination with finding the largest known prime—at least, largest for 
the moment. 

Slowinski and Gage comment that they have not checked all
exponents below 859,433. There may be other Mersenne primes to be
discovered. But chances are that people who have the time and
inclination (and a Cray) will forge on to higher values. After all, 
there’s nothing remarkable about finding the second largest known 
prime number. 

Odd Magic Powers


A. C. Thompson 


INTRODUCTION. By a magic square we understand a square array of numbers (a
square matrix) whose row sums, column sums and two diagonal sums are all equal.
The magic square

\[ \begin{pmatrix} 10 & 5 & 11 & 8 \\ 3 & 16 & 2 & 13 \\ 6 & 9 & 7 & 12 \\ 15 & 4 & 14 & 1 \end{pmatrix} \]

(which is obtained from the famous example in Dürer's Melancholia by interchanging
the first two rows and the first two columns) has the additional property that
all of its 8 diagonals (e.g. 5, 2, 12, 15; 11, 13, 6, 4; 10, 13, 7, 4) also have the same
sum. A magic square with this extra property is called a pandiagonal magic square.
Conventionally, additional restrictions are placed on the entries: (a) that they be
(non-negative) integers; and often, (b) that they be the consecutive integers from 1
to $n^2$ (when the array is $n \times n$). We shall not require either of these restrictions
and the numbers may be rational, real or complex.

The purpose of this note is to consider the multiplicative properties of sets of
magic squares (using the normal matrix multiplication). As van den Essen [3] has
shown, some of these turn out to be quite surprising. We extend his results by
showing that the product of any odd number of 3 × 3 magic squares is magic, that
if such a square is invertible, then the inverse is magic and that these results
extend to 4 × 4 and 5 × 5 pandiagonal magic squares. Because our method of
proof is somewhat different from that of [3] and for completeness, we present the
whole argument. As C. Small [2] points out, results of this type provide good
examples for a Linear Algebra class.

The first question of a multiplicative type is to ask whether a magic square is
necessarily invertible. Clearly the answer is no: take the trivial case with all
entries equal (even 0!). A less trivial one is the example above (the vector
$(3, -1, -3, 1)^T$ is in the null space).
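
Both claims about the 4 × 4 example are easy to confirm by machine. The following sketch uses helper names of our own (`line_sums`, `is_pandiagonal`). It collects the row sums, the column sums and all eight wrapped diagonals, and then applies the square to the vector $(3, -1, -3, 1)^T$.

```python
A = [[10,  5, 11,  8],
     [ 3, 16,  2, 13],
     [ 6,  9,  7, 12],
     [15,  4, 14,  1]]

def line_sums(M):
    """All row, column and (wrapped) diagonal sums of a square matrix."""
    n = len(M)
    rows = [sum(row) for row in M]
    cols = [sum(M[i][j] for i in range(n)) for j in range(n)]
    # diagonals parallel to the first diagonal, wrapping around the edges
    down = [sum(M[i][(i + k) % n] for i in range(n)) for k in range(n)]
    # diagonals parallel to the second diagonal, wrapping around the edges
    up = [sum(M[i][(k - i) % n] for i in range(n)) for k in range(n)]
    return rows + cols + down + up

def is_pandiagonal(M):
    """True if every row, column and wrapped diagonal has the same sum."""
    return len(set(line_sums(M))) == 1

v = [3, -1, -3, 1]
Av = [sum(A[i][j] * v[j] for j in range(4)) for i in range(4)]
print(is_pandiagonal(A), Av)
```

Since `Av` is the zero vector, the square is singular even though its entries are the integers 1 to 16.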

Our notation and terminology is standard except that we shall use first and
second diagonal to mean the main diagonal and the 'other' one respectively. Also,
we shall call a matrix with constant row sums an affine matrix and an affine matrix
with constant column sums will be called doubly affine. Since we often need
various permutation matrices, these will be defined by listing the columns as
elements of the usual basis $\{e_1, e_2, \ldots, e_n\}$ of $\mathbb{R}^n$. For example:

\[ P = [e_3, e_2, e_1] = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{pmatrix}. \]


Proposition 1. Both the set of $n \times n$ affine matrices and the set of $n \times n$ doubly
affine matrices are algebras.

Proof: It is clear that both sets are closed under addition and scalar multiplication.
Moreover, a matrix is affine if and only if the constant vector $(1, 1, \ldots, 1)^T$ is an
eigenvector with eigenvalue equal to the row sum $r$. It follows that the set of affine
matrices is closed under multiplication. The same fact for doubly affine matrices
follows by considering transposes.

Thus the crucial part of our problem is to consider the behaviour of diagonal
sums under multiplication.


1. THE 3 × 3 CASE. This has been discussed (in an entirely numerical way) by
J. M. H. Peters in [1].


Proposition 2. If $A$ is a 3 × 3 invertible magic square, then $A^{-1}$ is magic.


Proof: Let $r$ be the row sum of $A$. Then $r$ is an eigenvalue of $A$ (and of $A^T$) and
$\operatorname{tr} A = r$. Hence, if $\lambda_1$ and $\lambda_2$ are the remaining eigenvalues we have $\lambda_1 + \lambda_2 = 0$.
Since $A$ is invertible, $\lambda_1 = -\lambda_2 \ne 0$. Hence $\lambda_1^{-1} = -\lambda_2^{-1}$ and
$\operatorname{tr} A^{-1} = r^{-1} + \lambda_1^{-1} + \lambda_2^{-1} = r^{-1}$ = row sum of $A^{-1}$.

For the second diagonal we consider $B = PA$ where $P = [e_3, e_2, e_1]$. Clearly $B$
is also magic with the same row sum $r$. Therefore, the same argument shows that
$\operatorname{tr}(B^{-1}) = r^{-1}$. However, $B^{-1} = A^{-1}P^{-1} = A^{-1}P$ and hence $\operatorname{tr}(B^{-1})$ is the second
diagonal sum of $A^{-1}$.


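The 3 × 3 magic squares form a 3-dimensional space with basis $\{E_1, E_2, E_3\}$
where

\[ E_1 = \begin{pmatrix} 0 & 1 & -1 \\ -1 & 0 & 1 \\ 1 & -1 & 0 \end{pmatrix}, \quad E_2 = \begin{pmatrix} 1 & -1 & 0 \\ -1 & 0 & 1 \\ 0 & 1 & -1 \end{pmatrix}, \quad E_3 = \begin{pmatrix} 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \end{pmatrix}. \]

The larger algebra of 3 × 3 doubly affine matrices is 5-dimensional (now the
two diagonal sums can vary) and has a basis consisting of $E_1$, $E_2$, $E_3$, $I = [e_1, e_2, e_3]$
and $P = [e_3, e_2, e_1]$.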


Theorem 1. The product of an odd number of 3 × 3 magic squares is again a magic
square.


Proof: First look at products of pairs from the basis $\{E_1, E_2, E_3\}$. We get
$E_1E_3 = E_3E_1 = E_2E_3 = E_3E_2 = 0$; $E_1E_2 = -E_2E_1 = 3P - E_3$; $E_1^2 = -E_2^2 = E_3 - 3I$
and $E_3^2 = 3E_3$. Hence the product of any two 3 × 3 magic squares is in the
subspace of the algebra of doubly affine matrices spanned by $I$, $P$, and $E_3$. But this
subspace is a subalgebra. Hence the product of any even number of magic squares is in
this subalgebra. Finally, since the product of any magic square with each of $E_3$, $I$,
and $P$ is magic, we have the result.


Corollary. If $A$ is a 3 × 3 magic square then $A^k$ is magic for every odd positive
integer $k$; and for every odd integer $k$ if $A$ is invertible.
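
A quick experiment illustrates both halves of the statement: odd products stay magic, even ones need not. The sketch below uses the classical Lo Shu square and one of its reflections as sample inputs, which are our choice, together with the basis element $E_2$. The helper names `matmul` and `is_magic` are ours as well.

```python
def matmul(X, Y):
    """Ordinary matrix product of two square matrices of the same size."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def is_magic(M):
    """True if all row sums, column sums and both diagonal sums agree."""
    n = len(M)
    sums = [sum(row) for row in M]
    sums += [sum(M[i][j] for i in range(n)) for j in range(n)]
    sums.append(sum(M[i][i] for i in range(n)))           # first diagonal
    sums.append(sum(M[i][n - 1 - i] for i in range(n)))   # second diagonal
    return len(set(sums)) == 1

A = [[2, 7, 6], [9, 5, 1], [4, 3, 8]]        # Lo Shu square, row sum 15
B = [[4, 9, 2], [3, 5, 7], [8, 1, 6]]        # a reflection of it
C = [[1, -1, 0], [-1, 0, 1], [0, 1, -1]]     # the basis element E_2, row sum 0

A2 = matmul(A, A)                 # even product
A3 = matmul(A2, A)                # odd product
AB = matmul(A, B)                 # even product
ABC = matmul(AB, C)               # odd product
print(is_magic(A2), is_magic(A3), is_magic(AB), is_magic(ABC))
```

For these inputs the two even products fail the diagonal test, while $A^3$ and $ABC$ pass it.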


2. THE 4 × 4 CASE. As in the previous case the fact that the trace is equal to one
of the eigenvalues means that the sum of the other three is 0. This can happen in
three ways.


(i) The remaining eigenvalues are $0, \pm\lambda$. It is an interesting exercise to show
that this is so for all 4 × 4 pandiagonal magic squares. Thus none of these
matrices is invertible. It is, however, true that a product of an odd number
of 4 × 4 pandiagonal magic squares is pandiagonal magic. The proof of this
is similar to, but less natural than, the 5 × 5 case. For reasons of space we
omit it.

(ii) The remaining eigenvalues are $\lambda, \omega\lambda, \omega^2\lambda$ (where $\omega$ is a complex cube root
of unity). An example of such a magic square is

\[ A = [e_1, e_4, e_2, e_3] \]

for which $A^2 = A^{-1}$ and is magic. So it is possible for a product of an even
number of magic squares to be magic.

(iii) The remaining eigenvalues are not symmetric with respect to the origin. An
example is

\[ \begin{pmatrix} 10 & 2 & 14 & 18 \\ 17 & 15 & 1 & 11 \\ 13 & 21 & 7 & 3 \\ 4 & 6 & 22 & 12 \end{pmatrix}. \]


3. THE 5 × 5 CASE. The previous cases show that magic squares with nice
multiplicative properties have their eigenvalues (other than the row sum $r$)
symmetrically placed with respect to the origin. To achieve this when $n$ is odd we
need the characteristic polynomial to factor into $(x - r)p(x)$ where $p(x)$ is even.
It appears that, as $n$ increases, increasingly many extra conditions are needed to
guarantee such a factorization.

The invertible magic square

\[ \begin{pmatrix} 23 & 1 & 2 & 20 & 19 \\ 22 & 16 & 9 & 14 & 4 \\ 5 & 11 & 13 & 15 & 21 \\ 8 & 12 & 17 & 10 & 18 \\ 7 & 25 & 24 & 6 & 3 \end{pmatrix} \]

is an example of one whose inverse is not magic. It is also not pandiagonal. We will
see below that the extra pandiagonal condition is the "right" one to impose on
5 × 5 magic squares to recover the results of the 3 × 3 case.

Another linear algebra exercise is to show that the 5 × 5 pandiagonal magic
squares form a 9-dimensional vector space. To describe a basis for this space we
consider the following 10 matrices:

\[ P_1 = [e_1, e_4, e_2, e_5, e_3], \quad P_2 = [e_4, e_2, e_5, e_3, e_1], \quad \ldots, \quad P_5 = [e_3, e_1, e_4, e_2, e_5], \]
\[ Q_1 = [e_3, e_5, e_2, e_4, e_1], \quad Q_2 = [e_5, e_2, e_4, e_1, e_3], \quad \ldots, \quad Q_5 = [e_1, e_3, e_5, e_2, e_4], \]

where, in each line, the columns are permuted cyclically. Each of these matrices is
pandiagonal and, because $\sum_{i=1}^5 P_i = \sum_{i=1}^5 Q_i$ and this is the only non-trivial linear
condition they satisfy, their span is 9-dimensional. Thus every pandiagonal 5 × 5
magic square is of the form

\[ A = \sum_{i=1}^{5} \alpha_i P_i + \beta_i Q_i. \]


Theorem 2. The product of an odd number of 5 × 5 pandiagonal magic squares is
again a pandiagonal magic square.

Proof: Consider the algebra $\mathcal{D}$ spanned by $R$ and $S$ where

\[ R = [e_5, e_4, e_3, e_2, e_1] \quad \text{and} \quad S = [e_2, e_3, e_4, e_5, e_1]. \]

This algebra is also 9-dimensional, being the linear span of

\[ S, S^2, S^3, S^4, S^5 = I; \quad RS, RS^2, RS^3, RS^4, RS^5 = R \]

(again $\sum_{i=1}^5 S^i = \sum_{i=1}^5 RS^i$). (These 10 matrices themselves form a group isomorphic
to the symmetry group of a regular pentagon.)

Now it is an elementary matter to check that, for $i, j = 1, 2, \ldots, 5$, we have both
$P_iQ_j = S^k$ and $Q_jP_i = S^{k'}$ for some values of $k$ and $k'$ depending on $i$ and $j$.
Similarly, $P_iP_j = RS^k$ and $Q_iQ_j = RS^{k'}$. Thus if $A_1$ and $A_2$ are two pandiagonal
magic squares then $A_1A_2$ is in $\mathcal{D}$. Hence every product of an even number of such
squares is in $\mathcal{D}$. Multiplying a matrix on the right by a power of $S$ permutes the
columns cyclically and a similar multiplication by $R$ reverses the order of the
columns. Clearly the set $\{P_i, Q_i : i = 1, 2, \ldots, 5\}$ is invariant under these actions.
Therefore, if $A$ is a 5 × 5 pandiagonal magic square so is $AB$ for every $B$ in $\mathcal{D}$;
from which the result follows.
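
The ten basis matrices and the theorem can be exercised directly. The sketch below builds $P_1, \ldots, P_5$ and $Q_1, \ldots, Q_5$ by cyclically permuting the column lists of $P_1$ and $Q_1$. It then forms three squares with arbitrary integer coefficients, which are hypothetical and chosen only for the test, and checks their product. All helper names are ours.

```python
def perm_matrix(cols):
    """Matrix [e_c1, ..., e_cn]: column j is the basis vector e_{cols[j]}."""
    n = len(cols)
    M = [[0] * n for _ in range(n)]
    for j, c in enumerate(cols):
        M[c - 1][j] = 1
    return M

def rotate(cols, k):
    """Permute a column list cyclically by k places."""
    return cols[k:] + cols[:k]

P = [perm_matrix(rotate([1, 4, 2, 5, 3], k)) for k in range(5)]
Q = [perm_matrix(rotate([3, 5, 2, 4, 1], k)) for k in range(5)]

def combo(alpha, beta):
    """The square sum_i (alpha_i P_i + beta_i Q_i)."""
    return [[sum(alpha[i] * P[i][r][c] + beta[i] * Q[i][r][c] for i in range(5))
             for c in range(5)] for r in range(5)]

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def is_pandiagonal(M):
    """True if all rows, columns and wrapped diagonals have the same sum."""
    n = len(M)
    sums = [sum(row) for row in M]
    sums += [sum(M[i][j] for i in range(n)) for j in range(n)]
    sums += [sum(M[i][(i + k) % n] for i in range(n)) for k in range(n)]
    sums += [sum(M[i][(k - i) % n] for i in range(n)) for k in range(n)]
    return len(set(sums)) == 1

A1 = combo([1, 2, 3, 4, 5], [0, 1, 0, 2, 0])
A2 = combo([2, 0, 1, 0, 3], [1, 1, 4, 0, 2])
A3 = combo([0, 3, 1, 1, 0], [5, 0, 2, 1, 1])

triple = matmul(matmul(A1, A2), A3)     # odd product
pair = matmul(P[0], Q[0])               # even product: a power of S
print(is_pandiagonal(triple), is_pandiagonal(pair))
```

The product $P_1Q_1$ is the cyclic shift $S$, which has all of its nonzero entries on a single wrapped diagonal and so is not pandiagonal, while the triple product is.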


Corollary 1. If $A$ is a 5 × 5 pandiagonal magic square, so is $A^k$ for every odd
positive integer $k$.


Corollary 2. If $A$ is a 5 × 5 pandiagonal magic square and if $\lambda_1, \lambda_2, \lambda_3$ and $\lambda_4$
denote its eigenvalues other than the row sum $r$, then (suitably numbered) $\lambda_1 = -\lambda_2$
and $\lambda_3 = -\lambda_4$.


Proof: It follows immediately from Corollary 1 that $\operatorname{tr} A^k = r^k$ and hence that
$\sum_{i=1}^4 \lambda_i^k = 0$ for every odd positive integer $k$. If we take $k = 1$ and $k = 3$, and use
elementary algebra, we get the result.


Corollary 3. If $A$ is an invertible 5 × 5 pandiagonal magic square so is $A^{-1}$.


Proof: The proof follows from Corollary 2 exactly as in the proof of Proposition 2,
but using the permutation matrices $R$ and $S$ in place of $P$.


Note. This is a revised version of a paper originally submitted in October 1988. 

Mathematicians, Including
Undergraduates, Look at Soap Bubbles


Frank Morgan 


Why are soap bubbles so beautifully round? A soap bubble tries to minimize
surface energy or area, and the round sphere has the least surface area for the
fixed volume of air trapped inside. See FIGURE 1. A soap bubble very quickly
succeeds in finding this mathematically optimal shape. Thus the underlying
principle is area minimization.


Figure 1. A spherical soap bubble has found the least-area way to enclose a given volume of air. Jim 
Bredt [M4]. 


Similarly bubble clusters try to minimize the total surface area enclosing and 
separating several volumes. Whether the number of enclosed volumes is one or 
one thousand, it is the same principle of area minimization at work. See FIGURES 
2, 3. This principle alone has sufficed to produce computer simulations of bubble 
clusters, as in the frame in FIGURE 4 from the video “Computing soap films and
crystals” by the Minimal Surface Team at the Geometry Center (formerly the 
Minnesota Geometry Supercomputer Project). 

Do soap bubble clusters always find the absolute least-area shape? Not always: 
FIGURE 5 illustrates two clusters enclosing and separating the same five volumes.
In the first, the tiny fifth volume is comfortably nestled deep in the crevice 
between the largest bubbles. In the second, the tiny fifth volume less comfortably 
sits between the medium size bubbles. The first cluster has less surface area, 
although I do not know for sure that there is not a third possibility of still less area. 

Figure 2a and 2b. Double and triple bubbles seek the least-area way to enclose and separate two or
three given volumes of air.


As a matter of fact, it is an open question whether the standard triple bubble of 
FIGURE 2 is the least-area way to enclose three given volumes. Even for the double 
bubble, the proof seems incomplete, as realized over the course of a recent 
undergraduate thesis [F1] by Joel Foisy, Williams ’91. 

In fact, for area-minimizing clusters, it is an open question whether each 
separate region is connected, or whether it might conceivably help to subdivide the 
regions of prescribed volume, with perhaps half the volume nestled in one crevice 
here, and the other half in another crevice there. 

Similarly, it is an open question whether an area-minimizing cluster may 
incidentally trap inside “empty chambers,” which do not contribute to the pre- 
scribed volumes. FIGURE 6 shows a 12-bubble with a dodecahedral-like empty 
chamber on the inside, obtained by Tyler Jarvis of Princeton University using a 
program of Ken Brakke [B] of the Geometry Center. The computation postulated 
the empty chamber; without such a restriction, empty chambers probably never 
occur. 

Figure 3. Configurations of thousands of bubbles are governed by the same principle of area
minimization. Photograph from Science ’84 [S] courtesy of Gordon Graham/Prism.


Figure 4. A computer simulation of a bubble cluster. “Computing soap films and crystals” video by the 
Minimal Surface Team, The Geometry Center. Photo provided by John Sullivan. 

Figure 5. Soap bubble clusters are sometimes only relative minima for area. These two clusters enclose
and separate the same five volumes, but the first has less surface area than the second.

Figure 6. It is an open question whether area-minimizing clusters may have empty chambers, such as
the dodecahedral-like chamber at the center of this 12-bubble. Tyler Jarvis, Princeton University.

You must be wondering what is known! It is known that soap bubble clusters
consist of constant-mean-curvature surfaces (such as pieces of spheres) meeting in
threes at 120° angles along seams, which in turn meet in fours at about 109°
angles at points. Four such seams and two such points are visible already in the
triple bubble of FIGURE 2. Nothing more complicated ever happens, even in such
complicated clusters as that of FIGURE 3.

These laws were observed and recorded by the Belgian physicist J. A. F. Plateau 
[P] over a century ago, but it was over 100 years until a complete explanation was 
proved by Jean Taylor, now a professor at Rutgers University. Her demonstration 
required no physics or chemistry, just a single mathematical hypothesis: area-mini- 
mization. Many pages of complicated mathematics later came the conclusion: 
Plateau’s laws, 120° angles, 109° angles, and all. 


Figure 7a. Comparison of candidates for the least-perimeter way to enclose and separate n = 5, 6, 7
unit planar areas. Note that only sometimes are the most symmetric candidates the winners.

    Candidate   Perimeter p   Area a   p²/na
    5_1         9.33          2.5      6.96
    5_2         9.3           2.15     8.05
    6_1         9.99          2.47     6.73
    6_2         9.64          2.27     6.82
    7_1         11.21         2.77     6.48
    7_2         11.0          2.59     6.67

Taylor’s work used geometric measure theory, a relatively new kind of geometry
that allows singularities such as the seams in soap bubble clusters (cf. [M2]).
Classically, geometry studied mainly smooth surfaces.

Although Taylor’s mathematical proof [T] is hard to read, a beautiful expository 
account [AT] appeared in Scientific American in July, 1976. 

Joel Foisy, whose undergraduate thesis work on the double bubble I mentioned 
earlier, got started the previous summer with an undergraduate research Geometry 
Group at Williams studying the 


Planar soap bubble problem. Find the least-perimeter way to enclose and separate n 
given areas (say unit areas) in the plane. 


FIGURES 7a and 7b show the results of their experiments with soap bubbles between
plexiglass plates [A]. Since it is hard to keep all the areas exactly 1, it is better to
minimize the normalized quantity p²/na, where p is the total perimeter (counting
all inside and outside walls once) and a is the total area. Could you have guessed
the winners? I find it hard to see any pattern.
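
The reason for normalizing is that p²/na does not change when the whole picture is rescaled: enlarging all lengths by a factor s multiplies p by s and a by s². A minimal sketch, using the perimeter and area values reported in Figure 7a for the two five-bubble candidates (the function name `normalized_cost` and the scale factor are ours):

```python
def normalized_cost(p, a, n):
    """Scale-invariant quantity p^2 / (n a) for n bubbles,
    total perimeter p and total area a."""
    return p * p / (n * a)

# Measurements for the two n = 5 candidates, as reported in Figure 7a.
first = normalized_cost(9.33, 2.5, 5)
second = normalized_cost(9.3, 2.15, 5)

# Rescaling all lengths by s multiplies p by s and a by s^2.
s = 3.7
rescaled = normalized_cost(s * 9.33, s * s * 2.5, 5)

print(round(first, 2), round(second, 2))
```

The second candidate has the shorter measured perimeter but also encloses less area, so after normalization it is the first candidate that wins.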

There was also one notable theoretical result: 


Figure 7b. Comparison of four candidates for the least-perimeter way to enclose and separate eight
unit areas. For the fourth candidate, the eight enclosed areas incidentally surround and enclose an
extraneous “empty chamber.”

    Candidate   Perimeter p   Area a   p²/na
    8_1         12.01         2.85     6.33
    8_2         12.45         2.97     6.52
    8_3         12.57         3.03     6.52
    8_4         14.69         3.29     8.2

81 Stanton Road
Brookline, MA 02146
January 2, 1983


Professor Frank Morgan
Rice University
Department of Mathematics
Houston, Texas 77001


Dear Professor Morgan:


Thank you again for sending me the 18.01-18.02 MIT Calculus
materials. I recently took my first 18.02 test and made no errors
on it.


I believe that I have discovered a solution to the soap film
problem which was the topic of your speech at the 1982
Massachusetts State Science Fair, namely, find a simple, smooth,
closed curve which can bound infinitely many minimal surfaces.


Start with the shape in Figure 1, a shape that has two minimal
surfaces,

[Figure 1: drawing not reproduced.]

and elongate it at points A, B, C, and D to create the shape in

[Figure 2: drawing not reproduced.]

and coat both of the two minimal surfaces with bubble suds.
Introduce crosspieces AB, BC, CD, and DA and coat the resultant
square with bubble suds too. By popping the proper bubble films
you will get the configuration portrayed in Figure 3,

[Figure 3: drawing not reproduced.]

Remove the crosspieces with care so as to not pop any more of the
bubble films, and the shape will settle into the saddle of Figure
4.

[Figure 4: drawing not reproduced.]

This saddle can be shifted along the entire length of the shape,
creating infinitely many minimal surfaces.


I built a model of Figure 4 and found that the saddle could
be slid along the length of the figure by tilting one end towards
the floor. (The weight of the bubble suds made it slide - I
wonder what would happen in a weightless condition.)


I have been unable to form a minimal surface on a mobius strip -
I think that this happens because there is only one edge or surface
for the bubble suds to adhere to. (Am I right?)


Sincerely yours,


Mark Kantrowitz


Figure 8. As a high-school sophomore, Mark Kantrowitz addresses an open question on minimal
surfaces.
Theorem ([A], [F1], [F2]). The standard double bubble is uniquely perimeter minimizing. 


The hard part is showing that both areas are connected and that there are no 
empty chambers. For more bubbles or in space, such questions remain open. 


Undergraduate research. Undergraduates do mathematics, prove theorems, write 
papers for publication, and give talks at mathematics meetings. In the Williams 
College SMALL undergraduate research project, each student belongs to two of 
seven research groups, each with a student leader and a faculty advisor. 

The student groups work largely independently, although the faculty advisor 
usually provides the problem and some guidance. The students generally are quite 
busy and do not like to be interrupted by the faculty! The less experienced 
students learn from the others, and the most advanced like to bounce ideas off the 
rest. 

One woman, who had no intentions of attending graduate school, changed her 
mind during the summer project and is now a graduate student in computer 
science at Wisconsin. A nonmajor, who used to consider mathematics just an 
avocation, is now a graduate student in Mathematics at UC Berkeley. A Hispanic 
student, who never seemed to settle down in his coursework, succeeded outstand- 
ingly in the SMALL project his freshman year. In his junior year he wrote an 
Honors thesis, which has been accepted for publication in the Pacific Journal of 
Mathematics. He now has two jobs, teaching mathematics in his old high school 
and at a community college, including one course taught in Spanish. 

The name SMALL, in case you were wondering, is an acronym for the faculty 
on the original proposal. 

It is probably good to start research early on. One time I got a letter from Mark 
Kantrowitz, a high-school sophomore, that announced, “I believe that I have 
discovered a solution to the [open] soap film problem...: Find a simple, smooth, 
closed curve which can bound infinitely many minimal surfaces.” See Figure 8. 
Although Mark’s stated proposal used gravity, the basic idea was one I had used in 
at least one published example. Mark’s continuing correspondence amazed me. My 
collaborators and I barely managed to keep ahead of him. The problem is still 
open. 
Added in proof. There has been progress on the planar triple bubble by the 1992 
Williams NSF SMALL undergraduate research Geometry Group, Chris Cox, 
group leader, Lisa Harrison, Michael Hutchings, Susan Kim, Janette Light, 
Andrew Mauer, Meg Tilton, “The shortest enclosure of three connected areas in 
$\mathbb{R}^2$” (preprint), and on the double bubble in space by Michael Hutchings. See the 
new chapter on “Soap bubble clusters,” in the second edition of [M2], to appear 
this year. 


Department of Mathematics 


Williams College 


Williamstown, MA 01267 
Frank.Morgan@williams.edu 
“The advantage is that mathematics is a 
field in which one’s blunders tend to show 
very clearly and can be corrected or erased 
with a stroke of the pencil. It is a field 
which has often been compared with chess, 
but differs from the latter in that it is only 
one’s best moments that count and not 
one’s worst. A single inattention may lose 


a chess game, whereas a single successful 
approach to a problem, among many which 
have been relegated to the wastebasket, 
will make a mathematician’s reputation.” 


Excerpt from Ex-Prodigy: 
My Childhood and Youth 
by Norbert Wiener, p. 21 
NOTES 


Edited by: John Duncan 


A Proof of Dilworth’s Chain 
Decomposition Theorem 


Fred Galvin 


Let P be a finite partially ordered set: a chain (antichain) in P is a set of pairwise 
comparable (incomparable) elements; the width of P is the maximum cardinality 
of an antichain in P. According to a celebrated theorem of Dilworth [2], the width 
of P is also equal to the minimum number of chains needed to cover P. The wider 
combinatorial significance of Dilworth’s theorem, especially as regards matching 
theory, is discussed by Bogart, Greene, and Kung [1], Mirsky [3], and Reichmeider 
[5]. Bogart, Greene, and Kung survey various proofs of Dilworth’s theorem; the 
proofs of Perles [4] and Tverberg [6] are especially simple and elegant. The proof 
given here seems to me to be as simple as any. This proof is probably well-known 
folklore; still, as far as I know, it has never appeared in print. 


Theorem. A finite partially ordered set P is the union of m chains, where m is the 
width of P. 


Proof: We use induction on the cardinality of P. Let a be a maximal element of 
P, and let $P' = P \setminus \{a\}$ have width n. Then $P'$ is the union of n disjoint chains 
$C_1, \ldots, C_n$. We have to show that P either contains an $(n + 1)$-element antichain, 
or else is the union of n chains. Now, every n-element antichain in $P'$ consists of 
one element from each $C_i$. Let $a_i = \max\{x \in C_i : x$ belongs to some n-element 
antichain in $P'\}$. It is easy to see that $A = \{a_1, \ldots, a_n\}$ is an antichain. If $A \cup \{a\}$ 
is an antichain, we are done. Otherwise, we have $a > a_i$ for some i. Then $K = 
\{a\} \cup \{x \in C_i : x \le a_i\}$ is a chain, and there are no n-element antichains in $P \setminus K$, 
whence $P \setminus K$ is the union of $n - 1$ chains. 
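
The induction in this proof can be read as a recursive procedure. The following is a minimal Python sketch of that reading, not code from the note: the function name `chain_cover`, the comparison callback `leq`, and the divisibility example are illustrative assumptions, the elements are assumed distinct, and the search over antichains is brute force, so it is only suitable for very small posets.

```python
from itertools import product

def chain_cover(elements, leq):
    """Split a finite poset into width-many chains, following the induction above.

    `leq(x, y)` must return True exactly when x <= y in the partial order.
    """
    elements = list(elements)
    if not elements:
        return []
    # Pick a maximal element a: no other element lies above it.
    a = next(x for x in elements
             if not any(y != x and leq(x, y) for y in elements))
    rest = [x for x in elements if x != a]
    chains = chain_cover(rest, leq)        # n disjoint chains covering P' = P \ {a}
    # a_i = largest element of C_i that belongs to some n-element antichain of P'.
    tops = [None] * len(chains)
    for pick in product(*chains):          # one candidate element from each chain
        if all(not leq(x, y) for x in pick for y in pick if x != y):
            for i, x in enumerate(pick):
                if tops[i] is None or leq(tops[i], x):
                    tops[i] = x
    for i, a_i in enumerate(tops):
        if leq(a_i, a):                    # a > a_i: peel off the chain K
            K = [x for x in chains[i] if leq(x, a_i)] + [a]
            remaining = [x for x in elements if x not in K]
            return chain_cover(remaining, leq) + [K]
    return chains + [[a]]                  # A together with a is an antichain


def divides(x, y):
    """Divisibility order on positive integers (illustrative choice)."""
    return y % x == 0

cover = chain_cover([1, 2, 3, 5, 6, 10, 15, 30], divides)
print(len(cover), cover)
```

The divisors of 30 form a Boolean lattice of width 3, so the sketch should report three chains; which three chains it finds depends on the order in which maximal elements are picked.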
On Intervals, Transitivity = Chaos. 


Michel Vellekoop and Raoul Berglund 


In an earlier article in the American Mathematical Monthly a redundancy was 
found in the definition of chaos by Devaney [1]: 
Let V be a set. A continuous map $f: V \to V$ is said to be chaotic on V if 


(1) f is topologically transitive: for any pair of open non-empty sets $U, W \subset V$ 
there exists a $k > 0$ such that $f^k(U) \cap W \neq \emptyset$. 

(2) the periodic points of f are dense in V. 

(3) f has sensitive dependence on initial conditions: there exists a $\delta > 0$ such 
that, for any $x \in V$ and any neighbourhood N of x, there exists a $y \in N$ 
and an $n \ge 0$ such that $|f^n(x) - f^n(y)| > \delta$. 


In [2], Banks et al. prove that (1) and (2) imply (3) in any metric space V, and in 
[3] Assaf IV and Gadbois show that for general maps this is the only redundancy: 
(1) and (3) do not imply (2), and (2) and (3) do not imply (1). But if we restrict our 
attention to maps on an interval a stronger result can be obtained: 


Proposition. Let I be a, not necessarily finite, interval and $f: I \to I$ a continuous 
and topologically transitive map. Then (1) the periodic points of f are dense in I and 
(2) f has sensitive dependence on initial conditions. 


The first result (1) can be found in [4] (Chapter IV.5, Lemma 41) but the proof 
uses a lot of other highly non-trivial results. Since Devaney’s text is being used by 
so many students, we think that it is interesting to give a very short, intuitive proof 
of this proposition. 

We will need the following lemma, which can be found in [4] (Chapter IV.1, 
Corollary 10) in a more general form: 


Lemma. Suppose that I is a, not necessarily finite, interval and $f: I \to I$ is a 
continuous map. If $J \subset I$ is an interval which contains no periodic points of f 
and $z$, $f^m(z)$ and $f^n(z) \in J$ with $0 < m < n$, then either $z < f^m(z) < f^n(z)$ or 
$z > f^m(z) > f^n(z)$. 


Proof of the lemma: Suppose we can find such a $z \in J$ with $z < f^m(z)$ and 
$f^m(z) > f^n(z)$. Define the function $g(x) = f^m(x)$. Then we know that $z < g(z)$ 
and this implies $z < g(z) < g^{k+1}(z)$ for all natural numbers $k \ge 1$ by induction. 
Because, if $g^{k+1}(z) < g(z)$ for a certain k then the function $g^k(x) - x$ has a 
positive value in z and a negative value in $g(z)$ and this would mean, by the 
Intermediate Value Theorem, that there exists a point $c \in \,]z, g(z)[\, \subset J$ with 
$g^k(c) - c = 0$, giving a km-periodic point of f in J. Thus $z < g^k(z)$ for all 
positive k so in particular for $k = n - m > 0$, giving $z < f^{(n-m)m}(z)$. Since we 
assumed that $f^{n-m}(f^m(z)) < f^m(z)$ we could prove analogously, taking $g = f^{n-m}$, 
that $f^{(n-m)m}(f^m(z)) < f^m(z)$. But then we have that the function $f^{(n-m)m}(x) - x$ 
has a positive value in z and a negative value in $f^m(z)$, giving an $(n - m)m$-periodic 
point in J and thus a contradiction. The other case can be proven analogously. □ 


Proof of the Proposition: Suppose that f is continuous and topologically transitive. 
Because of the result in [2] we only need to prove that the periodic points are 
dense in I. Suppose that this is not the case, then there exists an interval $J \subset I$ 
containing no periodic points. Take an $x \in J$ which is not an endpoint of J, an 
open neighbourhood $N \subset J$ of x and an open interval $E \subset J \setminus N$. Since f is 
topologically transitive on I there exists a natural number $m > 0$ with $f^m(N) \cap E 
\neq \emptyset$ and thus a $y \in J$ with $f^m(y) \in E \subset J$. Since J contains no periodic points we 
know that $y \neq f^m(y)$ and since f is continuous this implies that we can find a 
neighbourhood U of y with $f^m(U) \cap U = \emptyset$. Since U is an open set we can use 
the topological transitivity again and find an $n > m$ and a $z \in U$ with $f^n(z) \in U$. 
But then we have $0 < m < n$ and $z, f^n(z) \in U$ while $f^m(z) \notin U$ and this violates 
our earlier lemma. □ 

We know now that for maps on an interval the only condition that has to be 
checked for Devaney’s definition of chaos is the first one, topological transitivity. 
Note that the proof cannot be generalized for higher dimensions or the unit circle 
$S^1$ because our lemma uses the ordering on $\mathbb{R}$ in an essential way. 

For completeness we note that there are no other trivialities in Devaney’s 
definition when restricted to intervals: 


A continuous function on an interval whose periodic points are dense doesn’t need 
to have sensitive dependence on initial conditions. 


The identity function on any interval trivially proves this. 


A continuous function on an interval which has sensitive dependence on initial 
conditions and whose periodic points are dense does not have to be transitive. 


Define on $I = \mathbb{R}_+$ the function 

\[
f(x) = \begin{cases}
3x & 0 \le x < \tfrac{1}{3} \\
-3x + 2 & \tfrac{1}{3} \le x < \tfrac{2}{3} \\
3x - 2 & \tfrac{2}{3} \le x < 1 \\
f(x - 1) + 1 & x \ge 1
\end{cases}
\]

It is sensitive on initial conditions since $|df/dx(x)| = 3$ for all points on I, so every 
neighbourhood around a point will expand under iteration. It is easy to establish 
that $f^n$ has $3^n - 2$ fixed points between any two integer values with distances 
between these points smaller than $(\tfrac{1}{3})^{n-1}$, so the periodic points are dense. But 
since $f([0,1]) = [0,1]$ the function is not topologically transitive. When one 
restricts this function to the interval $I = [0,2]$ one sees that it is a counterexample 
for finite I as well. 
A continuous function on an interval which has sensitive dependence on initial 
conditions doesn’t need to have periodic points which are dense. 


As counterexample take the interval $I = [0, \tfrac{3}{4}]$ and the function 

\[
f(x) = \begin{cases}
\tfrac{3}{2}x & 0 \le x < \tfrac{1}{2} \\
\tfrac{3}{2}(1 - x) & \tfrac{1}{2} \le x \le \tfrac{3}{4}
\end{cases}
\]

Sensitive dependence is clear again since the function is expanding, but there can 
be no periodic points in $]0, 3/8[$ since it is easy to establish that any trajectory with 
initial value in this subinterval will not return there. For a counterexample in the 
infinite case, take $I = \mathbb{R}$ and $f(x) = 2x$. 
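
The claim that trajectories starting in $]0, 3/8[$ never return can be spot-checked with exact rational arithmetic. The sketch below assumes the counterexample is the tent-like map of slope $\pm 3/2$ on $[0, 3/4]$, that is $f(x) = \tfrac32 x$ for $x < \tfrac12$ and $f(x) = \tfrac32(1 - x)$ otherwise; the function names and the number of iterations are illustrative choices, and a finite check of this kind is evidence, not a proof.

```python
from fractions import Fraction as F

def f(x):
    """Tent-like map on [0, 3/4] with slope +3/2, then -3/2 (assumed form)."""
    return F(3, 2) * x if x < F(1, 2) else F(3, 2) * (1 - x)

def leaves_and_never_returns(x0, steps=40):
    """True if the orbit of x0 leaves ]0, 3/8[ within `steps` iterations
    and is never seen inside that interval again."""
    x, has_left = x0, False
    for _ in range(steps):
        x = f(x)
        inside = 0 < x < F(3, 8)
        if has_left and inside:
            return False          # the orbit came back: would contradict the claim
        has_left = has_left or not inside
    return has_left

# Every starting point k/64 with 0 < k/64 < 3/8 behaves as claimed.
print(all(leaves_and_never_returns(F(k, 64)) for k in range(1, 24)))
```

The reason behind the behaviour is that $f(3/4) = 3/8$ and $f(1/2) = 3/4$, so $[3/8, 3/4]$ is mapped into itself, while points below $3/8$ are multiplied by $3/2$ until they enter it.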
Proof of a Mixed Arithmetic-Mean, 
Geometric-Mean Inequality 


Kiran Kedlaya* 


The following conjecture was made by F. Holland in [3]. 


Conjecture. Let $x_1, x_2, \ldots, x_n$ be positive real numbers. The arithmetic mean of the 
numbers 

\[ x_1,\ \sqrt{x_1 x_2},\ \sqrt[3]{x_1 x_2 x_3},\ \ldots,\ \sqrt[n]{x_1 x_2 \cdots x_n} \]

does not exceed the geometric mean of the numbers 

\[ x_1,\ \frac{x_1 + x_2}{2},\ \frac{x_1 + x_2 + x_3}{3},\ \ldots,\ \frac{x_1 + x_2 + \cdots + x_n}{n}. \]

There is equality if and only if $x_1 = x_2 = \cdots = x_n$. 


* The author wishes to thank Cecil Rousseau for his help in preparing this note. 
To prove this conjecture, we begin with the following bit of combinatorics. The 
utility of this construction will become clear as the five properties listed below are 
invoked. (The vectors can be explicitly constructed by trial and error up to about 
n = 5 so as to satisfy the properties; from these the general formula was deduced.) 


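Lemma. The vectors $a(i, j) = (a_1(i, j), a_2(i, j), \ldots, a_n(i, j))$ given by 

\[ a_k(i, j) = \binom{n-i}{j-k}\binom{i-1}{k-1} \Big/ \binom{n-1}{j-1} \tag{1} \]

\[ \phantom{a_k(i, j)} = \frac{(n-i)!\,(n-j)!\,(i-1)!\,(j-1)!}{(n-1)!\,(k-1)!\,(n-i-j+k)!\,(i-k)!\,(j-k)!} \tag{2} \]

$(i, j = 1, 2, \ldots, n)$ satisfy 
(i) $a_k(i, j) \ge 0$ for all i, j, k, 
(ii) $a_k(i, j) = 0$ for $k > \min(i, j)$, 
(iii) $a_k(i, j) = a_k(j, i)$ for all i, j, k, 
(iv) $\sum_{k=1}^{n} a_k(i, j) = 1$ for all i, j, 
(v) $\sum_{i=1}^{n} a_k(i, j) = n/j$ for $k \le j$, and $= 0$ for $k > j$. 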


Proof of the Lemma: Properties (i) and (ii) are obvious, and property (iii) is 
transparent in (2). Property (iv) is a consequence of the standard Vandermonde 
identity and property (v) follows from a variation of the Vandermonde identity 
obtained through appropriate use of upper negation. (See [1], p. 169.) Specifically, 

\[ \sum_{k=1}^{n} a_k(i, j) = \sum_{k=1}^{n} \binom{n-i}{j-k}\binom{i-1}{k-1} \Big/ \binom{n-1}{j-1} = 1, \]

and for $k \le j$, 

\[ \sum_{i=1}^{n} a_k(i, j) = \sum_{i=1}^{n} \binom{n-i}{j-k}\binom{i-1}{k-1} \Big/ \binom{n-1}{j-1} = \binom{n}{j} \Big/ \binom{n-1}{j-1} = \frac{n}{j}. \]

Alternatively, $\binom{n-i}{j-k}\binom{i-1}{k-1}$ can be interpreted as the number of j-element subsets 
of $\{1, \ldots, n\}$ whose k-th element is i. When summed over i this counts all 
j-element subsets of $\{1, \ldots, n\}$ for $k \le j$, which number $\binom{n}{j}$; when summed over k 
this counts only those subsets containing i, which number $\binom{n-1}{j-1}$. 
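
Properties (i) through (v) are easy to confirm by machine for small n. The sketch below is an illustration, not part of the note: it assumes formula (1) in the form $a_k(i,j) = \binom{n-i}{j-k}\binom{i-1}{k-1}/\binom{n-1}{j-1}$, with binomial coefficients taken to be zero outside the usual range, and the helper names `binom`, `a` and `check_lemma` are hypothetical.

```python
from fractions import Fraction
from math import comb

def binom(n, k):
    """Binomial coefficient, taken to be 0 when k is outside 0..n."""
    return comb(n, k) if 0 <= k <= n else 0

def a(n, i, j, k):
    """The weight a_k(i, j) of formula (1), as an exact fraction."""
    return Fraction(binom(n - i, j - k) * binom(i - 1, k - 1), binom(n - 1, j - 1))

def check_lemma(n):
    """Assert properties (i)-(v) for all 1 <= i, j, k <= n; return True on success."""
    idx = range(1, n + 1)
    for i in idx:
        for j in idx:
            row = [a(n, i, j, k) for k in idx]
            assert all(v >= 0 for v in row)                              # (i)
            assert all(row[k - 1] == 0 for k in idx if k > min(i, j))    # (ii)
            assert row == [a(n, j, i, k) for k in idx]                   # (iii)
            assert sum(row) == 1                                         # (iv)
    for j in idx:
        for k in idx:
            column_sum = sum(a(n, i, j, k) for i in idx)
            assert column_sum == (Fraction(n, j) if k <= j else 0)       # (v)
    return True

print(all(check_lemma(n) for n in range(1, 7)))
```

Exact fractions are used so that the equalities in (iii), (iv) and (v) are tested without rounding error.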


Proof of the Conjecture: We use the notation of Hardy, Littlewood and Pólya [2] 
for weighted arithmetic and geometric means: 

\[ \mathfrak{A}(x, a) = \sum_{k=1}^{n} a_k x_k \quad\text{and}\quad \mathfrak{G}(x, a) = \prod_{k=1}^{n} x_k^{a_k}, \]

where $a = (a_1, a_2, \ldots, a_n)$ is an n-tuple of nonnegative numbers satisfying 
$\sum_{k=1}^{n} a_k = 1$. By the arithmetic-mean, geometric-mean inequality 

\[ \mathfrak{A}(x, a) \ge \mathfrak{G}(x, a), \tag{3} \]

with equality if and only if $x_k$ is constant over all k for which $a_k > 0$. Let $\mathfrak{A}(i, j)$ 
and $\mathfrak{G}(i, j)$ be the means obtained by setting $a = a(i, j)$ in $\mathfrak{A}(x, a)$ and $\mathfrak{G}(x, a)$ 
respectively. These are means because of properties (i) and (iv). Using properties 
(ii) and (v) and the AM-GM inequality (3), we obtain 

\[ \frac{x_1 + x_2 + \cdots + x_j}{j} = \frac{1}{n}\sum_{k=1}^{n} x_k \sum_{i=1}^{n} a_k(i, j) = \frac{1}{n}\sum_{i=1}^{n} \mathfrak{A}(i, j) \ge \frac{1}{n}\sum_{i=1}^{n} \mathfrak{G}(i, j). \tag{4} \]

Taking the geometric mean of both sides over j, we have 

\[ \left( \prod_{j=1}^{n} \frac{x_1 + x_2 + \cdots + x_j}{j} \right)^{1/n} \ge \frac{1}{n} \left( \prod_{j=1}^{n} \sum_{i=1}^{n} \mathfrak{G}(i, j) \right)^{1/n}. \tag{5} \]

From Hölder’s inequality ([2], p. 21), we know 

\[ \frac{1}{n} \left( \prod_{j=1}^{n} \sum_{i=1}^{n} \mathfrak{G}(i, j) \right)^{1/n} \ge \frac{1}{n} \sum_{i=1}^{n} \prod_{j=1}^{n} \mathfrak{G}(i, j)^{1/n}. \tag{6} \]

Equality holds only if every two of $g_1, g_2, \ldots, g_n$ are proportional where 
$g_i = (\mathfrak{G}(i, 1), \mathfrak{G}(i, 2), \ldots, \mathfrak{G}(i, n))$ $(i = 1, 2, \ldots, n)$. 

Since $\mathfrak{G}(i, 1) = x_1$ for all i, this would imply that $g_1 = g_2 = \cdots = g_n$. But 
$\mathfrak{G}(i, n) = x_i$, so that would imply $x_1 = x_2 = \cdots = x_n$. 
Also notice that by (iii) and (v), 

\[ \prod_{j=1}^{n} \mathfrak{G}(i, j)^{1/n} = \prod_{k=1}^{n} \prod_{j=1}^{n} x_k^{a_k(i, j)/n} = \prod_{k=1}^{i} x_k^{1/i} = \sqrt[i]{x_1 x_2 \cdots x_i}. \tag{7} \]

Now combining (5), (6) and (7) gives 

\[ \left( \prod_{j=1}^{n} \frac{x_1 + x_2 + \cdots + x_j}{j} \right)^{1/n} \ge \frac{1}{n} \sum_{j=1}^{n} \sqrt[j]{x_1 x_2 \cdots x_j}, \tag{8} \]

the desired inequality. As we have shown that equality cannot occur unless 
$x_1 = x_2 = \cdots = x_n$ (in which case equality is obvious), the conjecture of Holland 
is correct. □ 
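
A numerical spot check of the inequality is straightforward. The following sketch is illustrative only: the function name `mixed_means` is an assumption, the computation uses floating point, and a few sample inputs say nothing beyond those inputs.

```python
from math import prod

def mixed_means(xs):
    """Return (arithmetic mean of the prefix geometric means,
               geometric mean of the prefix arithmetic means)."""
    n = len(xs)
    prefix_gms = [prod(xs[:j]) ** (1 / j) for j in range(1, n + 1)]
    prefix_ams = [sum(xs[:j]) / j for j in range(1, n + 1)]
    am_of_gms = sum(prefix_gms) / n
    gm_of_ams = prod(prefix_ams) ** (1 / n)
    return am_of_gms, gm_of_ams

lhs, rhs = mixed_means([1, 2, 3, 4])
print(lhs <= rhs)            # the inequality on one sample
print(mixed_means([2.0] * 5))  # all x_k equal: the two means agree up to rounding
```

The order of the $x_k$ matters here, since both sides are built from prefixes $x_1, \ldots, x_j$.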
UNSOLVED PROBLEMS 
Edited by: Richard Guy & Richard Nowakowski 


In this department the MONTHLY presents easily stated unsolved problems dealing 
with notions ordinarily encountered in undergraduate mathematics. Each problem 
should be accompanied by relevant references (if any are known to the author) and by 
a brief description of known partial or related results. Typescripts should be sent to 
Richard Guy, Department of Mathematics & Statistics, The University of Calgary, 


Alberta, Canada T2N 1N4. 


ApSimon’s Mints Problem 


Richard Guy and Richard Nowakowski 


H. ApSimon, on pages 65-76 of Mathematical Byways in Ayling, Beeling and 
Ceiling, Oxford University Press, 1984, introduces the problem of trying to identify 
which, if any, of a number of mints are producing a certain coin with a variant 
material. It is only possible to detect the difference by weighing. The correct 
weight of a coin is known and the small difference of the variant, if it is used, is 
always the same. Only two weighings are allowed, and the object is to find which 
mints are using the variant material, but using as few sample coins as possible. 

ApSimon gives the following analysis. Suppose that there are n mints. Let C be 
the true weight of a coin and $C(1 + \epsilon)$ be the weight of a coin made with the 
variant material. For $i = 1, \ldots, n$, let $a_i$ ($b_i$) be the number of coins from mint i 
used in the first (second) weighing and let $d_i$, $i = 1, \ldots, n$, be zero or one 
according as the ith mint is using the correct or the variant material. Note that the 
case that all $d_i = 0$ means all the mints are using the correct material. Also, we 
cannot have $a_i = b_i = 0$ for any i since this would give no information about 
mint i. 

Let the results of the two weighings be W and X, so that 

\[ W = \sum_{i=1}^{n} C(1 + d_i\epsilon)a_i \quad\text{and}\quad X = \sum_{i=1}^{n} C(1 + d_i\epsilon)b_i, \]

\[ \Big[ W - \sum_{i=1}^{n} Ca_i \Big] \Big/ C = \sum_{i=1}^{n} d_i\epsilon a_i \quad\text{and}\quad \Big[ X - \sum_{i=1}^{n} Cb_i \Big] \Big/ C = \sum_{i=1}^{n} d_i\epsilon b_i. \]

From the previous equations we have 

\[ \frac{W - \sum_{i=1}^{n} Ca_i}{X - \sum_{i=1}^{n} Cb_i} = \frac{\sum_{i=1}^{n} d_i\epsilon a_i}{\sum_{i=1}^{n} d_i\epsilon b_i} = \frac{\sum_{i=1}^{n} d_i a_i}{\sum_{i=1}^{n} d_i b_i}. \]
The lefthand side is known, so to determine which mints are providing counterfeit 
coins we just need to choose the $a_i$ and $b_i$ so that the left hand side takes on 
$2^n - 1$ different values for the $2^n - 1$ different possible choices of the $d_i$. 

ApSimon showed that this was always possible. He gave the solution $b_i = 1$ and 
$a_i = i!$ for $i = 1, \ldots, n$. 

Since we require $2^n - 1$ different values, then an efficient way is to have $2^{n/2}$ 
different sums on top and on bottom. This means that we require n/2 different 
values for the $a_i$ and $b_i$, at best $0, 1, \ldots, \lfloor n/2 \rfloor$, and that the total number of coins is 
at least $\tfrac{1}{2}\lfloor n/2 \rfloor \lfloor (n + 2)/2 \rfloor$. 

For one mint, only one coin and one weighing is necessary. For two mints 
it is clearly necessary and sufficient to take a coin from each mint; they 
may be weighed separately. For 3, 4 and 5 mints, ApSimon found that the least 
number of coins is 4, 8, and 15. Possible values for the $a_i$ and $b_i$ are 
{0, 1, 2 & 1, 1, 0}; {0, 1, 2, 3 & 1, 0, 2, 2} or {0, 1, 1, 4 & 2, 0, 1, 1}; 
and {1, 0, 1, 4, 5 & 1, 2, 2, 5, 0}. 

He asks two questions: 


1. Are $2^{n-1}$ coins always sufficient? 
2. What is the least number of coins needed for n mints? 


It was found by computer that 38 coins suffice for 6 mints by taking the $a_i$ to be 
{1, 1, 2, 3, 5, 8} and the $b_i$ to be {11, 6, 5, 3, 1, 1} and that 74 coins suffice for 7 mints 
by taking $\{a_i\}$ = {0, 1, 3, 6, 10, 15, 21} and $\{b_i\}$ = {1, 5, 8, 14, 10, 8, 0}. 

A result that is better for $9 \le n \le 12$ than ApSimon’s $\sum_{i=1}^{n} i!$ is obtained by 
noting that $(4^{n+1} - 1)/3$ coins suffice, where the $a_i$ are $\{0, 1, 2, \ldots, n - 1\}$ and the 
$b_i$ are $\{4, 4^2, \ldots, 4^n\}$. Curiously, this fails for n = 13. 
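
Whether a proposed pair of weighings works can be tested mechanically from the ratio condition above. The sketch below is a hedged illustration: the function names are hypothetical, ratios with a zero denominator are treated as a single extra value, and the coin count assumes that a coin may be reused in both weighings, so that mint i contributes $\max(a_i, b_i)$ coins. That counting rule is an inference; it reproduces the totals 4, 8, 15, 38 and 74 quoted in the text.

```python
from fractions import Fraction
from itertools import product
from math import factorial

def distinguishes(a, b):
    """True if every nonempty set of variant mints yields a different value of
    (sum of d_i a_i) : (sum of d_i b_i), with x : 0 counted as one extra value."""
    seen = set()
    for d in product((0, 1), repeat=len(a)):
        if not any(d):
            continue                      # all mints correct: handled separately
        top = sum(di * ai for di, ai in zip(d, a))
        bottom = sum(di * bi for di, bi in zip(d, b))
        if top == 0 and bottom == 0:
            return False                  # this choice of d leaves no trace at all
        key = "infinite" if bottom == 0 else Fraction(top, bottom)
        if key in seen:
            return False                  # two different choices of d look alike
        seen.add(key)
    return True

def coins_used(a, b):
    """Total coins, assuming the same coin may appear in both weighings."""
    return sum(max(ai, bi) for ai, bi in zip(a, b))

print(distinguishes([0, 1, 2], [1, 1, 0]), coins_used([0, 1, 2], [1, 1, 0]))
print(coins_used([1, 1, 2, 3, 5, 8], [11, 6, 5, 3, 1, 1]))

# Smallest n for which (4^(n+1) - 1)/3 undercuts the factorial scheme's coin count.
first_better = min(n for n in range(1, 13)
                   if (4 ** (n + 1) - 1) // 3 < sum(factorial(i) for i in range(1, n + 1)))
print(first_better)
```

The final line only compares the two coin counts; it does not test whether the powers-of-four scheme actually separates all cases for a given n.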

We can ask a further question: 


3. If we are using the least number of coins, is it true that there is a choice of $\{a_i\}$ 
and $\{b_i\}$ such that for some j and k, $a_j > b_j$ and $a_k < b_k$? 


[Cartoon. Caption: “… doesn’t know one end of a Möbius strip from the other.”] 
Jacobi’s Theorem 


In 1869, J. J. Sylvester gave an address to the British Association. 
One of the participants at that meeting was H. Jacobi, a physicist from 
St. Petersburg. He recounted to Sylvester the following episode 
concerning his celebrated brother C. G. J. Jacobi. (I have translated 
the French.) 


“Speaking to my deceased brother, one day, on the neces- 
sity of monitoring an observation by repeated experiments— 
even when these verified the hypothesis—he told of having 
discovered a remarkable law in the Theory of Numbers—a 
law, which he did not in the least doubt, held in all 
generality. However, wanting to be exceptionally cautious, 
or rather, wishing to do something he considered unneces- 
sary but prudent, he substituted a digit which he chose at 
random, or possibly by a kind of inspiration, and this choice 
did not verify the formula. Every other digit which he tried 
confirmed the result. Later he succeeded in proving that the 
digit he had chosen was one of a class which forms the only 
exceptions to his formula. 

This curious fact stayed in my memory, but since he passed 
away about 30 years ago, I do not recall any details.” 


Question to readers of AMM. What was Jacobi’s theorem to which his 
brother alludes? 


Submitted by Raymond Ayoub 
Department of Mathematics 


Pennsylvania State University 
University Park, PA 16802 
PROBLEMS AND SOLUTIONS 


Edited by: 
Richard T. Bumby, Fred Kochman and Douglas B. West 


Proposed problems should be sent to the MONTHLY PROBLEMS address given on 
the inside front cover. Please include solutions, relevant references, etc. Three copies 
are requested. 


Solutions of published problems should arrive before September 30, 1994 at the 
MONTHLY PROBLEMS address given on the inside front cover. Solutions should be 
typed with double spacing, including the problem number and the solver’s name and 
mailing address. Two copies suffice. A self-addressed postcard or label should be 
included if an acknowledgment is desired. 


An asterisk (*) after the number of a problem, or part of a problem, indicates that 
no solution is currently available. Partial solutions will be useful in such cases. 
Otherwise, the published solution is likely to be based on a solution which is complete 
and correct. Of course, an elegant partial solution or a method leading to a more 
general result is always useful and welcome. In addition, references to other 
appearances of MONTHLY problems or to solutions of these problems in the 
literature are also solicited. 


PROBLEMS 


10375. Proposed by John Brillhart and J. S. Lomont, University of Arizona, Tucson, 
AZ. 


Find the complete solution of the recurrence 


\[ U_{n+2} = 2(2n + 3)^2 U_{n+1} - 4(n + 1)^2 (2n + 1)(2n + 3) U_n \qquad (n \ge 0). \]


10376. Proposed by Nobuhisa Abe, Oita, Japan. 
Determine all integer solutions of 
\[ x(x + 1)(x + 2)(x + 3)(x + 4)(x + 5) = y^2 - 1. \]
10377. Proposed by Kathryn R. Laberteaux, student, University of Michigan, Ann 
Arbor, MI. 


On the final exam in a linear algebra class, I was asked to express the statement 
“A is Hermitian” in the form of a matrix identity. I should have written “A = A*,” 
but out of haste and exhaustion I wrote “AA* = A*” instead. Was my answer 
correct? 
10378. Proposed by Bjorn Poonen (student), University of California, Berkeley, CA. 


Given that point D is in the interior of $\triangle ABC$ and that there are real numbers 
a, b, c, d such that AB = ab, AC = ac, AD = ad, BC = bc, BD = bd, and CD = 
cd, prove that $|\angle ABD| + |\angle ACD| = \pi/3$. 


10379. Proposed by Michael Hirsch, Emory University, Atlanta, GA, and Jonathan 
L. King, University of Florida, Gainesville, FL. 


Consider a two-dimensional world that three-dimensional beings would see as a 
vertical plane. This world is endowed with gravity pulling objects in the downward 
direction. An experiment is performed in which various convex polygonal figures 
are placed on a “table” (i.e., a horizontal line segment). A convex polygon P is 
called stable on an edge e if, when placed with edge e on the table and rolled 
slightly either way, gravity causes P to roll back to rest on e. 

Let S(P) denote the number of stable edges of P. What is the minimum of 
S(P) over all convex polygons P of uniform density? 


10380. Proposed by Michael Slater, University of Bristol, Bristol, England. 


Suppose that $f_1, \ldots, f_n$ are continuous real periodic functions, and that $\sum_{i=1}^{n} f_i$ 
is a constant function, while no sum of fewer than n of the $f_i$ is a constant 
function. Show that the $f_i$ have a common period. 


10381. Proposed by Marcin E. Kuczma, University of Warsaw, Warszawa, Poland. 


Determine all real valued functions f on the integer lattice $\mathbb{Z}^2$ such that 
$f(u + v) = f(u) + f(v)$ for every pair of orthogonal vectors u, v in $\mathbb{Z}^2$. 


NOTES 


Notes: (10378) Notation such as $|\angle ABD|$ is used for the size (always non-negative) 
of the angle in radians. Since D is an interior point, $|\angle ABD| + |\angle ACD|$ is a 
priori between 0 and $\pi$. 


SOLUTIONS 


Keeping the Ones Apart 


10196 [1992, 162]. Proposed by Barry Hayes, Stanford University, Stanford CA, and 
David S. Pearson, Cornell University, Ithaca NY. 
Let $M_n$ be the set of n-bit binary strings containing no pairs of consecutive 
ones. For example, 

\[ M_3 = \{(0,0,0), (0,0,1), (0,1,0), (1,0,0), (1,0,1)\}. \]

Find the probability $p_n$ that if $(\delta_1, \delta_2, \ldots, \delta_n)$ and $(\epsilon_1, \epsilon_2, \ldots, \epsilon_n)$ are in $M_n$, then 
$(\max\{\delta_1, \epsilon_1\}, \max\{\delta_2, \epsilon_2\}, \ldots, \max\{\delta_n, \epsilon_n\})$ 
is in $M_n$. 


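Solution by Michael Andreoli, Miami Dade Community College (North), Miami, 
FL. The desired probability is given by 

\[ p_n = \frac{5 \cdot 2^{n+2}}{\sqrt{13}} \cdot \frac{(1 + \sqrt{13})^{n+2} - (1 - \sqrt{13})^{n+2}}{\left( (1 + \sqrt{5})^{n+2} - (1 - \sqrt{5})^{n+2} \right)^2}. \]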


An n-bit binary string may be identified with a sequence of n Bernoulli trials by 
letting 1 denote success and 0 failure. Consider a sequence of independent 
Bernoulli trials with success probability p and failure probability $q = 1 - p$. If $T_n$ 
denotes the probability that there is no run of two consecutive successes in the first 
n trials, then conditioning on the outcome of the last trial yields 

\[ T_n = qT_{n-1} + pqT_{n-2}, \quad\text{with } T_0 = T_1 = 1. \tag{1} \]


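Now suppose two players simultaneously and independently toss fair coins. Say 
that a win occurs on the kth toss if at least one player has a head on that toss. 
Consider the events 

$A_n$ = {Player I has no run of two consecutive heads in the first n tosses} 

$B_n$ = {Player II has no run of two consecutive heads in the first n tosses} 

$C_n$ = {There is no run of two consecutive wins in the first n tosses}. 

We then have 

\[ p_n = P(C_n \mid A_n \cap B_n) = \frac{P(A_n \cap B_n \cap C_n)}{P(A_n \cap B_n)} = \frac{P(C_n)}{P(A_n)P(B_n)}. \tag{2} \]

Letting $Q_n = P(A_n) = P(B_n)$ and applying (1) with p = 1/2, we obtain 

\[ Q_n = \tfrac{1}{2}Q_{n-1} + \tfrac{1}{4}Q_{n-2}, \quad\text{with } Q_0 = Q_1 = 1, \]

which has the solution 

\[ Q_n = \left( \frac{\sqrt{5} + 3}{2\sqrt{5}} \right) \left( \frac{1 + \sqrt{5}}{4} \right)^n + \left( \frac{\sqrt{5} - 3}{2\sqrt{5}} \right) \left( \frac{1 - \sqrt{5}}{4} \right)^n. \]

Similarly, letting $R_n = P(C_n)$ and applying (1) with p = 3/4 (the probability of a win on a 
single toss), we obtain 

\[ R_n = \tfrac{1}{4}R_{n-1} + \tfrac{3}{16}R_{n-2}, \quad\text{with } R_0 = R_1 = 1, \]

which has the solution 

\[ R_n = \left( \frac{\sqrt{13} + 7}{2\sqrt{13}} \right) \left( \frac{1 + \sqrt{13}}{8} \right)^n + \left( \frac{\sqrt{13} - 7}{2\sqrt{13}} \right) \left( \frac{1 - \sqrt{13}}{8} \right)^n. \]

Substituting these values for $Q_n$ and $R_n$ into (2) and simplifying, we obtain the 
result stated at the outset. 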
Editorial comment. Most solvers first observed that $|M_n|$ is the $(n + 2)$th 
Fibonacci number (see, e.g., Example 10.10 in Ralph P. Grimaldi, Discrete and 
Combinatorial Mathematics, 2nd ed., Addison-Wesley, 1989). They then used a 
similar counting argument to establish that the number of pairs from $M_n$ whose 
bit-wise or is in $M_n$ satisfies the recursion $G_{n+1} = G_n + 3G_{n-1}$. The result follows 
easily by standard techniques. A similar counting argument was exploited in 
Richard Austin and Richard Guy, “Binary sequences without isolated ones,” 
Fibonacci Quarterly 16 (1978), 84-86. 
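
Both the recursion and the closed form can be compared against direct enumeration for small n. The sketch below is an illustration rather than any solver's method: the function names are hypothetical, and the closed form is assumed to be $p_n = \frac{5 \cdot 2^{n+2}}{\sqrt{13}} \cdot \frac{(1+\sqrt{13})^{n+2} - (1-\sqrt{13})^{n+2}}{((1+\sqrt5)^{n+2} - (1-\sqrt5)^{n+2})^2}$, evaluated in floating point.

```python
from itertools import product
from math import sqrt

def no_adjacent_ones(bits):
    """True if the tuple of 0/1 entries has no two consecutive ones."""
    return all(not (x and y) for x, y in zip(bits, bits[1:]))

def brute_force(n):
    """Return (|M_n|, number of ordered pairs from M_n whose bitwise OR is in M_n)."""
    M = [s for s in product((0, 1), repeat=n) if no_adjacent_ones(s)]
    good = sum(no_adjacent_ones(tuple(max(d, e) for d, e in zip(s, t)))
               for s in M for t in M)
    return len(M), good

def closed_form(n):
    """The assumed closed form for p_n, in floating point."""
    s5, s13 = sqrt(5), sqrt(13)
    numerator = (1 + s13) ** (n + 2) - (1 - s13) ** (n + 2)
    denominator = ((1 + s5) ** (n + 2) - (1 - s5) ** (n + 2)) ** 2
    return 5 * 2 ** (n + 2) / s13 * numerator / denominator

for n in range(1, 7):
    size, good = brute_force(n)
    print(n, size, good, good / size ** 2, closed_form(n))
```

For n = 3 the enumeration finds 5 strings and 19 good pairs, in line with $G_3 = G_2 + 3G_1 = 7 + 3 \cdot 4$.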


Solved also by J. C. Binz (Switzerland), W. J. Buhler (Germany), D. Callan, R. J. Chapman (U.K.), 
W. Y. C. Chen, C. P. Grant, I. Kastanas, K. S. Kedlaya (student), K. M. Levasseur, J. H. Lindsey II, 
G.A. Martin, T. S. Norfolk, H. Schmidt Jr., M. Stamp, J. H. Steelman, R. Stong, F. B. Strauss, C. Subi, 
D. White, National Security Agency Problems Group, Theory First, the 1993 UK IMO team (students, 
U.K.), and the proposers. 


The Longest Shortest Closed Walk 


10204 [1992, 265-6]. Proposed by Edgar A. Ramos and Douglas B. West, University 
of Illinois, Urbana, IL. 


Given a strongly connected directed graph G, let s(G) be the length of the 
shortest closed walk visiting every vertex. Determine, for each positive integer n, 
the maximum value of s(G) over strongly connected directed graphs with n 
vertices. 


Solution by David M. Wells, Penn State University, New Kensington, PA. The 
maximum value is $\lfloor (n + 1)^2/4 \rfloor$. To prove the upper bound, let G be a strongly 
connected digraph with vertices $v_1, \ldots, v_n$. Let $d(i, j)$ denote the length of the 
shortest directed path from $v_i$ to $v_j$, and let k denote the maximum $d(i, j)$ over all 
$i \neq j$. We may index the vertices so that this longest shortest path is $v_1, \ldots, v_{k+1}$. 
This walk can be extended to a closed walk by appending paths to visit vertices 
$v_{k+2}, \ldots, v_n$ and $v_1$ in order. Each of these segments has length at most k, so we 
have constructed a spanning closed walk of length at most $k(n + 1 - k)$. For 
$1 \le k \le n$, this quantity is bounded by $\lfloor (n + 1)^2/4 \rfloor$. 

Letting k = |(n + 1)/2], the maximum is attained by the digraph consisting of 
edges u,0;,, for 1 <i < k — 1, edges v,v,; for k + 1 <j <n, and edges u,v, for 
k+1<j<n. 


Editorial comment. Most solvers gave a similar proof, starting with a longest 
path or a longest cycle and building to a spanning walk by including the remaining 
vertices. Some solvers began with a shortest spanning closed walk and partitioned 
it into cycles that each contain a vertex not on any other cycle. Hence the total 
length is at most k(n + 1 — k), where k is the number of cycles in the decomposi- 
tion. 

Paul Cull noted that he published this result in his article “Tours of graphs, 
digraphs, and sequential machines,” IEEE Trans. Computers C29 (1980), 50-54, 
using the proof printed here. The solution of S. L. Hakimi and E. F. Schmeichel 
pointed out that the related problem in which the walk is not required to be closed 
has the solution $\lfloor n^2/4 \rfloor$. 


Solved also by P. Cull, J. W. Grossman, S. L. Hakimi & E. F. Schmeichel, R. High, G. Johns & T. 
Sprague & T. Sipka, K. S. Kedlaya (student), N. Komanda, O. P. Lossers (The Netherlands), R. F. 
McCoart, Jr., E. R. Scheinerman, R. Stong, and the proposer. 
Spectral Representation of the Laplace Transform 


10237 [1992, 571]. Proposed by Paul R. Chernoff, University of California, Berkeley, 
CA. 


Consider the Laplace transform $\mathcal{L}$ as an operator on $L^2(0, \infty)$. Show that $\mathcal{L}$ is a 
bounded self-adjoint operator which is unitarily equivalent to the “position 
operator” X = multiplication by the coordinate x on $L^2(-\sqrt{\pi}, \sqrt{\pi})$. 


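Solution by the proposer. It is convenient to work with the measure dt/t, which 
is the Haar measure for the multiplicative group $\mathbb{R}_+$ of positive reals. Accordingly, 
define the unitary operator $W: L^2(0, \infty) \to L^2(\mathbb{R}_+; dt/t)$ by $(Wf)(t) = f(t)\sqrt{t}$. Then 
define $L_1 = W\mathcal{L}W^{-1}$ on $L^2(\mathbb{R}_+; dt/t)$. We have the formula 

\[
\begin{aligned}
(L_1\phi)(x) &= \sqrt{x} \int_0^\infty e^{-xt}\, \phi(t)\, \frac{dt}{\sqrt{t}} \\
&= \int_0^\infty \sqrt{xt}\, e^{-xt}\, \phi(t)\, \frac{dt}{t} \\
&= \int_0^\infty \sqrt{t}\, e^{-t}\, \phi\!\left( \frac{t}{x} \right) \frac{dt}{t} \\
&= \int_0^\infty \sqrt{t}\, e^{-t}\, \tilde{\phi}\!\left( \frac{x}{t} \right) \frac{dt}{t}.
\end{aligned}
\]

Here $\tilde{\phi}(t) = \phi(t^{-1})$. Note that the map $\phi \mapsto \tilde{\phi}$ is unitary on $L^2(\mathbb{R}_+; dt/t)$. 

Thus $L_1\phi = k * \tilde{\phi}$, where $k(t) = \sqrt{t}\,e^{-t}$ and * is convolution on the 
multiplicative group $\mathbb{R}_+$. Since k is an $L^1$ function with respect to the Haar measure dt/t, it 
follows that $L_1$ is a bounded operator on $L^2(\mathbb{R}_+; dt/t)$, and therefore the unitarily 
equivalent operator $\mathcal{L}$ is bounded on $L^2(0, \infty)$. Therefore $\mathcal{L}$ is a self-adjoint 
Carleman integral operator. 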

Next, via the Mellin transform (the Fourier transform on the group $\mathbb{R}_+$), we see 
that $L_1$ is unitarily equivalent to the operator N on $L^2(-\infty, \infty)$ given by the 
formula 

\[ (N\psi)(s) = K(s)\psi(-s) \]

where K is the Mellin transform of k: 

\[ K(s) = \int_0^\infty t^{2\pi i s} \sqrt{t}\, e^{-t}\, \frac{dt}{t} = \Gamma(1/2 + 2\pi i s). \]

Thus $N = M_K R$ where $M_K$ is multiplication by K and R is the unitary reflection 
operator $(R\psi)(s) = \psi(-s)$. Note the relation $K(-s) = \Gamma(1/2 - 2\pi i s) = \overline{K(s)}$, 
which is precisely equivalent to the self-adjointness of the operator N. 

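Define $\omega(s) = K(s)/|K(s)|$. Then $|\omega(s)| = 1$ and $\omega(-s) = \overline{\omega(s)} = \omega(s)^{-1}$. Let 

\[ H^+ = \{\psi \in L^2(-\infty, \infty) : \psi(-s) = \omega(s)^{-1}\psi(s)\}, \]
\[ H^- = \{\psi \in L^2(-\infty, \infty) : \psi(-s) = -\omega(s)^{-1}\psi(s)\}. \]

Then it is easy to see that $L^2$ is the orthogonal direct sum of the closed subspaces 
$H^+$ and $H^-$. (Indeed, an arbitrary $\psi \in L^2$ decomposes as $\psi^+ + \psi^-$, where 
$\psi^\pm(s) = (\psi(s) \pm \omega(s)\psi(-s))/2$.) 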

The subspaces $H^\pm$ are invariant under the operator N. For example, if 
$\psi \in H^+$ then 

\[ (N\psi)(s) = K(s)\psi(-s) = K(s)\omega(s)^{-1}\psi(s) = |K(s)|\psi(s) \]

and $|K|\psi \in H^+$ because $|K|$ is an even function. Similarly, if $\psi \in H^-$, we have 
$(N\psi)(s) = -|K(s)|\psi(s)$. 

Finally, note that the map $\psi \mapsto \sqrt{2} \cdot \psi$ restricted to $(0, \infty)$ is unitary from $H^\pm$ 
to $L^2(0, \infty)$. Conclusion: $N^+$ (the restriction of N to $H^+$) is unitarily equivalent to 
the multiplication operator $M_{|K|}$ on $L^2(0, \infty)$. Likewise $N^-$ is unitarily equivalent 
to $M_{-|K|}$. Of course N is the direct sum of $N^+$ and $N^-$. 


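Now

\[ |K(s)| = \left(\Gamma\!\left(\tfrac12 + 2\pi i s\right)\Gamma\!\left(\tfrac12 - 2\pi i s\right)\right)^{1/2} = \left(\frac{\pi}{\cosh 2\pi^2 s}\right)^{1/2}, \]

a monotone function on $(0,\infty)$ with $0 < |K| < \sqrt{\pi}$. Accordingly $M_{|K|}$ is unitarily
equivalent to the position operator $X$ on $L^2(0,\sqrt{\pi})$, and $M_{-|K|}$ is unitarily
equivalent to the position operator on $L^2(-\sqrt{\pi}, 0)$.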
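
The closed form for $|K(s)|$ is stated without derivation. One way to recover it, sketched here on the assumption that the intended tool is Euler's reflection formula for the gamma function:

```latex
\[
\Gamma(z)\,\Gamma(1-z) = \frac{\pi}{\sin \pi z},
\qquad z = \tfrac12 + 2\pi i s ,
\]
\[
\Gamma\!\left(\tfrac12 + 2\pi i s\right)\Gamma\!\left(\tfrac12 - 2\pi i s\right)
  = \frac{\pi}{\sin\!\left(\tfrac{\pi}{2} + 2\pi^2 i s\right)}
  = \frac{\pi}{\cos(2\pi^2 i s)}
  = \frac{\pi}{\cosh 2\pi^2 s}.
\]
```

Since $\cosh$ is increasing on $(0,\infty)$ with $\cosh 0 = 1$, this also shows that $|K|$ decreases from $\sqrt{\pi}$ toward $0$.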

To sum up: by putting together our chain of unitary equivalences, we have
proved that the Laplace transform operator $L$ is self-adjoint, bounded (so a
Carleman integral operator), and unitarily equivalent to the position operator $X$
on $L^2(-\sqrt{\pi}, \sqrt{\pi})$. (In particular $\|L\| = \sqrt{\pi}$, a known fact; cf. the discussion of
Hardy-type inequalities in Nelson Dunford and Jacob T. Schwartz, Linear Operators I,
Interscience, 1958.)


Editorial comment. Jerome Goldstein supplied a reference to D. S. Gilliam, 
J. R. Schulenberger and J. L. Lund, “Spectral representation of the Laplace and
Stieltjes transforms,” Mat. Aplic. 7 (1988), 101-107. This paper contains a solution
of this problem, which the authors characterize as “at least partially known,” and 
they also point out that this provides a rare example of a bounded operator with a 
continuous spectrum for which the spectral representation is explicitly known. 


Solved also by R. J. Chapman (U.K.) and A. Vogt.


Moments of a Bivariate Normal Distribution


10246 [1992, 676]. Proposed by B. C. Carlson, Iowa State University, Ames, IA. 


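For integers $m$ and $n$ with $n > 0$ and $-n < m < n$, find values of $A$, $N$, and $\lambda$
such that

\[ \frac{1}{2\pi\sqrt{1-\rho^2}} \int_{-\infty}^{\infty}\!\int_{-\infty}^{\infty} x^{n+m} y^{n-m} \exp\!\left[\frac{-x^2 + 2\rho xy - y^2}{2(1-\rho^2)}\right] dx\,dy = A\,C_N^{\lambda}(-\rho) \]

for $-1 < \rho < 1$, where $C_N^{\lambda}$ is a Gegenbauer polynomial.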


Solution by F. W. Steutel, Eindhoven University of Technology, Eindhoven, The
Netherlands. Clearly, the integral to be evaluated, denoted here by $I_{n,m}$, is given by

\[ I_{n,m} = E\,X^{n+m} Y^{n-m}, \]

where $(X, Y)$ is a normally distributed random vector with zero means, unit
variances and correlation coefficient $\rho$, and where $E$ denotes expectation. As we
have

\[ \sum_{m=-n}^{n} \binom{2n}{n+m}\, t^{\,n-m} X^{n+m} Y^{n-m} = (X + tY)^{2n}, \]

it follows that

\[ I_{n,m} = \binom{2n}{n+m}^{-1} \times \bigl[\text{coefficient of } t^{\,n-m} \text{ in } E(X + tY)^{2n}\bigr]. \]


Since $X + tY$ is a normally distributed random variable with mean zero and
variance $1 + 2\rho t + t^2$ we have

\[ E(X + tY)^{2n} = 1 \cdot 3 \cdot 5 \cdots (2n-1)\,(1 + 2\rho t + t^2)^n. \]

From the definition of $C_N^{\lambda}(z)$ we then obtain

\[ I_{n,m} = I_{n,-m} = \frac{(n+m)!\,(n-m)!}{2^n\, n!}\, C_{n-|m|}^{-n}(-\rho). \]
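
The coefficient extraction in this solution is easy to check by machine. The sketch below is illustrative only and is not part of the published solution: the function names are hypothetical, and it works with the coefficients of $(1 + 2\rho t + t^2)^n$ directly, which (assuming the usual generating function $(1 - 2zt + t^2)^{-\lambda} = \sum_N C_N^{\lambda}(z)\,t^N$) are the values $C_N^{-n}(-\rho)$.

```python
from fractions import Fraction
from math import factorial


def trinomial_power_coeffs(rho, n):
    """Coefficients of (1 + 2*rho*t + t^2)^n, lowest degree first."""
    base = [Fraction(1), 2 * Fraction(rho), Fraction(1)]
    coeffs = [Fraction(1)]
    for _ in range(n):
        new = [Fraction(0)] * (len(coeffs) + 2)
        for i, a in enumerate(coeffs):
            for j, b in enumerate(base):
                new[i + j] += a * b
        coeffs = new
    return coeffs


def prefactor(n, m):
    """The constant A = (n+m)! (n-m)! / (2^n n!) from the solution."""
    return Fraction(factorial(n + m) * factorial(n - m), 2**n * factorial(n))


def moment(n, m, rho):
    """E[X^(n+m) Y^(n-m)] as A times the coefficient of t^(n-m)."""
    if abs(m) > n:
        raise ValueError("need -n <= m <= n")
    return prefactor(n, m) * trinomial_power_coeffs(rho, n)[n - m]


if __name__ == "__main__":
    # E[X^2 Y^2] = 1 + 2*rho^2, so rho = 1/2 gives 3/2.
    print(moment(2, 0, Fraction(1, 2)))
```

Exact rational arithmetic is used so that the results can be compared with familiar special cases such as $E[XY] = \rho$ and $E[X^2Y^2] = 1 + 2\rho^2$ without rounding.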


Solved also by J. Anglesio (France), R. J. Chapman (U.K.), I. Kastanas, O. P. Lossers (The 
Netherlands), H. J. Seiffert (Germany), and the proposer. 


Collaborating editors: David F. Appleyard, Paul T. Bateman, Duane M. Broline, 
Barry W. Brunson, Frank S. Cater, Gulbank D. Charkerian, Underwood Dudley, 
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. 
Griggs, Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, 
Daniel J. Kleitman, Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer, 
Stephen L. Portnoy, J. O. Shallit, John Henry Steelman, Kenneth B. Stolarsky, 
David E. Tepper, Douglas B. Tyler, Daniel Ullman, and William E. Watkins. 


I know, indeed, and can conceive of no pursuit so antagonistic to the 
cultivation of the oratorical faculty ... as the study of Mathematics. 
An eloquent mathematician must, from the nature of things, ever 
remain as rare a phenomenon as a talking fish, and it is certain that 
the more anyone gives himself up to the study of oratorical effect the 
less will he find himself in a fit state to mathematicize. It is the 
constant aim of the mathematician to reduce all his expressions to 
their lowest terms, to retrench every superfluous word and phrase, 
and to condense the Maximum of meaning into the Minimum of 
language. He has to turn his eye ever inwards, to see everything in its 
dryest light, to train and inure himself to a habit of internal and 
impersonal reflection and elaboration of abstract thought, which 
makes it most difficult for him to touch or enlarge upon any of those 
themes which appeal to the emotional nature of his fellow-men. When 
called upon to speak in public he feels as a man might do who has 
passed all his life in peering through a microscope, and is suddenly 
called upon to take charge of an astronomical observatory. He has to 
get out of himself, as it were, and change the habitual focus of his 
vision. 

—J.J. Sylvester 
Mathematics of the 19th Century: Mathematical Logic, Algebra, Number
Theory, Probability Theory. A. N. Kolmogorov and A. P. Yushkevich, ed.


Basel/Boston/Berlin: Birkhäuser Verlag, 1992. xiv + 308 pp. Hardcover.
$149.50.


Reviewed by Karen Hunger Parshall 


The history of mathematics poses many difficulties for those who choose to pursue 
it. Who comprises the audience for the subject? Mathematicians? Historians of 
mathematics? Historians of science? Historians? Some of the above? All of the 
above? With what questions and issues should the subject deal? Who did what 
when, and who did what first? What theorems led up to theorem Y, and how were 
they proven? In what ways did the social and political milieu of the country in 
which mathematician X lived affect the way in which theorem Y was conceived or
proven? How did the particular biographical details of mathematician X influence 
mathematics as he or she chose to pursue it? How did the broader philosophical, 
intellectual, and cultural trends of the time influence the mathematical theories 
produced? Should the history of mathematics address all of these questions? Some 
of these questions? Should any single study in the history of mathematics try to 
answer all of these—and other possible—questions simultaneously? Or should 
distinct studies treat these various concerns separately? 

The issue of audience is a sticky one. Theoretically speaking, the answer to the 
first multiple choice question above should be “all of the above.” The history of 
mathematics should make mathematicians more conscious of their roots and 
inform them of the rich and complex past of their chosen field. For obvious 
reasons, it should interest historians of mathematics themselves. Historians of 
science, regardless of the particular science or time frame they study, should 
benefit from the added dimension that an understanding of concurrent mathemati- 
cal developments would bring to their work. And, similarly, historians, in order to 
understand fully the period defining their immediate research interests, should be 
aware of its scientific and mathematical matrix. 

This is, of course, just speaking theoretically. In practice, each possible con- 
stituency has its own notion of how the subject should be done. “Historians should 
write history and not mathematics,” the mathematician might say, and the histori- 
ans—and even some historians of science—might agree. Or the mathematician 
might just as likely charge the historian to “tell us where the theorems came from” 
and to “forget all of the historical mumbo-jumbo about politics and philosophy.” 
Here, however, those who find technical mathematical discussions less than 
illuminating would hardly concur. One mathematician, John Ewing, has even 
argued that the historian of mathematics should write “poetic history—the kind 
that Homer wrote—and the kind that many mathematicians want to read” [2, 
p. 93]. In his view, the history of mathematics should satisfy the emotional needs of
the mathematician seeking to appreciate and understand “both the heroes and the
villains” of the past [2, p. 93]. It should focus more on telling a good story, in
emulation of the “history” of Eric Temple Bell [1], and less on analyzing the “dull
details of a 100-year-old paper” [2, p. 94]. This implied characterization of good
history would certainly not attract many—if any—adherents within the historical 
community. Since the history of mathematics is, after all, history, it cannot ignore 
the prescripts of its parent discipline any more than it can avoid the fact that it 
deals with mathematics. 

The problematic nature of audience immediately suggests the difficulties sur- 
rounding the second set of questions posed above, those involving content and 
focus. Because the way in which historians ply their trade changes over time, what 
are deemed interesting and appropriate questions for historical investigation also 
experience historical change. This implies, moreover, that the historical standards 
of just a few decades ago may no longer obtain. Compare, for example, the 
historical climate of 1972, when Morris Kline published his mammoth book, 
Mathematical Thought from Ancient to Modern Times, with that of today [3]. 
So-called internalist practices in the history of science still held sway within the 
subdiscipline of the history of mathematics. The internal aspects of 
mathematics—the theorems along with their development and extension in the 
published works of mathematicians—defined the accepted scope of historical 
inquiry. Factors beyond the bounds of the technical evolution of the subject 
received scant—or, more likely, no—attention. Today, however, historians of 
mathematics try increasingly to respond to trends in scholarship both in the history 
of science and in history in general. While continuing to recognize the value of 
securing an historical grasp of internal mathematical developments, they also 
appreciate the important insights that an analysis of external social, intellectual, 
political, philosophical, and cultural factors can provide into their science. This 
heightened awareness and its concomitant widening of the historical perspective 
on mathematics have significantly deepened our understanding of the subject. 
Books like Herbert Mehrtens’s Moderne, Sprache, Mathematik [5] and Joan 
Richards’s Mathematical Visions: The Pursuit of Geometry in Victorian England [6], 
each in its own very different way, reflect these changing historiographical stan- 
dards. 

These observations now bring us to the book under review. Originally published 
in the former Soviet Union in 1978, Mathematics of the 19th Century represented 
the fruits of an ongoing, collaborative research program, which had already 
produced the three-volume History of Mathematics from Antiquity to the Early 
Nineteenth Century between 1970 and 1972 [7] and which was scheduled to 
generate three more volumes on various aspects of the history of nineteenth-cen- 
tury mathematics in addition to two volumes on the twentieth century. In their 
preface, this book’s editors, A. N. Kolmogorov and A. P. Yushkevich, stated their 
particular volume’s general objectives. It sought to “consider the development of 
mathematics not simply as the process of perfecting concepts and techniques for 
studying real-world spatial forms and quantitative relationships but as a social 
process as well” [p. ix]. It also aimed to “examine the interaction of mathematics
with the social structure, technology, the natural sciences, and philosophy” [p. ix].
In other words, this book, a product of the late 1970’s, was intended to address 
both internalist and externalist issues as they pertain to the history of nineteenth- 
century mathematics in an effort both to understand mathematics technically and 
to situate it within a broader cultural framework. Had it achieved its stated goals,
it would have been both path-breaking for its day and remarkably timely in its
presentation in an English translation in the early 1990’s. Instead, however, this 
book provides a traditional, internalist account of the technical development of 
four areas of mathematics—mathematical logic, algebra, number theory, and 
probability—not during the whole of the nineteenth century, as the title implies, 
but rather from roughly 1800 to the more “natural chronological boundary” [p. xiii] 
of the 1870’s or 1880’s. Moreover, as the editors admit, their book “is more a 
collection of essays than a connected history of nineteenth...-century mathematics”
[p. x]. As such, it provides the reader with historical glimpses—and these of
varying quality—of selected areas as opposed to an integrated discussion of the 
history of mathematics in the nineteenth century. 

The first essay treats of mathematical logic and is the least satisfying in the 
collection. By far the shortest chapter in the book (only thirty-three pages), it also 
opens not around 1800, in keeping with the book’s prescribed time frame, but with 
the work of Aristotle in the fourth century B.C.! In two brief pages, it skims
through some 2000 years to focus (for six pages) on the work of Gottfried Wilhelm 
Leibniz before reaching the nineteenth century and the ideas of Augustus De Morgan,
George Boole, William Stanley Jevons, John Venn, Ernst Schröder, and
finally P. S. Poretskii. This hyper-extended chronology almost necessarily results in 
an overly superficial and highly disjointed historical treatment of the subject 
matter at hand. Furthermore, in discussing the work of the nineteenth-century 
figures, the author, Z. A. Kuzicheva, merely presents brief summaries of the 
results in a chosen work rather than providing any meaningful historical—or 
mathematical—context for those results. This is not even good internalist history, 
much less the kind of historical approach promised in the preface, and that is 
really too bad because this chapter had much to offer. In particular, it touches on, 
but unfortunately does not explore, issues like the reception of Jevons’s ideas in 
Russia [p. 24] and the transmission of ideas from Great Britain to the Continent 
and to Russia [pp. 27-31]. Had the answers to these and related questions been 
fleshed out, this chapter would have not only shed real light on the evolution of 
mathematical logic in Russia but also satisfied, at least to some extent, the editors’ 
stated goal of viewing nineteenth-century mathematics through a wider historical 
lens. 

The second chapter, on algebra and algebraic number theory by I. G. Bashmakova
and A. N. Rudakov (with the assistance of A. N. Parshin and E. I.
Slavutin), contrasts markedly with the first in both range and depth. In 100 pages, 
the authors survey such topics as the theory of equations, group theory, linear 
algebra, the theory of nonassociative and associative algebras, invariant theory, and 
the theories of quadratic forms and ideals, while they focus more specifically on 
the work of figures like Carl Friedrich Gauss, Niels Henrik Abel, Evariste Galois, 
William Rowan Hamilton, Ernst Kummer, Richard Dedekind, and Leopold Kro- 
necker. Theirs is a difficult task given that algebra, as we now conceive of it, did 
not exist in the nineteenth century. The authors of this chapter thus attempt to tie 
together the various threads which ultimately interwove to form the subject. In so 
doing, however, they, more often than not, reach back into the nineteenth century 
from their contemporary point of view and “find” precursors to modern notions 
instead of analyzing the historical setting on its own terms. The chapter is
peppered with statements like “Gauss’ study of forms of the same discriminant was
in effect a study of the fundamental properties of cyclic and general abelian
groups” [p. 37] and “What Gauss wants to say is that the group he is considering is
not cyclic but is a direct sum of two or more cyclic groups” [p. 65] and “To this end
he [E. I. Zolotarev] constructed in effect the semilocal ring of p-integral numbers
in Q(θ)” [p. 111]. While it is mathematically the case that what Gauss and
Zolotarev were doing can be expressed in these modern-day terms, to describe 
their work in this way with no further qualification suggests that Gauss thought in 
terms of groups and Zolotarev understood semilocal rings. These are value-laden 
terms to contemporary mathematicians and, as such, convey a skewed picture of 
the mathematics of the historical figures. True, the historian needs to communi- 
cate with the mathematician in understandable terms, but the changed historical 
climate also requires that that communication nevertheless reflect a sensitivity to 
the historical setting. In short, the authors of this, and indeed, of all of the 
chapters of this book, have quite unavoidably adopted the historiographical stan- 
dards of the time in which they were initially writing, but that tends to render their 
work somewhat anachronistic today. 

This almost inevitable aspect of the chapter and of the book aside, the 
discussion of nineteenth-century algebra brings quite a lot of material together in 
one manageable essay. In particular, it provides for the non-Russian reader an 
account of Zolotarev’s work in the theory of integral and p-integral numbers. It 
also briefly and tantalizingly touches on—but does not delve into—the cultural 
and mathematical interaction between Zolotarev and the European mathematical 
community during Zolotarev’s trips abroad in 1872 and 1876 [pp. 108-109]. More 
analysis of the effects of these trips on Zolotarev’s process of mathematical 
maturation would have significantly deepened the reader’s understanding of the 
history of Russian mathematics, of the relationship between that history and 
parallel developments in western Europe, and of the dissemination of mathematical
theories across political boundaries. The chapter would also have benefitted
from a well-developed conclusion which attempted some synthesis—around the 
notion of the establishment of an area called algebra—of the different sets of 
ideas discussed. 

Like the second chapter, the third represents a substantial study of a panoply of 
topics which can be unified under one rubric, in this case, number theory. Written 
by E. P. Ozhigova (with the assistance of A. P. Yushkevich), this essay focuses 
particularly on the arithmetic theory of quadratic forms, the geometry of numbers, 
analytic methods in number theory, and the problem of transcendental numbers, 
and basically proceeds as a series of synopses of important works in the various 
topical categories. Here, the reader sees the ideas of some of the key European 
figures—Charles Hermite, Henry J. S. Smith, Hermann Minkowski, Peter 
Lejeune-Dirichlet, Bernhard Riemann, and Joseph Liouville—juxtaposed with the 
roughly contemporaneous researches of the Russians, A. N. Korkin, Zolotarev, 
A. A. Markov, G. F. Voronoi, P. L. Chebyshev, and N. V. Bugaev, among others. 
Although the chapter rarely fleshes out meaningful historical connections between 
the Russians and the Europeans, it does provide insights into the lesser known (at 
least among non-Russian readers) work of key Russian mathematicians. In fact, 
this chapter—and, indeed, this volume—is at its best in its incorporation of 
important Russian mathematical contributions into what have come to be recog- 
nized as mainstream historical developments in western Europe. Aside from 
occasional mentions of Chebyshev and Markov, most non-Russian histories which 
deal with nineteenth-century mathematics tend to ignore major Russian contribu- 
tors because of their lack of influence outside of Russian-speaking regions. The 
reader of this volume, however, will come away with a heightened appreciation of 
the Russian mathematical endeavor in the nineteenth century.

Perhaps precisely because of this, though, that same reader will probably also
come away wanting to know more than is provided about the Russian mathemati- 
cal scene. In Chapter 3, for example, numerous references to St. Petersburg 
appear: Zolotarev and Markov earned their Master’s Degrees there in 1869 and 
1880, respectively [pp. 144 and 151]; Korkin held a professorship there from 1868 
until his death in 1908 [p. 145]; Voronoi first studied and then served on the faculty 
there [p. 166]; and, in fact, the theory of algebraic numbers occupied the attention 
of “many mathematicians in St. Petersburg” in the 1890’s [p. 166]. All of these 
references lead the reader to wonder what St. Petersburg was like as an academic 
institution and as an educational environment. How did so many associated with it 
come to be interested in algebraic number theory? Does it make sense to speak of 
a St. Petersburg “school” of algebraic number theory? If so, what characterizes its 
approach to the subject? If not, why not? The very short (half-page-long) conclu- 
sion to the chapter only piques the reader’s curiosity further on these and related 
issues. “The sharply increased role of number theory,” the authors state, “is
attested by the long list of names of outstanding mathematicians who studied it
in the 19th century, as well as the appearance of scientific schools, of which the St.
Petersburg school of number theory is a remarkable example” [p. 209]. The 
presentation up to this point has certainly and rightly suggested the presence of 
such a school in St. Petersburg, but the authors have provided neither an historical 
argument for it nor an analysis of it. Again, this is the sort of social historical 
discussion the preface had led the reader to expect, and its omission weakens the 
chapter’s overall effectiveness. 

The fourth and final chapter, by B. V. Gnedenko and O. B. Sheinin, does 
address the issue of a mathematical school at St. Petersburg, namely, the school of 
probability begun by B. Ya. Bunyakovskii, driven by Chebyshev, and perpetuated 
by Chebyshev’s student, Markov. Moreover, it provides more historical indications 
of the mathematical influences on that school than do the preceding chapters.
Here, the authors trace the evolution of probability in France from Laplace 
through Poisson and Cauchy, and they link that discussion to their presentation of 
the St. Petersburg school with the remark that “the probability-theoretic studies of 
the Paris school of mathematicians...soon became known and were further 
developed in Russia” [p. 247]. They also make passing mention of “the powerful 
influence of Laplace and Poisson” on Bunyakovskii [p. 248] and of Chebyshev’s
references to “the celebrated Geometer,” Poisson, and his works [p. 256], in order 
to reinforce these intellectual connections. 

Despite these sorts of historical linkages, however, the material presented here 
and elsewhere in this book tends to be treated synoptically and with a sense of 
tracing the prehistory of some contemporary idea or theory, rather than in 
historical context. Thus, the reader is told that the form of the integral limit 
theorem which Laplace gave in his Théorie analytique des probabilités of 1812 [4]
“may be considered the origin of later studies that found completion in S. N. 
Bernshtein’s paper” of 1943 [p. 216] and that “Laplace’s reasoning [on a problem 
applicable to election procedures] may be seen to belong to the prehistory of rank 
correlation and of the statistics of random processes” [p. 219]. This sort of “result
tracing” provides a certain amount of useful information, but it generally fails to
illuminate the historical figure’s approach to or understanding of his or her work. 
It, too, reflects an older historiographical standard. 

From the perspective of 1994, then, Mathematics of the 19th Century suffers 
from various shortcomings. There are also a few mistakes which, if they existed in
the original Russian text, should certainly have been corrected here. For example,
the Irish mathematician, George Salmon, is said to have done his work “in 
England” [p. 80] and the definition of the Jacobian appears incorrectly [p. 82]. 
Moreover, the book’s princely price tag of $149.50 hardly warrants the inordinate 
number of typographical errors and the less-than-felicitous English exposition
that appears in more than a few places, the latter despite the efforts of a
talented team of translators which included Abe Shenitzer, H. Grant, O. B. 
Sheinin, and A. B. Aries. Taking all of this into account, however, the reader will 
still find much valuable technical information on four areas of the history of 
nineteenth-century mathematics in this volume.


Calculus Gems: Brief Lives and Memorable Mathematics. By George F.
Simmons, McGraw-Hill, Inc., New York, xiv + 355 pp.


Reviewed by David J. Pengelley 


Today we see the beginning of a flood of new publications and curricular programs 
aimed at ‘reforming’ how we teach calculus, rightly spurred by disenchantment 
with the status quo. While there is considerable diversity in this flood, there is also 
an emerging nationwide convergence toward common themes for reform. Calculus 
Gems, however, does not fit conveniently into the mold and buzz words of these 
themes. It is instead a needed, albeit likely unintended, challenge to ‘reformed’ 
and ‘unreformed’ alike. I hope that the congealing orthodoxy of calculus reform 
will be receptive to the enriching and inspiring vantage point this book provides. In 
fact I am part of that orthodoxy—I have been one of the principals involved in a 
major calculus ‘reform’ project from its inception. 

What are some of the laudable features of revised calculus programs? We now 
train students in collaborative learning and have them actively involved in the 
classroom. We provide alternatives to lecturing, and emphasize writing as a way to 
learn. We seek a ‘leaner and livelier’ syllabus with more concentration on the
understanding of concepts and on meaningful and challenging applications, and
less emphasis on standard calculational drill, especially things done better by 
calculator or computer. We expect students to become more active learners both 
in and out of class, tackling larger projects, individually or in groups, and acquiring 
and melding the tools of calculus to solve multi-step or open-ended problems. And 
we recognize the usefulness of calculators and computers as new experimental or 
visual tools. 

Yet a fundamental spark is still missing from these aims—precisely that which 
often inspires us as mathematicians. We are motivated by seeing mathematics yield 
beautiful, often surprising results, introducing new mysteries even as it illuminates 
old ones; and when it fascinates and captures our imagination. Moreover, as each 
of us follows and perhaps contributes to the great mathematical discoveries of our 
day, are we not crucially stimulated by the fabric of human, philosophical and 
scientific interconnections involved: people and places, flows of ideas between 
groups and individuals, and lively interplay with other ideas in mathematics, 
science, and philosophy? 

Do our students feel this kind of excitement when we exhort them that the 
fundamental theorem of calculus, the ideas it connects, and its consequences, are 
some of the greatest discoveries of the human species, rather than merely an 
eminently practical calculational device? How can we possibly convince them of 
this in an age when they ‘know’ that areas, volumes, graphs, etc., are available at 
the push of a few buttons, thanks to the brute calculating power of the digital 
computer? Do our students gain any sense of profundity or beauty lurking in the 
connections between infinite processes and numbers like π and e? Do they even
see any importance in a distinction between these numbers and their calculator
approximations? 

It is almost as if we hide the crown jewels of the infinite from our students, so
caught up are we in imparting training for routine, technocratic applications. If we 
expect students to appreciate the beauty and power of calculus, we must immerse 
them in those very aspects which spark the imagination and delight us with a sense 
of awe. Perhaps because calculus is a mature subject we have forgotten that such 
inspiring jewels abound. They were discovered by some of the world’s greatest 
minds, and the context of their discovery is as rich (possibly more so) in human, 
scientific, and philosophical controversy and interest as that of present day science. 
We and our students should dare to play with them and struggle with their history 
and import. 

Turning with this perspective to the work under review, Calculus Gems: Brief 
Lives and Memorable Mathematics, let us begin with what author George Simmons 
tells us about his book. “My overall aims are bound up with the question, ‘What is 
mathematics for?’ and with its inevitable answer, ‘To delight the mind and help us 
understand the world.’ I hold the logically impeccable view that there are only two 
kinds of students... : those who are attracted to mathematics; and those who are 
not yet attracted, but might be. My intended audience embraces both types.” 

The book is in two parts, adapted from two massive appendices to his Calculus 
with Analytic Geometry textbook. The first, “entitled Brief Lives, amounts to a 
biographical history of mathematics from the earliest times to the late nineteenth 
century,” with two main purposes, first “to ‘humanize’ the subject, to make it 
transparently clear that great human beings created it by great efforts of genius, 
and thereby to increase students’ interest... . [A] tiny minority of men and women 
are drawn irresistibly to problems: their minds embrace them lovingly and wrestle
with them tirelessly until they yield their secrets. It is these who have taught the
rest of us most of what we know and can do,... . I have written about some of 
these people from our past in the hope of encouraging a few in the next 
generation.” The second purpose is “connected with the fact that many students 
from the humanities and social sciences are compelled against their will to study 
calculus... . The profound connections that join mathematics to the history of 
philosophy, and also to the broader intellectual and social history of Western 
civilization, are often capable of arousing the passionate interest of these otherwise 
indifferent students.” 

Introducing the second part of his book, entitled Memorable Mathematics, 
Simmons writes: “...I have collected a considerable number of miscellaneous 
topics [‘nuggets’] from number theory, geometry, science, etc., which I have used 
for the purpose of opening doors and forging links with other subjects ...and also 
for breaking the routine and lifting the spirits. ...in the hope of making a few 
more converts to the view that mathematics, while sometimes rather dull and 
routine, can often be supremely interesting.” A final aspect of the author’s 
philosophy appears in his remarks introducing “The Bernoulli numbers and some 
wonderful discoveries of Euler.” He derives the Maclaurin series for the cotangent
via the generating function involving Bernoulli numbers, and also the partial
fraction decomposition of the cotangent from Euler’s infinite product expansion 
for the sine. Comparing coefficients of the two expansions yields the beautiful 
formula 

\[ \sum_{k=1}^{\infty} \frac{1}{k^{2n}} = (-1)^{n-1}\,\frac{2^{2n-1}\,\pi^{2n}}{(2n)!}\,B_{2n}. \]

In introducing these derivations, acknowledging that not every step is fully supported
by the rigor available in a calculus class, Simmons says: “[The] mere fact
that we are not able to seal every crack in the reasoning seems a flimsy excuse for 
denying students an opportunity to glimpse some of the wonders that can be found 
in this part of calculus.” 

The division between the two parts of the book is not as great as their titles 
suggest. The Brief Lives are chock full of small mathematical gems and the history 
and development of concepts, often beautifully woven into the context of biogra- 
phy. An excellent example is Simmons’ treatment of Fermat. Along with the 
biography we find a comparison of the ideas of Descartes and Fermat on analytic 
geometry, and the details of Fermat’s method of finding tangents, complete with a 
quote from Newton crediting Fermat for his own ideas. There is also reference to a 
‘nugget’ in the second part of the book giving Fermat’s ingenious method for 
calculating $\int_0^b x^n\,dx$ by using a geometric, rather than arithmetic, partition of the
interval. We find ties to modern science, beginning with Fermat’s proof of Snell’s 
refraction law from his principle of least time, and discussion of how this led to the 
calculus of variations and Hamilton’s principle of least action. And finally, a 
comparison of the discoveries of Fermat in calculus with those of Newton and 
Leibniz, and a description of several of his important results in number theory. 
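
The geometric-partition idea mentioned above can be sketched in modern notation. This is a reconstruction under stated assumptions rather than Simmons' own text: it takes $n$ to be a positive integer, integrates over $[0, b]$, and introduces a ratio $r$ with $0 < r < 1$ purely for illustration.

```latex
% Partition [0, b] by the points b, br, br^2, ... (0 < r < 1) and use the
% right-endpoint rectangle on each subinterval [b r^{k+1}, b r^{k}].
\begin{align*}
S(r) &= \sum_{k=0}^{\infty} \left(b r^{k}\right)^{n} \left(b r^{k} - b r^{k+1}\right)
      = b^{n+1}(1 - r) \sum_{k=0}^{\infty} r^{(n+1)k} \\
     &= b^{n+1}\,\frac{1 - r}{1 - r^{n+1}}
      = \frac{b^{n+1}}{1 + r + r^{2} + \cdots + r^{n}}, \\
\lim_{r \to 1^{-}} S(r) &= \frac{b^{n+1}}{n+1} = \int_{0}^{b} x^{n}\,dx .
\end{align*}
```

The point of the geometric partition is that the approximating sum becomes a geometric series, so no closed formula for sums of powers of integers is needed.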

The biographies are beautifully written and make engaging reading. The style is 
often captivating and witty, occasionally irreverent, and sometimes wryly humor- 
ous, as in the remark that Euler’s “personal life was as placid and uneventful as is 
possible for a man with 13 children” or that Cauchy was “a great mathematician 
who happened also to be a sincere and narrow-minded bigot.” Many of the 
biographies are prose gems worth reading out loud in class, as I often do with
Euler: 


“...He was perhaps the most prolific author of all time in any field. From
1727 to 1783 his writings poured out in a seemingly endless flood, constantly 
adding to every known branch of pure and applied mathematics, and also to 
many that were not known until he created them. He averaged about 800 
printed pages a year throughout his long life, and yet he almost always had 
something worthwhile to say and never seems long-winded. ...Euler evi- 
dently wrote mathematics with the ease and fluency of a skilled speaker 
discoursing on subjects with which he is intimately familiar. His writings are 
models of relaxed clarity. He never condensed, and he reveled in the rich 
abundance of his ideas and the vast scope of his interests. The French 
physicist Arago, in speaking of Euler’s incomparable mathematical facility, 
remarked that ‘He calculated without apparent effort, as men breathe, or as 
eagles sustain themselves in the wind.’...He was also a man of broad 
culture, well versed in classical languages and literatures (he knew the 
Aeneid by heart), many modern languages, physiology, medicine, botany, 
geography, and the entire body of physical science as it was known in his 
time. However, he had little talent for metaphysics or disputation, and came 
out second best in many good-natured verbal encounters with Voltaire at the 
court of Frederick the Great.” 


The thirty-three mathematical biographies have a vast reach, from Thales in the
7th century B.C. to Weierstraß near the turn of our own waning century. They are
not restricted to calculus, but include for example the development of the concept 
of mathematical proof from the Greek tradition of criticism and skepticism. In fact 
the biographies touch in a meaningful way on virtually every aspect of mathematics 
up into the nineteenth century. For instance, we learn about the astonishing 
discovery by Heiberg in 1906 of The Method by Archimedes, and the train of ideas 
leading from Apollonius’ Conics to Kepler’s laws of planetary motion and thence 
to Newton’s theory of gravitation and laws of motion. We are treated to a 
derivation of the planar forms for conic sections arising from sectioning a cone, 
and why these are called ‘parabola, hyperbola, ellipse’; and also to the classifica- 
tion of Pythagorean triples. We learn Cavalieri’s Principle for computing volumes 
by his ‘geometry of indivisibles,’ and are invited to try our own hand at this in a 
collection of problems (a few sections of the book have good problem sets; more
would be a welcome addition). Simmons sometimes expresses strong opinions, for 
example “it was Cavalieri the disciple and not Galileo the master who first 
published the correct parabolic law of projectile motion, which most scientists 
assume is due to Galileo. In fact, Galileo was not much of a mathematician, and 
managed to confuse himself about the geometry.” 

The cycloid is a recurring theme scattered throughout the book. When reading 
about Torricelli we learn that Galileo incorrectly concluded that the area under 
one arch was not quite three times the area of the generating circle, but that his 
student Torricelli proved it was exactly that; and the next major step in its study 
was Christopher Wren’s 1658 demonstration that its length is four times the 
diameter of the circle. We also learn that in 1645 Torricelli performed the very 
first calculation of the length of a noncircular curve, namely the equiangular spiral, 
and how difficult this was to believe: Descartes had asserted it was impossible, 
“Geometry should not include lines that are like strings, in that they are sometimes
straight and sometimes curved, since the ratios between straight and curved
lines are not known, and I believe cannot be discovered by human minds,” while 
Galileo responded, “Who is so blind as not to see that, if there are two equal 
straight lines, one of which is then bent into a curve, that curve will be equal to the 
straight line?” 
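
The two cycloid facts quoted above lend themselves to a quick numerical check. The sketch below is not taken from the book: it assumes the standard parametrization x = r(t - sin t), y = r(1 - cos t) for one arch (0 <= t <= 2*pi), and the function name `cycloid_arch` and its `steps` parameter are hypothetical choices made for illustration.

```python
import math


def cycloid_arch(radius=1.0, steps=100_000):
    """Return (area under one arch, arc length of one arch) via the midpoint rule.

    Assumes the parametrization x = r (t - sin t), y = r (1 - cos t), 0 <= t <= 2*pi.
    """
    dt = 2 * math.pi / steps
    area = 0.0
    length = 0.0
    for i in range(steps):
        t = (i + 0.5) * dt                     # midpoint of the i-th subinterval
        dx = radius * (1 - math.cos(t))        # dx/dt
        dy = radius * math.sin(t)              # dy/dt
        y = radius * (1 - math.cos(t))         # height of the curve
        area += y * dx * dt                    # integral of y dx
        length += math.hypot(dx, dy) * dt      # integral of the speed
    return area, length


area, length = cycloid_arch()
print(area / math.pi)   # ratio of the area to that of the generating circle (r = 1)
print(length / 2.0)     # ratio of the length to the circle's diameter (r = 1)
```

With the default settings the two printed ratios should come out very close to 3 and 4, in line with the results the review attributes to Torricelli and Wren.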

The story of the cycloid continues, not only through the biographies of Huygens, 
Newton, and the Bernoulli brothers, but also in four nuggets in the second part on 
the catenary, brachistochrone, tautochrone, and evolutes and involutes of the 
cycloid. For each a full and delightful mathematical treatment is presented, and 
the prominence of this 17th century theme comes through clearly, for instance in 
Jean Bernoulli’s “...you will be petrified with astonishment when I say that this 
very same cycloid, the tautochrone of Huygens, is also the brachistochrone we are 
seeking,” and his reaction to Newton’s anonymous solution: “I recognize the lion 
by his print.” This cycloid theme is so captivating that I wish it were graced with a 
unified exposition. 

The mathematical biographies of Newton and Leibniz are strong, fascinating, 
and awe inspiring. I have the sense that Simmons’ own love and admiration is 
perhaps greatest for the 17th century, stretching here from Descartes and Kepler 
to Newton, Leibniz, and the Bernoullis, as evidenced by his quote from Alfred 
North Whitehead: 


“A brief, and sufficiently accurate, description of the intellectual life of the 
European races during the succeeding two centuries and a quarter up to our 
own times is that they have been living upon the accumulated capital of ideas 
provided for them by the genius of the seventeenth century. The men of this 
epoch inherited a ferment of ideas attendant upon the historical revolt of the 
sixteenth century, and they bequeathed formed systems of thought touching 
every aspect of human life. It is the one century which consistently, and 
throughout the whole range of human activities, provided intellectual genius 
adequate for the greatness of its occasions.” 


Continuing, the mathematical biography of Euler is equally awe inspiring, but by 
the mid-18th century, Simmons’ biographies become on average somewhat weaker, 
more personal biography and less mathematical biography, and I sense that this is 
not just because there are fewer giants and more advanced mathematics to explain. 
There are exceptions, with lovely, fascinating writings on both the lives and work of 
Gauß, Riemann, and Weierstraß, and to a lesser extent Liouville and Chebyshev,
but weakness in those on Lagrange, Laplace, Fourier, Cauchy, Abel, and Dirichlet. 
Simmons gives a beautiful exposition of Riemannian geometry, and Riemann’s 
work on the prime number theorem via the zeta function. The wonderful 
Weierstraß biography, newly written for this book, has lucid descriptions of how
elliptic functions arose as natural generalizations of trigonometric functions, and of
space-filling curves, and a fascinating footnote on the work of Bolzano. Thus this
part of the book ends with a bang; one can hardly imagine a more enticing 
historical introduction to the ideas of higher mathematics for a student of 
calculus. 

The author’s view of the importance of making the contribution of these 
mathematicians known and appreciated is indicated by his quote from Arthur 
Koestler:


“In the index to the six hundred odd pages of Arnold Toynbee’s A Study of 
History, abridged version, the names of Copernicus, Galileo, Descartes and 
Newton do not occur...yet their cosmic quest destroyed the mediaeval 
vision of an immutable social order in a walled-in universe and transformed 
the European landscape, society, culture, habits and general outlook, as 
thoroughly as if a new species had arisen on this planet.” 


The biographies also make a wealth of specific humanistic, psychological, social, 
and scientific connections regarding the development of Western civilization, with 
tantalizing footnotes and suggestions for further reading, and marvelous quotations 
throughout. For instance, regarding discovery, “It is often seen in the history of
ideas that one generation’s disaster is an opportunity for the next,” or Linus 
Pauling’s comment that the way to have many good ideas is “Have lots of ideas 
and throw away the bad ones. You aren’t going to have good ideas unless you have 
lots of ideas and some sort of principle of selection.” Again, Simmons often 
expresses strong opinions: He first quotes the response of English political philoso- 
pher Thomas Hobbes to Cavalieri’s determination that a solid (the hyperbolic solid 
of revolution) can have infinite extent but finite volume, as “To understand this for
sense, it is not required that a man should be a geometrician or a logician, but that 
he should be mad,” and then goes on to say “Hobbes had the curious belief that 
mathematical theorems can be attacked by ridicule and invective as if they were 
obnoxious planks in an opponent’s political platform.” However, some of the 
quotations are unfortunately stated without reference, and in general I wish that
some of the claims, as profound and provocative as they are, were better docu- 
mented, lest the reader wonder whether the desire for a captivating story some- 
times takes over. 

Some of the Memorable Mathematics pieces in the second part have already 
been mentioned above; a few more I will mention now, leaving yet others for the
reader to discover when she/he reads the book. All are eminently accessible to 
calculus students, and well worth presenting or having students work on as 
projects. 


• Several proofs of the Pythagorean Theorem, including two by the Indian
  mathematician Bhaskara, are followed by beautiful proofs of Heron’s formula
  for the area of a triangle in terms of its side lengths, and of Brahmagupta’s
  generalization to quadrilaterals (great for a precalculus class?). There seems to
  be no other work by non-Western mathematicians in the book, however.

• A beautiful description of Archimedes’ quadrature of a segment of a parabola,
  finding it is 4/3 the area of a certain inscribed triangle (perfect for the second
  week of a ‘lean and lively’ calculus course?).

• The earliest, alluring, precise determination of the area of a region bounded
  by curves (no, it’s not a circle, and I won’t give it away).

• An exposition of Archimedes’ Method, the equilibrium of moments in balance
  he described in his letter to Eratosthenes for finding the volume of a sphere.
  The earliest appearance of the basic idea of integral calculus, this is quite
  suitable for reading and discussion with first semester students. Archimedes
  himself said “I am persuaded that this method will be of no little service to
  mathematics. For I foresee that once it is understood and established, it will
  be used to discover other theorems which have not yet occurred to me, by
  other mathematicians, now living or yet unborn.” How ironic that this treatise
  was lost until the year 1906!

• Several gems deriving particular infinite series and products, such as Wallis’
  infinite product for $\pi/2$, Leibniz’s ingenious derivation of the series for $\pi/4$
  which bears his name using integration by parts, and three delightful derivations
  of Euler’s summation of $\sum 1/n^2$, including the master’s own method
  based on factoring $(\sin x)/x$ into an infinite product.

• The distribution of primes.

• The irrationality of $\pi$, and of rational powers of e, Liouville’s Theorem
  on approximating transcendentals with rationals, and Hilbert’s version of
  Hermite’s proof that e is transcendental.

• The product form of the zeta function, and a proof that the series of
  reciprocals of the primes diverges.

• Derivations of Kepler’s three laws (including some nice problems).
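
For readers who want to watch three of the series and products named in this list converge, here is a minimal numerical sketch. It is an illustration only, not material from the book: the function names and the choice of 100,000 terms are hypothetical, and the limits used are the classical ones (Wallis' product tends to pi/2, Leibniz's series to pi/4, and Euler's sum of reciprocal squares to pi^2/6).

```python
import math


def wallis_product(terms):
    """Partial Wallis product of (2n/(2n-1)) * (2n/(2n+1)); tends to pi/2."""
    product = 1.0
    for n in range(1, terms + 1):
        product *= (2 * n) * (2 * n) / ((2 * n - 1) * (2 * n + 1))
    return product


def leibniz_series(terms):
    """Partial sum of 1 - 1/3 + 1/5 - ...; tends to pi/4."""
    return sum((-1) ** k / (2 * k + 1) for k in range(terms))


def basel_sum(terms):
    """Partial sum of 1/n^2; tends to pi^2/6."""
    return sum(1.0 / (n * n) for n in range(1, terms + 1))


N = 100_000
print(wallis_product(N), math.pi / 2)
print(leibniz_series(N), math.pi / 4)
print(basel_sum(N), math.pi ** 2 / 6)
```

All three converge slowly (the error shrinks only like 1/N), which is part of what makes the closed forms so striking.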


The gems have a sumptuous finale, beginning with the extension of the complex 
numbers to the quaternions, discussion of more general abstract “linear algebras,” 
nonassociative algebras, and then an impressive application of the quaternions to 
prove Lagrange’s theorem that every natural number can be expressed as the sum 
of four squares. (This is an example of several spots where more historical 
perspective could enrich the gems, and more cross references to the Brief Lives 
would be illuminating.) 
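
One standard route from quaternions to the four-square theorem, which may or may not match Simmons' presentation in detail, rests on the fact that the quaternion norm is multiplicative, so a product of two sums of four squares is again a sum of four squares. The sketch below checks that identity on integer quaternions and, separately, finds four-square representations by brute-force search. The names `quat_mul`, `norm`, and `four_squares` are hypothetical helpers, and the search is a verification aid, not the proof.

```python
from math import isqrt


def quat_mul(p, q):
    """Hamilton product of quaternions given as (a, b, c, d) = a + bi + cj + dk."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (
        a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,
        a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,
        a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,
        a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2,
    )


def norm(q):
    """Sum of the squares of the four components."""
    return sum(x * x for x in q)


def four_squares(n):
    """Search for (a, b, c, d) with a <= b <= c <= d and a^2 + b^2 + c^2 + d^2 = n."""
    for a in range(isqrt(n) + 1):
        for b in range(a, isqrt(n - a * a) + 1):
            for c in range(b, isqrt(n - a * a - b * b) + 1):
                rest = n - a * a - b * b - c * c
                d = isqrt(rest)
                if d >= c and d * d == rest:
                    return (a, b, c, d)
    return None  # Lagrange's theorem says this is never reached for n >= 0


p, q = (1, 2, 3, 4), (5, -6, 7, 8)
print(norm(quat_mul(p, q)) == norm(p) * norm(q))  # multiplicativity of the norm
print(four_squares(1993))
```

Because of the norm identity, a proof along these lines only has to handle primes; the brute-force search simply confirms the statement for whatever range one cares to test.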

Simmons ends by enticing readers in further directions with the Armenian 
proverb that “mathematics, love, and home remodeling are the principal human
enterprises in which one thing leads to another.” 

The somewhat artificial division between the “Brief Lives” and the “Memora- 
ble Mathematics” could be bridged in large part by the incorporation of original 
sources. Using the words and methods of its creators would unify the mathematics 
with its human and developmental context, and bring it even more alive. It is hard 
to overestimate the positive impact that reading original sources can have on 
learning mathematics. 

Every mathematician will learn something new and interesting from Simmons’ 
book. It shows calculus (and more) as a lively, powerful, and challenging human 
struggle. This is what both we and our students need to feel, and Simmons reminds 
us of its wonders from a historical perspective. Students who work with these ideas 
will gain motivation and awe we otherwise fail to provide. 

When my niece started college at a (very large) premier west coast university, 
where her calculus class numbers several hundred students, I gave her a copy of 
Calculus Gems, in a desperate attempt to counteract all that she will encounter 
there militating to convince her that calculus is a class well worth dropping, not 
treasuring. As with everything about an eighteen year old, only time will tell. 


Department of Mathematical Sciences 


TELEGRAPHIC REVIEWS


Edited by Arnold Ostebee and Paul Zorn 
with the assistance of the Mathematics Departments of 
Carleton, Macalester, and St. Olaf Colleges 


Telegraphic Reviews are designed to alert readers in a timely manner to new books 
and computer software appropriate to mathematics teaching and research. Special 
codes classify reviews by subject area and appropriate use: 

T : Textbook
P : Professional Reading
S : Supplementary Reading
L : Undergraduate Library
C : Computer Software
13 : Grade Level
1-4 : Semester
** : Special Emphasis
?? : Questionable


Readers are advised that price information is subject to change. Selected books and 
software packages receive a second, more extensive review in the Monthly. 


Books and software submitted for review should be sent to Book Reviews Editor, 
American Mathematical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield,
MN 55057-1098.


General, P**, L*. Handbook of Writing for the 
Mathematical Sciences. Nicholas J. Higham. 
SIAM, 1993, xii + 241 pp, $21.50 (P). [ISBN 
0-89871-314-5] The mathematician’s Strunk 
and White. With chapters such as English Us- 
age, Writing a Paper, Revising a Draft, Writing 
a Talk, and Computer Aids for Writing and Re- 
search, this handbook belongs on every math- 
ematician’s desk. A pleasure just to thumb 
through and spot read. BC 


General, P. How To Teach Mathematics: A 
Personal Perspective. Steven G. Krantz. AMS, 
1993, x + 76 pp, $21 (P). [ISBN 0-8218-0197- 
X] Practical advice about teaching. Few will 
agree with every recommendation, but there is 
much of value here. AO 


General, S(13). Conquering Math Anxiety: 
A Self-Help Workbook. Cynthia A. Arem. 
Brooks/Cole, 1993, xvii + 158 pp, $9.50 (P). 
[ISBN 0-534-18876-1] Hints and exercises 
for controlling anxiety, developing positive 
thinking, visualizing success, recognizing and 
applying learning styles, learning study skills, 
conquering test anxiety. Despite the title, this 
is a general approach; main references to math- 
ematics are chapter and section titles. MW 


Mathematics Appreciation, T(13: 1), S, L*.
Geometry in Nature. Vagn Lundsgaard Hansen. 
Transl: Tom Artin. AK Peters, 1993, xiii + 
238 pp, $29.95. [ISBN 1-56881-005-9] A 
broad, accessible, readable overview. Topics 
include: classification of surfaces, Poincaré’s 
conjecture, catastrophe theory, ancient astron- 
omy, relativity, Minkowski space-time, fiber 
bundles, gauge theory, string theory. DP 


Recreational Mathematics, S(13), L. Mathemagics:
How To Look Like a Genius Without
Really Trying. Arthur Benjamin, Michael Brant 
Shermer. Lowell House (2029 Century Park 
E., Suite 3290, Los Angeles, CA 90067), 1993, 
xx + 218 pp, $22.95. [ISBN 0-929923-54-5] 
Dedicated to the proposition that learning men- 
tal arithmetic can hook students on real math- 
ematics, this book explains tricks Arthur Ben- 
jamin uses when performing as a “lightning cal- 
culator.” This reviewer was charmed by the 
fun-filled anecdotes, clever tricks, and wonder- 
ful quotes such as this one, from Samuel John- 
son: “Arithmetical inquiries give entertainment 
in solitude by the practice, and reputation in 
public by the effect.” AWR 


Precalculus, T(13: 1). Algebra with Trigonom- 
etry for College Students, Third Edition. 
Charles P. McKeague. Saunders College, 1993, 
xix + 827 pp, $46.75. [ISBN 0-03-096561-6] 
Many exercises in each section; extensive re- 
view material after each chapter. Useful chap- 
ter summaries, “common mistakes” sections. 
Clearly worked examples, with helpful hints, 
common sense suggestions. (First Edition, TR, 
December 1988.) DS 


Precalculus, C. GrafEq V1.13 for the Ap- 
ple Macintosh. Pedagoguery Software (4446 
Lazelle Ave., Terrace, BC, Canada V8G 1R8), 
1991, 21 pp, (P), $200 site license. A sim- 
ple, inexpensive, easy-to-use plotting program 
with pre-calculus and elementary calculus ap- 
plications. Plots equations—not just functions; 
includes convenient zooming, tracking, derivative,
and other features. Manual includes sample
teaching materials. PZ


Education, P*. Integrating Research on 
the Graphical Representation of Functions. 
Eds: Thomas A. Romberg, Elizabeth Fennema, 
Thomas P. Carpenter. Stud. in Math. Thinking 
& Learning. Lawrence Erlbaum Assoc, 1993, 
xi + 350 pp, $79.95. [ISBN 0-8058-1134-6] 
11 articles summarize current knowledge from 
teaching, learning, curriculum, and assessment 
perspectives. AO 

Education, P. Open-ended Questioning: A
Handbook for Educators. Robin Lee Harris 
Freedman. A-W, 1994, v + 81 pp, $12.96 (P). 
[ISBN 0-201-81958-9] Advice on writing and 
using open-ended questions for assessment. 
Some examples from mathematics. AO 


Education, P. Authentic Assessment: A Hand- 
book for Educators. Diane Hart. A-W, 1994, 
vii + 120 pp, $12.96 (P). [ISBN 0-201-81864- 
7] Surveys alternatives to standardized tests 
for assessing learning. Gives practical tips for 
use of authentic assessment techniques. AO 


Education, P, L. Assessment in the Mathe- 
matics Classroom: 1993 Yearbook. Norman 
L. Webb, Arthur F. Coxford. NCTM, 1993, viii 
+ 248 pp, $20. [ISBN 0-87353-352-6] 27 pa- 
pers on general assessment themes; assessment 
techniques and practices in grades K-4, 5-8, 
and 9-12; issues in assessment. AO 


Education, P, L. Partnerships in Maths: Par- 
ents and Schools: The IMPACT Project. Eds: 
Ruth Merttens, Jeff Vass. Falmer Pr, 1993, vi 
+ 255 pp, $35 (P); $90. [ISBN 0-75070-155-2; 
0-75070-154-4] The IMPACT project made 
parents partners in mathematics education. 22 
papers, written by parents, teachers, supervi- 
sory personnel, and researchers, offer perspec- 
tives on the project and guidance for those in- 
terested in similar efforts. AO 


Education, P. American Perspectives on the 
Seventh International Congress on Mathemat- 
ical Education. Ed: John A. Dossey. NCTM, 
1993, vi + 73 pp, $10 (P). [ISBN 0-87353-360-7]
38 papers by American educators report on
sessions, experiences, and observations. AO 

Education, P. Mathematics Assessment: Al- 
ternative Approaches. Therese Kuhs. NCTM, 
1992, $55 video/viewer’s guide set; A Viewer’s
Guide, iv + 20 pp, $4.50 (P). [ISBN 0-87353-355-0]
Dramatizations illustrate several approaches
to assessment. Viewer’s Guide summarizes
each video segment, suggests follow-
up activities. Stimulating, provocative. AO 


Education, S(15-18). Talking Points in Mathematics.
Anita Straker. Cambridge Univ Pr,
1993, 124 pp, $15.95 (P). [ISBN 0-521-44758-5]
Ideas for mathematical “thought experiments”
in elementary and middle school. Activities
and teaching suggestions to encourage
mathematical discussion, imagery, and mental 
arithmetic. Organized by mathematical themes 
and British school levels. MW 


History, P. Bertrand Russell and the Origins
of the Set-theoretic ‘Paradoxes’. Alejandro
R. Garciadiego. Birkhäuser, 1992, xxix +
264 pp, $77.50. [ISBN 0-8176-2669-7] “Reconstructs
and reinterprets the role of Russell
in the origins of the set-theoretic ‘paradoxes’.” 
Carefully documented scholarly work; exten- 
sive bibliography. BH 


Logic, T(17), P, L. Model Theory. Wilfrid 
Hodges. Ency. of Math. & Its Applic., V. 42. 
Cambridge Univ Pr, 1993, xiii + 772 pp, $99.95. 
[ISBN 0-521-30442-3] Nice encyclopedic in- 
troduction to model theory; chapters loosely or- 
ganized around theme of model-theoretic con- 
structions. RM 


Logic, T(16-18: 1, 2), L. Logic and Prolog.
Richard Spencer-Smith. Harvester Wheatsheaf 
(Div. of Simon & Schuster), 1991, viii + 392 pp. 
[ISBN 0-7450-1022-9] Stimulating interplay 
between classical logic and Prolog. Logic is 
first seen as a representation system for knowl- 
edge found in natural language. Then Prolog as 
a programming language for processing knowl- 
edge is discussed. In the second half, logic as 
a system of deduction is followed by considering
Prolog as a computer system for achieving
similar results. RJA 


Combinatorics, T(17-18: 2), P, L. Alge- 
braic Combinatorics. C.D. Godsil. Chapman & 
Hall, 1993, xv + 362 pp, $54.95. [ISBN 0- 
412-04131-6] Explores interactions between 
algebra and combinatorics. Develops theory 
of characteristic and matching polynomials of 
a graph, connection to orthogonal polynomi- 
als. Also treats connections among orthogo- 
nal polynomials and other combinatorial objects 
(e.g., walks), distance regular graphs, associa- 
tion schemes, polynomial spaces. LC 


Linear Algebra, T(14-16: 1), L. Linear Algebra:
An Introduction to Abstract Mathematics.
Robert J. Valenza. Undergrad. Texts in
Math. Springer-Verlag, 1993, xviii + 237 pp, 
$39. [ISBN 0-387-94099-5] Distinctly dif- 
ferent from the usual computational approach, 
this treatment leans heavily toward axiomatic 
structural development. Matrices don’t appear 
until Chapter 5. Non-standard topics at this 
level include dual spaces, the Cayley-Hamilton 
theorem, and the Jordan normal form. Supplementary
topics include quadratic forms, and categories
and functors. Lean, concise treatment;
exercises often illustrate or extend theory. JS 


Group Theory, T(18: 1, 2), P. Nilpotent 
Groups and their Automorphisms. Evgenii I. 
Khukhro. Expos. in Math., V. 8. Walter de 
Gruyter, 1993, xiii + 252 pp, DM 158. [ISBN 3-11-013672-4]
First half treats basic linear and
combinatorial methods in theory of nilpotent 
groups. Second half details recent results. DP 


Group Theory, T*(16-17: 1, 2), L. Representations
and Characters of Groups. Gordon
James, Martin Liebeck. Cambridge Univ
Pr, 1993, x + 419 pp, $69.95; $29.95 (P).
[ISBN 0-521-44024-6; 0-521-44590-6] Nice 
introduction to group representations, based 
on modules but stressing characters. Useful 
worked examples (including character tables for 
many groups), good exercises (many with solu- 
tions) make text accessible to undergraduates, 
chemists, physicists, etc. Appealing application 
to molecular vibration. RM 


Algebra, T(17: 1, 2), L. Algebra. T.T. Moh. 
Ser. on Univ. Math., V. 5. World Scientific, 
1992, viii + 350 pp, $68; $32 (P). [ISBN 981- 
02-1195-3; 981-02-1196-1] Provides “quick 
access to the basic knowledge of algebra,” while 
showing diverse applications. Many examples; 
exercises stress intuitive approach. DP 


Algebra, P. The Basic Theory of Power Series.
Jesús M. Ruiz. Adv. Lect. in Math. Friedr
Vieweg & Sohn, 1993, ix + 134 pp, $17 (P). 
[ISBN 3-528-06525-7] Theorems and tech- 
niques of power series for analytic and alge- 
braic geometry, commutative algebra. Includes 
Rückert’s complex nullstellensatz, Risler’s real
nullstellensatz, Tougeron’s implicit function 
theorem, Artin’s approximation theorem. BH 


Calculus, T(13: 2). Calculus with Applica- 
tions, Second Edition. Raymond F. Coughlin, 
David E. Zitarelli. Saunders College, 1993, 
xxii + 722 pp, $58.75. [ISBN 0-03-055757-7] 
Changes (First Edition, TR, August-September 
1989) include exercises divided into three sets: 
standard problems, “referenced exercises” (new 
and updated applications problems), “cumula- 
tive exercises” (solutions require ideas from 
preceding sections). Beginner’s appendix for 
graphics calculators. Graphics calculator ex- 
plorations in each chapter. KB 


Calculus, T(13). Applied Technical Mathemat- 
ics. Merwin J. Lyng, L.J. Meconi, Earl J. Zwick. 
Wm C Brown, 1992, xvi + 857 pp. [ISBN 
0-697-08543-0]; Applied Technical Mathemat- 
ics with Calculus, xvi + 1142 pp. [ISBN 
0-697-05970-7] Basic algebra, trigonometry, 
sequences and series, analytic geometry, statis- 
tics; calculus in the extended version. No-frills 
approach stresses physical examples, not theory.
Useful as a reference. BH


Calculus, T(13), P, L. Calculus with Maple V. 
John S. Devitt. Brooks/Cole, 1993, xxii + 
502 pp, $24.50 (P). [ISBN 0-534-16362-9] 
Standard topics and order of presentation, but 
Maple V replaces paper-and-pencil for compu- 
tations. Exposition illustrates commands; style 
allows students to focus on concepts. End-of- 
chapter exercises; some go beyond what is rea- 
sonable to do “by hand.” AO 


Real Analysis, T(15-16: 1), P*, L**. A Course
of Pure Mathematics, Tenth Edition. G.H.
Hardy. Cambridge Univ Pr, 1992, xii + 509 pp,
$22.95 (P). [ISBN 0-521-09227-2] A reprint 
of the timeless classic, first published in 1908. 
Introduces analysis with the enthusiasm of “a 
missionary talking to cannibals.” DP 


Complex Analysis, T*(18), P*. Composition
Operators and Classical Function Theory.
Joel H. Shapiro. Universitext. Springer-Verlag,
1993, xvi + 223 pp, $34 (P). [ISBN 0-387-94067-7]
A holomorphic function $\varphi$ that
maps the unit disk $U$ into itself defines the composition
operator $C_\varphi f = f \circ \varphi$ for $f$ holomorphic
on $U$. Starting with Littlewood’s subordination
principle (that $C_\varphi$ maps $H^2$ into itself),
text explores “connections between function
theoretic properties of $\varphi$ and the behavior
of $C_\varphi$ on $H^2$.” Rudin’s Real and Complex Analysis
is the only prerequisite. Chapter exercises;
extensive bibliography. Wonderfully motivated
exposition of some beautiful theory. BH


Differential Equations, T(18), P. Singular 
Perturbation Methods for Ordinary Differen- 
tial Equations. Robert E. O’Malley, Jr. Appl. 
Math. Sci., V. 89. Springer-Verlag, 1991, viii + 
225 pp. [ISBN 0-387-97556-X] Covers sin- 
gularly perturbed initial value and boundary 
value problems, significant applications. Ap- 
pendix on history. BL 


Differential Equations, P. Nonlinear Equa- 
tions in the Applied Sciences. Eds: W.F. Ames, 
C. Rogers. Math. in Sci. & Eng., V. 185. Aca- 
demic Pr, 1992, ix + 475 pp. [ISBN 0-12- 
056752-0] 11 papers on symmetry in nonlin- 
ear mechanics, waves, integrable systems, sym- 
metric chaos, etc. BL 


Differential Equations, S(18), P, L. Solv- 
ing Ordinary Differential Equations I: Non- 
stiff Problems, Second Revised Edition. E. 
Hairer, S.P. Nørsett, G. Wanner. Ser. in Comput.
Math., V. 8. Springer-Verlag, 1993, xv
+ 528 pp, $98. [ISBN 0-387-56670-8] New 
features: Hamiltonian systems, symplectic and 
parallel Runge-Kutta methods, dense output 
for RK methods, a new Dormand and Prince 
method of order 8 with dense output. (First
Edition, TR, June-July 1987.) SM 


Partial Differential Equations, T(15-16: 1), 
L. Boundary Value Problems and Partial Dif- 
ferential Equations. Mayer Humi, William 
B. Miller. PWS-Kent, 1992, x + 342 pp, 
$39. [ISBN 0-534-92880-3] For science 
and engineering students—intuitive, technique- 
oriented. Treats separation of variables, closed 
form representations, Fourier and Laplace 
transforms, numerical methods. BL 


Partial Differential Equations, T(15: 2), L. 
Partial Differential Equations and Boundary- 
Value Problems with Applications, Second Edi- 
tion. Mark A. Pinsky. McGraw-Hill, 1991, xxii 
+ 461 pp. [ISBN 0-07-050128-9] Stresses solutions
of physically motivated problems via
separation of variables, Fourier series, and 
transforms. Also includes asymptotic analysis 
of integrals, numerical solutions, Green’s func- 
tions, and approximate solutions. Appendix 
with Mathematica examples. (First Edition, 
TR, February 1985.) BL 


Partial Differential Equations, P. Pseu- 
dodifferential Operators and Nonlinear PDE. 
Michael E. Taylor. Progress in Math., V. 100. 
Birkhäuser, 1991, 213 pp. [ISBN 0-8176-3595-5]
Operators with nonsmooth symbols,
paradifferential operators, nonlinear hyperbolic 
systems, propagation of singularities, nonlin- 
ear parabolic equations, elliptic boundary prob- 
lems. BL 


Dynamical Systems, P. The Geometry of 
Hamiltonian Systems. Ed: Tudor Ratiu. Math. 
Sci. Res. Inst., V. 22, Springer-Verlag, 1991, x 
+ 527 pp. [ISBN 0-387-97608-6] 19 papers 
from 1989 MSRI workshop. BL 


Dynamical Systems, T(18: 1, 2), P. Elements 
of Topological Dynamics. J. de Vries. Math. & 
Its Applic., V. 257. Kluwer Academic, 1993, 
xvi + 748 pp, $235. [ISBN 0-7923-2287-8] 
Systematic exposition; aims at recent research. 
Assumes general topology, some Lebesgue integration
theory and functional analysis. DP

Dynamical Systems, T(18), S, P, L. Com- 
plex Dynamics. Lennart Carleson, Theodore 
W. Gamelin. Universitext: Tracts in Math. 
Springer-Verlag, 1993, ix + 175 pp, $29 (P). 
[ISBN 0-387-97942-5] Expanded notes from 
a UCLA course. Conformal and quasicon- 
formal mappings, fixed points, rational itera- 
tion theory (Julia sets), classification of peri- 
odic components, critical points and expanding 
maps, quasiconformal surgery, local geometry 
of the Fatou set, and the Mandelbrot set. As- 
sumes graduate real and complex analysis. KS 


Numerical Analysis, S(16). Numerical Methods
for Two-Point Boundary-Value Problems.
Herbert B. Keller. Dover, 1992, 397 pp, 
$9.95 (P). [ISBN 0-486-66925-4] A minor 
revision of the original 1968 Blaisdell text (TR, 
January 1969). DH 


Numerical Analysis, T(13), L.  Numer- 
ical Mathematics—A Laboratory Approach. 
Shlomo Breuer, Gideon Zwas. Cambridge Univ 
Pr, 1993, xiii + 267 pp, $49.95. [ISBN 0-521- 
44040-8] Assumes neither calculus nor linear 
algebra. Topics include iterative processes, area 
approximations, linear systems, acceleration of 
convergence, interpolative approximation. AO 


Numerical Analysis, T(15: 1), P. Numerical 
Methods for Engineering Applications. Edward 
R. Champion, Jr. Mech. Eng., V. 84. Marcel 
Dekker, 1993, xv + 442 pp, $150. [ISBN 0- 
8247-9135-5] Covers the standard topics plus 
Fourier transforms. Includes application to par- 
tial differential equations, comments on some 
software packages. Chapters contain source 
code for each algorithm, examples. DH 
Numerical Analysis, T(15-17: 1), L. Solv- 
ing Linear and Non-Linear Equations. Chris 
Woodford. Math. & Its Applic. Ellis Horwood, 
1992, 190 pp, $60 (P). [ISBN 0-13-830415-7; 
0-13-830423-8] Theoretical background and 
practical advice. A careful study of selected 
algorithms, not a comprehensive survey. AO 


Functional Analysis, T(18: 1), S, P. A Primer 
of Nonlinear Analysis. Antonio Ambrosetti, 
Giovanni Prodi. Stud. in Adv. Math., V. 34. 
Cambridge Univ Pr, 1993, viii+ 171 pp, $44.95. 
[ISBN 0-521-37390-5] Differentiation in Ba- 
nach spaces, including local and global inver- 
sion theorems; bifurcation theory applied to wa- 
ter waves, the restricted three-body problem, 
etc. Assumes solid grounding in linear func- 
tional analysis. Brief problem list. JS 


Analysis, P. Analysis and Geometry on 
Groups. N. Th. Varopoulos, L. Saloff-Coste, T. 
Coulhon. Tracts in Math., V. 100. Cambridge 
Univ Pr, 1992, xii + 156 pp, $44.95. [ISBN 
0-521-35382-3] Advanced research material 
from courses given by Varopoulos at Université 
Paris VI, 1982-87. DP 


Analysis, T(16-18), S, P, L. Wavelets and Operators.
Yves Meyer. Transl: D.H. Salinger.
Stud. in Adv. Math., V. 37. Cambridge Univ 
Pr, 1992, xv + 223 pp, $49.95. [ISBN 0-521- 
42000-8] Surveys popular alternative to tradi- 
tional Fourier analysis. Argues that Fourier and 
wavelet analysis are complementary. Assumes 
rudiments of Fourier analysis. A “must read” 
for anyone who wants to learn the subject. KS 


Analysis, S(15-18), L. Selected Problems in
Real Analysis. B.M. Makarov, et al. Transl. 
of Math. Mono., V. 107. AMS, 1992, x +
370 pp, $112. [ISBN 0-8218-4559-4] Prob- 
lems (with solutions) for students who want to 
learn by working (mostly) hard problems. Ar- 
ranged like Halmos’s A Hilbert Space Problem 
Book. Chapters on sequences, functions, se- 
ries, integrals, asymptotics, Lebesgue measure 
and integration, sequences of measurable func- 
tions, iterates of transformations. Based on a 
course at Leningrad University. KS 


Analysis, T(18: 1, 2), P**. The Logarithmic 
Integral: II. Paul Koosis. Cambridge Univ 
Pr, 1992, xxvi + 574 pp, $150. [ISBN 0-521- 
30907-7] Volume II of a comprehensive study
of the logarithmic integral’s wide-ranging roles 
in real and complex analysis. Like Volume I 
(TR, May 1989), an expository masterpiece: so- 
phisticated but informal, serious but engaging, 
demanding but generous. PZ 


Algebraic Geometry, S(18), P. Clifford Num- 
bers and Spinors. Marcel Riesz. Eds: E. 
Folke Bolinder, Pertti Lounesto. Fund. The- 
ories of Physics, V. 54. Kluwer Academic, 
1993, ix + 241 pp, $98.50. [ISBN 0-7923- 
2299-1] First section is transcription of au- 
thor’s 1957-1958 lectures on Clifford Algebras 
and Spinors; topics are Clifford numbers, rota- 
tions and reflections, canonical representations, 
representations of isometries. Second section is 
a supplement on Clifford algebras. Third sec- 
tion, by Lounesto, is a survey article on Clifford 
algebras. JS 


Topology, T**(16-17: 1, 2), L*. Classical
Topology and Combinatorial Group Theory, 
Second Edition. John Stillwell. Grad. Texts 
in Math., V. 72. Springer-Verlag, 1993, xi 
+ 334 pp, $49. [ISBN 0-387-97970-0] A 
wonderful textbook. Historical development 
stresses geometric aspects, focusing on connec- 
tions with complex analysis, mechanics, group 
theory. Treats only dimensions ≤ 3, with emphasis
on fundamental group. New edition includes
proof of the unsolvability of the word
problem for groups, some of its consequences. 
(First Edition, TR, January 1982.) DP 


Operations Research, S(16-17), C, L. A 
Computer-Assisted Analysis System for Mathe- 
matical Programming Models and Solutions: A 
User’s Guide for ANALYZE. Harvey J. Green- 
berg. OR/Comp. Sci. Interface Ser. Kluwer 
Academic, 1993, xii + 264 pp, $105, disk in- 
cluded. [ISBN 0-7923-9322-8] Many exam- 
ples; no exercises. Assumes basics of LP for- 
mulation, solution, and post-optimality analy- 
sis. Can be used with MINOS, MPS III, MPSX, 
OB1, OSL, etc. SM 


Mathematical Modeling, P. Innovation in 
Maths Education by Modelling and Applications.
Jan de Lange, et al. Math. & Its Applic.
Ellis Horwood, 1993, xii + 392 pp, $50.
[ISBN 0-13-017351-7] 37 papers on philo- 
sophical issues, new topics and tools, case stud- 
ies, curriculum and assessment. Many articles 
from the 1991 International Conference on the 
Teaching of Mathematics by Applications held 
in Noordwijkerhout, The Netherlands. AO 

Probability, T(17-18: 2), S*. Martingale 
Spaces and Inequalities. Ruilin Long. Friedr 
Vieweg & Sohn, 1993, iv + 346 pp, $64. 
[ISBN 3-528-08397-2] A systematic intro- 
duction; reflects developments of last 20 years. 
Topics: regular martingales and atomic decom- 
position; aspects related to BMO (Carleson 
measures, commutators, weights, martingale 
transforms, etc.); applications to analysis. KB 


Probability, T(16-17: 2), S. Probability and
Its Applications for Engineers. David H. Evans. 
Marcel Dekker, 1992, xiv + 634 pp, $89.75. 
[ISBN 0-8247-8656-4] Six chapters on the- 
ory; six stand-alone chapters on applications to 
statistics, quality control, tolerancing, reliability,
random processes, and decision trees. DH


Stochastic Processes, T(17-18: 1, 2), S, P, L.
Introduction to Multiple Time Series Analysis, 
Second Edition. Helmut Lütkepohl. Springer-Verlag,
1993, xxi + 545 pp, $49 (P). [ISBN 0-387-56940-5]
Very minor changes from First
Edition (TR, May 1992). KB 


Elementary Statistics, S?(14). The Proofs for 
a First Course in Statistics. T.P. Hutchinson. 
Rumsby Scientific (POB Q355, Queen Victoria 
Bldg., Sydney, N.S.W. 2000, Australia), 1993, 
10 pp, $2.50 (P). Proofs of ten elementary results,
such as Var(X̄) = Var(X)/n and the
formulas for the coefficients of the least squares 
regression line. Requires some calculus. RSK 


Statistical Methods, T(16-17: 2), S. Time
Series: Forecasting, Simulation, Applications. 
Gareth Janacek, Louise Swift. Math. & Its Ap- 
plic. Ellis Horwood, 1993, 331 pp. [ISBN 0- 
13-918459-7] Broad practical/theoretical in- 
troduction to time series analysis (mostly uni- 
variate). Stresses analysis and forecasting of 
state space models using Kalman filter for
estimation. Topics: estimation, prediction, 
model selection and evaluation, ARMA mod- 
els, Kalman filter algorithm, structural models, 
frequency domain ideas, spectral estimation, 
multiple series, inclusion of exogenous vari- 
ables, transformations, missing values. KB 
Computer Systems, P. Understanding 
Japanese Information Processing. Ken Lunde. 
O’Reilly & Assoc, 1993, xxxii + 435 pp, 
$29.95 (P). [ISBN 1-56592-043-0] 
Theory of Computation, T(17-18: 1), S, P.
Deduction and Declarative Programming. Pe- 
ter Padawitz. Tracts in Theoret. Comput. 
Sci., V. 28. Cambridge Univ Pr, 1992, vi + 
279 pp, $44.95. [ISBN 0-521-41723-6] Be- 
gins with basics of functional and logic pro- 
gramming, stressing specification, verification. 
Treats guards, generators, constructors, models, 
correctness, computing goal solutions, induc- 
tive and directed expansion, reduction, ground 
confluence. Some good examples. Last chap- 
ter treats EXPANDER, a proof support system 
for reasoning about data type specifications and 
declarative programs. RJA 

Theory of Computation, T(17-18: 1), S, P. 
The Clausal Theory of Types. D.A. Wolfram. 
Tracts in Theor. Comp. Sci. Cambridge Univ 
Pr, 1993, viii + 124 pp, $39.95. [ISBN 0-521- 
39538-0] Begins by developing logic programming
(LP) from automatic theorem proving.
Uses simply typed λ-calculus to define
syntax of higher-order logics and to study their 
properties. Uses higher-order equational unifi- 
cation to generalize the resolution principle. A 
higher-order LP language is derived and shown 
to be sound and complete. Extensive bibliogra- 
phy. RJA 

Computer Science, T(13: 1), L*. The (New) 
Turing Omnibus: 66 Excursions in Computer 
Science. A.K. Dewdney. Computer Science Pr, 
1993, xvi + 455 pp, $24.95. [ISBN 0-7167- 
8271-5] Revised and expanded; new chapters 
on disk operating systems, computer viruses, 
Newton’s method, genetic algorithms, how neu- 
ral nets learn, the Mandelbrot set. AO 


Computer Science, P. Specification and Proof 
in Real-Time CSP. Jim Davies. Cambridge Univ 
Pr, 1993, xvii + 180 pp, $49.95. [ISBN 0-521- 
45055-1] Extends Hoare’s theory of Commu- 
nicating Sequential Processes to timed denota- 
tional semantics for the specification of real- 
time systems. RM 


Applications (Biological Science), T(15-17:
1), L. Mathematical Modeling in the Life Sci- 
ences. Paul Doucet, Peter B. Sloep. Math. 
& Its Applic. Ellis Horwood, 1992, xiv + 
490 pp, $45 (P). [ISBN 0-13-562018-X; 0-13- 
562000-7] Aims to present “what a life scien- 
tist should know about modeling.” Differential 
equations and probability theory are the primary 
mathematical tools. AO 

Applications (Economics), T(16-17: 2), S. 
The Medieval Village Economy: A Study of 
the Pareto Mapping in General Equilibrium 
Models. Robert M. Townsend. Frontiers of 
Econ. Res. Princeton Univ Pr, 1993, xxi + 
145 pp, $35. [ISBN 0-691-04270-5] Path- 
breaking contributions to contract theory and 
general equilibrium analysis. Explains medieval
village economy by combining theory of
general equilibrium with notion that allocations 
and institutions of an economy might be Pareto 
optimal. Standard models are reinterpreted, ap- 
plied, and extended. Draws from historical ob- 
servations; characterizes solutions analytically 
and numerically. KB 


Applications (Information Theory), P. Sit- 
uation Theory and Its Applications, Volume 2. 
Eds: Jon Barwise, et al. Lect. Notes, No. 26. 
CSLI (Ventura Hall, Stanford U., Stanford, CA 
94305), 1991, xiii + 637 pp, $26.95 (P). [ISBN 
0-937073-70-9] Proceedings of 1990 confer- 
ence in Scotland. Papers treat issues in sit- 
uation theory; applications of theory to logic, 
computer science, philosophy; applications to 
natural language. RJA 

Applications (Physical Science), S(18), L. 
Lecture Notes in Control and Informa- 
tion Sciences-186: Systems Representation of 
Global Climate Change Models: Foundation 
for a Systems Science Approach. N. Sreenath. 
Springer-Verlag, 1993, xxvi + 258 pp, $79 (P). 
[ISBN 0-387-19824-5] Models global climate
change using systems theory.
An intricate block diagram on a fold-out page 
describes the model graphically. Stresses incor- 
poration of human activities into the model. SM 


Applications (Physics), P. Introduction to 
Quantum Groups. George Lusztig. Progress in 
Math., V. 110. Birkhäuser, 1993, xii + 341 pp,
$49.50. [ISBN 0-8176-3712-5] 


Applications (Physics), P. Mathematical Scattering
Theory: General Theory. D.R. Yafaev.
Transl. of Math. Mono., V. 105. AMS, 1992, x 
+ 341 pp, $216. [ISBN 0-8218-4558-6] 


Applications (Physics), P, L. Selected Works 
of Yakov Borisovich Zeldovich, Volume II: Particles,
Nuclei, and the Universe. Eds: J.P. Ostriker,
G.I. Barenblatt, R.A. Sunyaev. Transl: A.
Granik, E. Jackson. Princeton Univ Pr, 1993, 
xiv + 644 pp, $59.50. [ISBN 0-691-08742-3] 
83 papers on nuclear and particle physics, as- 
trophysics and cosmology, and the history of 
physics. BC 


Reviewers 


RJA: Richard J. Allen, St. Olaf; KB: Karla Ballman, 
Macalester; LC: Laura Chihara, St. Olaf; BC: Barry 
Cipra, St. Olaf; BH: Bruce Hanson, St. Olaf; DH: Deanna 
Haunsperger, St. Olaf; RSK: Richard S. Kleber, St. Olaf; 
BL: Brian Luc, Carleton; SM: Steve McKelvey, St. Olaf; 
RM: Richard Molnar, Macalester; AO: Arnold Ostebee, 
St. Olaf; DP: David Peifer, St. Olaf; AWR: A. Wayne 
Roberts, Macalester; KS: Karen Saxe, Macalester; JS: John 
Schue, Macalester; DS: Dan Schwalbe, Macalester; MW: 
Martha Wallace, St. Olaf; PZ: Paul Zorn, St. Olaf. 
Symbolic Computation in
Undergraduate Mathematics
Education


Zaven Karian, Editor 


If you are considering putting a symbolic comput- 
ing system into your curriculum, this is one publi- 
cation you should have. 

—Mathematics Teacher 


This well-written book should be helpful to anyone 
using symbolic computation as an aid in teaching 
undergraduates—The book provides a number of 
examples for presenting probability and statistics 
in a way that removes the tedium and emphasizes
the underlying ideas. 

—AAAS, Science Book and Films 


If you have any plans to integrate symbolic computing
into your program, read and study this book
first. Your students will thank you for it.
—AMATYC Review 


This volume brings together many of the facets 
associated with the pedagogic uses of symbolic 
computation. 


Part I consists of articles that deal with general
issues of learning mathematics and the role of 
symbolic computation in that process. The articles 
in Part II describe the use of symbolic computation
in teaching calculus. Some of the areas covered 
are the use of symbolic computation in a labora- 
tory calculus course, the uses of Derive in the 
instruction of calculus, antidifferentiation and the 
definite integral, and the experiences and reflections
of teachers who have used symbolic computation
in calculus instruction.


Part III consists of papers on sophomore-level
courses on linear algebra and differential equations.
The articles in Part IV describe what can be
done in using symbolic computation in teaching 
combinatorics, probability and statistics courses. 
The articles and references in Part V will help you 
get started in using some of these ideas at your 
own institution. 
Proofs Without Words


Exercises in Visual Thinking 


Roger B. Nelsen 


Just what are “proofs without words?” First of all, 
most mathematicians would agree that they cer- 
tainly are not “proofs” in the formal sense. Indeed, 
the question does not have a simple answer. Proofs 
without words are generally pictures or diagrams 
that help the reader see why a particular mathematical
statement may be true, and how one could
begin to go about proving it. While in some proofs 
without words an equation or two may appear to 
help guide that process, the emphasis is clearly on 
providing visual clues to stimulate mathematical 
thought. Proofs without words bear witness to the 
observation that often in the English language to 
see means to understand, as in “to see the point of an 
argument.” 


Proofs without words have a long history. In this 
collection you will find modern renditions of proofs 
from ancient China, classical Greece, twelfth-cen- 
tury India—even one based on a published proof 
by a former President of the United States! How- 
ever, most of the proofs are more recent creations, 
and many are taken from the pages of MAA jour- 
nals. 


The proofs in this collection are arranged by topic 
into six chapters: Geometry and Algebra; Trigonometry,
Calculus and Analytic Geometry; Inequalities;
Integer Sums; Sequences and Series; and
Miscellaneous. Teachers will find that many of the 
proofs in this collection are well suited for class- 
room discussion and for helping students to think 
visually in mathematics. 


The readers of this collection will find enjoyment 
in discovering or rediscovering some elegant visual 
demonstrations of certain mathematical ideas that 
teachers will want to share with their students. 
Readers may even be encouraged to create new 
“proofs without words.” 
No matter how you express it, it still means
DERIVE is half price.


The DERIVE A Mathematical Assistant program lets
you express yourself symbolically, numerically and
graphically, from algebra through calculus, with
vectors and matrices too, all displayed with
accepted math notation, or 2D and 3D plotting.
DERIVE is also easy to use and easy to read,
thanks to a friendly, menu-driven interface and
split or overlay windows that can display both
algebra and plotting simultaneously. Better still,
DERIVE has been praised for the accuracy and
exactness of its solutions. But, best of all the
suggested retail price is now only $125. Which
means DERIVE is now half price, no matter how you
express it.


DERIVE is a registered trademark of
Soft Warehouse, Inc.


Soft Warehouse


HONOLULU • HAWAII


System requirements 


DERIVE: MS-DOS 2.1 or 
later, 512K RAM, and one 3½"
disk drive. Suggested retail 
price now $125 (Half off!). 


DERIVE ROM card: 

Hewlett Packard 95LX & 
100LX Palmtop, or other PC 
compatible ROM card 
computer. Suggested retail 
price now $125! 


DERIVE XM 

(eXtended Memory): 

386 or 486 PC compatible 
with at least 2MB of extended
memory. Suggested list
price now $250! 


Soft Warehouse, Inc. • 3660 Waialae Ave.
Ste. 304 • Honolulu, HI, USA 96816-3236
Ph: (808) 734-5801 • Fax: (808) 735-1105


Game Theory and Strategy 


Philip D. Straffin, Jr. 


This valuable addition to the New Mathematical 
Library series pays careful attention to applica- 
tions of game theory in a wide variety of disci- 
plines. The applications are treated in consider- 
able depth. The book assumes only high school 
algebra, yet gently builds to mathematical think- 
ing of some sophistication. Game Theory and 
Strategy might serve as an introduction to both 
axiomatic mathematical thinking and the funda- 
mental process of mathematical modelling. It 
gives insight into both the nature of pure math- 
ematics, and the way in which mathematics can be 
applied to real problems. 

Since its creation by John von Neumann and 
Oskar Morgenstern in 1944, game theory has 
contributed new insights to business, politics, eco- 
nomics, social psychology, philosophy, and evolu- 
tionary biology. In this book, the fundamental 
ideas of game theory share the stage with applica- 
tions of the theory. How might strategic business 
decisions depend on information about a rival 
company, and how much would such information 
be worth? When is it advantageous to vote for a 
candidate who is not your favorite? What are the 
optimal strategies for teams in the football draft, 
and what paradoxes can result from following 
those strategies? What is a fair way to share the
costs of a development project? What can we learn
about the problem of “free will” by imagining 
playing a game with an omnipotent Being? How 
might natural selection lead to altruistic behavior 
in animal species? Game theory gives insight into 
all of these questions. 

The book includes many exercises, with answers, 
which allow the reader to try out calculations, and 
explore alternative formulations of game-theo- 
retic ideas. 
UNFORTUNATELY, THIS IS WHERE
PEOPLE ARE PUTTING
TOO MANY RETIREMENT DOLLARS.


Every year, a lot of people make a
huge mistake on their taxes. They
don’t take advantage of tax deferral and
wind up sending Uncle Sam money they
could be saving for retirement.
Fortunately, that’s a mistake you can
easily avoid with TIAA-CREF SRAs.
SRAs not only ease your current tax
bite, they offer a remarkably easy way
to build retirement income, especially
for the “extras” that your regular pension
and Social Security benefits may not
cover. Because your contributions are
made in before-tax dollars, you pay less
taxes now. And since all earnings on
your SRA are tax-deferred as well, the
money you don’t send to Washington
works even harder for you. Down the 
road, that can make a dramatic difference 
in your quality of life. 

What else makes SRAs so special? 

A range of allocation choices—from the 
guaranteed security of TIAA to the 
diversified investment accounts of 
CREF’s variable annuity—all backed 
by the nation’s number one retirement 
system. 

Why write off the chance for a more 
rewarding retirement? Call today and 
learn more about how TIAA-CREF 
SRAs can help you enjoy many 
happy returns. 


Benefit now from tax deferral. Call our SRA hotline 1 800-842-2733, ext. 8016. 


Ensuring the future
for those who shape it.


CREF certificates are distributed by TIAA-CREF Individual and Institutional Services. For more complete information, including charges and expenses, 
call 1 800-842-2733, ext. 8016 for a prospectus. Read the prospectus before you invest or send money. 


Essays in Humanistic 
Mathematics 


Alvin White, Editor 


A dazzling array of essayists reveal humanistic 
mathematics in this volume, and in so doing go 
beyond the facts, formulas, and algorithms that most 
students associate with mathematics to a presenta- 
tion of mathematics as an intellectual discipline with 
a human perspective and a significant history. Hu- 
manistic mathematics challenges dogmatic teach- 
ing styles that expect students to parrot the lecturer. 
It demands creativity from both the teacher and 
student. 

Teaching mathematics humanistically seeks to 
place the student more centrally in the position of 
inquirer than is generally the case, while at the same 
time acknowledging the emotional climate of the 
activity of learning mathematics. This type of 
teaching encourages students to learn from each 
other and to better understand mathematics as so- 
cially constructed knowledge, rather than as an 
arbitrary discipline. 

Teaching humanistic mathematics brings the fo- 
cus less upon the nature of the teaching and learning 
environment and more upon the need to reconstruct 
the curriculum and the discipline of mathematics 
itself. This reconstruction relates mathematical 
discoveries to personal courage, discovery to verifi- 
cation, mathematics to science, truth to utility, and 
mathematics to the culture in which it is embedded.

The humanistic mathematics movement, which 
began as the personal vision of a few, has now 
become a major part of mathematical culture. What 
was viewed with skepticism is now accepted and 
expected. Humanistic mathematics is not a new 
discovery. It is a recent rediscovery of ideas that go
back to Plato. It has provided a vocabulary for 
previously unarticulated concepts and approaches. 

The essays in this volume illustrate and help to 
define humanistic mathematics. The variety and 
scope indicate the richness and fruitfulness of the 
concept. Although each essay is independent, a 
sense of unity emerges. A glimpse at the table of 
contents will give you an idea of the excitement and 
range of the ideas presented. 
Mathematical Cranks


Underwood Dudley 


A delightful collection... It is hard to put down and
provides topics for an unending series of interesting
discussions. The organization and breadth of
the book are impressive, supported by a helpful 
index and a list of resources that encourage 
further explorations. A classic. —CHOICE 


A jewel... The most interesting book that I have
read this year.
—Journal of Recreational Mathematics


It's a gem...Dudley hasn't tossed out crank sub- 
missions over the years; he's saved them and 
collected samples from other mathematicians. 
And what wonderful samples they are. 
—Sunday Telegraph, Nashua, New Hampshire 


Mathematical Cranks is about people who think 
that they have done something impossible, like 
trisecting the angle, squaring the circle, duplicat- 
ing the cube, or proving Euclid’s parallel postulate; 
people who think they have done something that 
they have not, like proving Fermat’s Last Theo- 
rem, verifying Goldbach’s Conjecture, or finding a 
simple proof of the Four Color Theorem; people 
who have eccentric views, from mild (thinking we 
should count by 12s instead of 10s) to crazy 
(thinking that second-order differential equations 
will solve all problems of economics, politics, and 
philosophy); people who pray in matrices; people 
who find the American Revolution ruled by the 
number 57; people who have in common some- 
thing to do with mathematics and something odd, 
peculiar, or bizarre. 
Cranks and their ideas come in great variety. The
book is a collection of examples, designed to give 
readers an idea of what cranks do and how they do 
it. Contemplating the odd, peculiar, or bizarre can 
be entertaining or enlightening. There can be no 
solution to the problem of mathematical cranks— 
obsessive people we will always have with us, and 
some will become obsessed with mathematics— 
but perhaps viewing the futility of their efforts will 
turn some prospective cranks toward more fruitful 
endeavors. 


This is a truly unique book, written with wit and 
style. 
The Concept of Function


Aspects of Epistemology
and Pedagogy


Guershon Harel and Ed Dubinsky, Editors 


Highly readable contribution to the literature on 
learning the concept of function. The overall qual- 
ity of the papers is quite high. 

—Mathematics Teacher 


The contributors of this volume probe the idea of 
what it means to learn the concept of function and 
how instruction, based on research, could assist 
teachers in finding ways of helping their students 
understand this all-important mathematical con- 
cept. 


The concept of function is one that will appear 
again and again in a student’s mathematics train- 
ing. Arithmetic in the early grades, algebra in 
junior high school, and transformational geometry 
in high school are all largely based on the idea of 
function. Moreover, people involved in calculus 
reform know that understanding the idea of func- 
tion is an indispensable part of the background 
students need to understand calculus. As math- 
ematical education is being renewed and reformed 
throughout the world, this movement requires that 
we learn more about the concept of function both 
from epistemological and pedagogical points of 
view. 
There are several major themes that emerge in
the pages of this volume. They are theoretical 
perspectives of development of the function con- 
cept, theory-based teaching experiments, con- 
ceptions held by students and teachers, and the 
use of pedagogical software. The volume begins 
with a summary and overview of the subject and 
is followed by a brief glossary of terms. 


The development of the papers presented in the 
volume began with a conference held in West 
Lafayette, Indiana in October 1990 with the sup- 
port of Purdue University and the Exxon Founda- 
tion. This volume is, however, much more than 
just a conference proceedings. It is a truly coop- 
erative writing effort by a group of dedicated 
researchers and educators. 
EXCURSIONS IN CALCULUS:
An Interplay of the Continuous
and the Discrete


Robert M. Young 


An excellent source of projects for well motivated 
students. This list of 463 references is a valuable 
aid for those who wish to dig deeper. —CHOICE


The presentation is clear and the topics very 
interesting...fully accessible to students for whom 
the book is intended. The book will be influential in 
awakening students’ awareness for good classi- 
cal mathematics. —Paulo Ribenboim 


Printed with eight full-color plates. 


The purpose of this book is to explore, within the 
context of elementary calculus, the rich and el- 
egant interplay that exists between the two main 
currents of mathematics, the continuous and the 
discrete. Such fundamental notions in discrete 
mathematics as induction, recursion, combinato- 
rics, number theory, discrete probability, and the 
algorithmic point of view as a unifying principle are 
continually explored as they interact with tradi- 
tional calculus. The interaction enriches both. 


The book is addressed primarily to well-trained 
calculus students and their teachers, but it can 
serve as a supplement in a traditional calculus
course for anyone who wants to see more. 
Dolciani Mathematical Expositions, Number 13

The Mathematical Association of America


CONTENTS: 


• Infinite Ascent, Infinite Descent: The Principle
of Mathematical Induction

• Patterns, Polynomials, and Primes: Three
Applications of the Binomial Theorem

• Fibonacci Numbers: Function and Form

• On the Average

• Approximation: From Pi to the Prime Number
Theorem

• Infinite Sums: A Potpourri


The problems, taken for the most part from 
probability, analysis and number theory, are an 
integral part of the text. Many point the reader 
toward further excursions. There are over 400 
problems presented in this book. 
OLD AND NEW UNSOLVED
PROBLEMS IN PLANE
GEOMETRY AND
NUMBER THEORY


Victor Klee and Stan Wagon 


Many facts and problems to fascinate both on 
familiar and unfamiliar topics. It is compulsive 
reading and will fill your mind with problems that 
will come back to haunt you again and again 
during idle moments. 

—Mathematical Intelligencer 


The book will serve well as a point of entry for
students who want to know more about celebrated
questions, or simply take in the vistas.
—CHOICE 


This is a book that not only belongs in every
university, college and high school library, it very
definitely belongs in every public library.
—Mathematical Reviews 


Part of the broad appeal of mathematics is that 
there are simply stated questions that have not 
yet been answered. These questions are plenti- 
ful in the areas of plane geometry and number 
theory, and the purpose of this book is to discuss 
some unsolved problems in these fields. Be- 
cause the central concepts of geometry and 
number theory are understood by everyone, many 
of the questions can be understood by readers 
with extremely little mathematical background. 


The authors place each problem in its historical 
and mathematical context. Each problem section 
is presented in two parts: The first gives an 
elementary overview discussing the history and 
both solved and unsolved variants of the prob- 
lem. Part Two contains more details, including a 
few proofs of related results, a wider and deeper 
survey of what is known about the problem and 
its relatives, and a large collection of references. 
Both parts contain exercises, and solutions to 
the exercises are included. 


The book is aimed at both teachers and stu- 
dents of undergraduate mathematics, and at 
beginning graduate students. It could be used 
as a text in a course about unsolved problems, 
and also in courses in geometry or number
theory. High school teachers interested in learn- 
ing about developments in modern mathematics
will find much of interest here.


Knot Theory


Charles Livingston 


I learned more about knots after an hour with the book 
than I thought I could, and I am glad that it is here on 
my desk so that I may spend more time with it and, I 
hope, learn more. —Paul Halmos 


Knot Theory, a lively exposition of the math- 
ematics of knotting, will appeal to a diverse 
audience from the undergraduate seeking ex- 
perience outside the traditional range of studies 
to mathematicians wanting a leisurely introduc- 
tion to the subject. Graduate students begin- 
ning a program of advanced study will find a 
worthwhile overview, and the reader will need 
no training beyond linear algebra to under- 
stand the mathematics presented. 

Over the last century, knot theory has pro- 
gressed from a study based largely on intuition 
and conjecture into one of the most active areas 
of mathematical investigation. Knot Theory
illustrates the foundations of knotting as well as
the remarkable breadth of techniques it employs:
combinatorial, algebraic, and geometric.

The interplay between topology and algebra, 
known as algebraic topology, arises early in the 
book, when tools from linear algebra and from 
basic group theory are introduced to study the 
properties of knots, including the unknotting 
number, the braid index, and the bridge number.
Livingston guides you through a general
survey of the topic showing how to use the 
techniques of linear algebra to address some 
sophisticated problems, including one of math- 
ematics’ most beautiful topics, symmetry. The 
book closes with a discussion of high-dimensional
knot theory and a presentation of some
of the recent advances in the subject: the
Conway, Jones and Kauffman polynomials. A 
supplementary section presents the fundamen- 
tal group, which is a centerpiece of algebraic 
topology. 

An extensive collection of exercises is included. 
Some problems focus on details of the subject 
matter; others introduce new examples and 
topics illustrating both the wide range of knot 
theory and the present borders of our under- 
standing of knotting. All are designed to offer 
the reader the experience and pleasure of working
in this fascinating area.


Memorabilia Mathematica


The Philomath’s 
Quotation Book 


Robert Edouard Moritz 


When Robert Edouard Moritz compiled his book 
of quotations, Memorabilia Mathematica, which 
appeared in 1914, he stated that his primary 
objective was to seek out the exact statement of and
exact references for famous passages about math- 
ematics. He searched the writing not only of 
mathematicians, but poets, philosophers, histori- 
ans, statesmen, and scientists as well. His sources 
ranged from the works of Plato to the writings of 
Hilbert and Whitehead. His second objective was 
to produce a volume that would be a source of 
pleasure, encouragement, and inspiration to both 
mathematicians and non-mathematicians alike. 

This work was a ten-year labor of love, and it is 
a tribute to his discerning eye that this selection of 
passages should remain one of the most stimulat- 
ing works about mathematics ever published. It 
was the first collection of its kind in English and it 
conveys a sense of the full range of mathematics, its 
enormous accomplishments, and the living per- 
sonalities of great mathematicians. 

The more than eleven-hundred fully anno- 
tated selections in this book, gathered from the 
works of three hundred authors, cover a vast range
of subjects pertaining to mathematics. Grouped 
in twenty-one chapters, they deal with such topics 
as the definitions and objects of mathematics; the 
teaching of mathematics; mathematics as a language
or as a fine art; the relationship of mathematics
to philosophy, to logic, or to science; the
nature of mathematics, and the value of mathematics.
Other sections contain passages referring to
specific subjects in the field such as arithmetic, 
algebra, geometry, calculus, and modern math- 
ematics. Of special interest is the extensive amount 
of material on great mathematicians which pro- 
vides irreplaceable glimpses into the lives and 
personalities of mathematical giants. 

To mathematicians the book will be a great 
source of pleasure, inspiration, and encourage- 
ment. To teachers of mathematics and writers 
about mathematics, it will remain of inestimable 
value as a source of quotations and ideas. To the 
layperson, it will be a revelation. It should dispel 
forever the narrow notion that mathematics is a 
cut-and-dried affair, isolated from other compartments
of life and thought.


Out of the Mouths
of Mathematicians 


A Quotation Book for Philomaths 


Rosemary Schmalz 


Published as a companion volume to Robert 
Edouard Moritz’s Memorabilia Mathematica, Rose- 
mary Schmalz’s Out of the Mouths of Mathemati- 
cians picks up where Moritz left off. Her work will 
give you a sense of the “story” of twentieth century
mathematics. You will encounter the mathemati- 
cians, their collaborations and disputes, the move- 
ment from abstraction to application, the emer- 
gence of new areas of research, the impact of 
computers on mathematics, the challenges in math- 
ematics education, and more. 

Out of the Mouths of Mathematicians: A Quotation
Book for Philomaths is a compilation of 727
quotations from 292 contributors, almost all of 
whom are twentieth century mathematicians. Tak- 
ing the advice of Abel to learn mathematics by 
reading the masters, the author offers the reader a 
unique perspective on this century’s mathematics 
through the words of the mathematicians who are 
its creators. Stories about these mathematicians, 
their exhortations to their students, their descrip- 
tions of their efforts, successes, and failures, all 
make this century’s mathematics come alive. The 
book also offers readers the opportunity to broaden
their ideas about what mathematics is by offering 
many definitions of mathematics, making com- 
parisons of mathematics to computing and to the 
fine arts, and showing similarities between many 
aspects of mathematics and religion. The com- 
plete reference for each quotation allows the reader 
to continue exploration into a favorite area. A 
large topic index makes the book quite user- 
friendly. Some of the subject categories include:
The Development of Mathematics, Exhortations
to Aspiring Mathematicians, Pure and Applied 
Mathematics, About Mathematicians (by name), 
Anecdotes and Miscellaneous Humor, Particular 
Disciplines in Mathematics, Moments of Math- 
ematical Insight, Mathematics and the Arts,... and 
much more. 

This book will give pleasure to any philomath. 
It can be used to facilitate a literature search or to 
give quick access to an appropriate quote for writ- 
ers and speakers. It will be particularly useful to 
teachers of mathematics at all levels, to encourage,


COMMENTS


It is not the employer who pays wages — he only 
handles the money. It is the product that pays wages. 
--Henry Ford 


Rebates. 


It's the free enterprise solution both to grade inflation and to the lack of 
accountability. Here's how it works. Students pay an exorbitant fee for each credit 
hour or course. How high? Triple whatever students pay now at your institution. (So 
far, this sounds normal — at least for a parent with children in college). Now comes 
the difference. Depending on their grade in the course, students receive rebates. An 
A returns a whopping 80%; a B gives 60%; a C 40%; a D 20%; and an F gives 
nothing at all. Rebate checks (along with the grade report) are mailed to expectant 
parents promptly at the end of each term. 

What happens to the money generated by higher tuition? A portion is 
returned to each unit involved in the instruction — the college, the department, the 
instructor — perhaps even the grading assistant. 

That's it, the system; set it up and let it run. 

This is a system with all the right economic forces . . . and with real 
accountability. Mom and Dad can pay for their winter cruise when they receive a 
check at the end of a good semester. (It's more tangible than a small slip of paper with
3.8 typed on it.) Students who pay their own way will be even happier. Money is 
motivation. 

What about grade inflation? Too many A's and B's and the college and 
department soon go broke. Too few A's and B's means no one signs up for the 
courses in that expensive department. For instructors, the incentive is even clearer: 
Grade too easily and you're poor, grade too hard and no one takes your courses (and 
you're still poor). Clearly there exists a maximum profit in between. 

There is flexibility in this system. When the departmental xerox machine 
breaks down in December, an immediate call goes out to make finals tougher. They 
can raise the funds with one demanding Calculus test. On a smaller scale, faculty can 
generate the down payment on a new house, or the cost of an unexpected auto repair, 
by teaching one large course with a tough curve. Don't do it too often, however, or 
you won't have the chance again. 

The system has an additional advantage for people who like to use simple 
numbers to measure young faculty at tenure time. "Mary has only a 3.6 in teaching 
evaluations," they point out, "but she's generated $47,563 for the department in two 
years." 

Above all, this is a system that recognizes the proper relation of students as 
customers and teachers as vendors. The product we sell pays our wages. 


Does all this make you uneasy? 
I hope so. 


John Ewing


On the Geometry of Piecewise Circular
Curves 


Thomas Banchoff and Peter Giblin 


In this article we would like to promote a class of plane curves that have a number 
of special and attractive properties, the piecewise circular curves, or PC curves. 
(We feel constrained to point out that the term has nothing to do with Personal 
Computers, Privy Councils, or Political Correctness.) They are nearly as easy to 
define as polygons: a PC curve is given by a finite sequence of circular arcs or line 
segments, with the endpoint of one arc coinciding with the beginning point of the 
next. These curves are more versatile than polygons in that they can have a 
well-defined tangent line at every point: a PC curve is said to be smooth if the 
directed tangent line at the end of one arc coincides with the directed tangent line 
at the beginning of the next. (In particular, in a smooth PC curve, no arc 
degenerates to a single point.) 

In the literature of descriptive geometry and more recently in computer graph- 
ics, PC curves have been used to approximate smooth curves so that the approxi- 
mation is not only pointwise close, as in the case of an inscribed polygon, but also 
has the property that the tangent lines at the points of the smooth curve are 
approximated by the tangent lines of the PC curve. Given a pair of nearby points 
on a smooth curve together with their tangent directions, there will not in general 
be a single circular arc through the points with those directions at its endpoints, 
but there will be a family of biarcs meeting these boundary conditions, PC curves 
composed of two tangent circular arcs. (See [M-N] for a discussion of this 
construction.) 


EXAMPLES OF PC CURVES. PC curves arise naturally as the solutions of a 
number of variational problems related to isoperimetric problems. A classical 
problem is to find the curve of shortest length enclosing a fixed area, and the 
solution is a circle. If the curve is required to surround a fixed pair of points, then 
the curve of shortest length enclosing a given area will be either a circle or a lens 
formed by two arcs of circles of the same radius meeting at the two points. More 
generally Besicovitch has shown that a curve of fixed length surrounding a given 
convex polygon and enclosing the maximum area must be a PC curve with all radii 
of arcs equal [Be]. One such curve is the Reuleaux “triangle”, a three-arc PC curve
enclosing an equilateral triangle, with each radius equal to the length of a side of 
the triangle. Such three-arc PC curves, and many far more elaborate examples can 
be found in the tracery of gothic windows [A]. 

If we require that a curve of fixed length L surround a given pair of discs of the 
same radius, then, for a certain range of values of L, the curve that encloses the 
greatest area is a smooth convex PC curve consisting of two arcs on the boundary 
circles of the discs and two arcs of equal radius tangent to both discs. Such four-arc 
convex PC curves have long been used in engineering drawing for approximating 
ellipses, and we call such a curve a PC ellipse [FIGURE 1]. One special PC ellipse is
the boundary of the smallest convex set containing the two discs, called the convex
envelope of the two discs, consisting of two semicircles and two line segments. (We
thank Salvador Segura for pointing out the importance of PC curves in such
isoperimetric problems.)


Figure 1

The collection of PC curves is invariant not only under Euclidean motions and 
scaling, but also under inversion with respect to a circle. 


PARALLEL CURVES OF PC CURVES. The Reuleaux triangle is a non-smooth 
PC curve of constant width, so that every strip containing the curve and bounded
by a pair of parallel lines through points of the curve has the same width [FIGURE
2a]. We can obtain a smooth PC curve of constant width by taking an outer parallel
curve of the Reuleaux triangle, i.e. the boundary of the parallel region, the locus of
all points within a fixed distance of the points of the Reuleaux triangle [FIGURE 2b].

(a) (b)
Figure 2


This construction points out one of the main properties of PC curves: since the
parallel curves of circular arcs are circular arcs, the parallel curves of a PC curve
are PC curves. As an example, consider a convex PC ellipse. If we increase all of
the radii by the same amount, keeping the same centers for the arcs, we obtain an 
outer parallel curve which is also a PC ellipse. This situation is in contrast with the 
case of an actual ellipse, for which the exterior parallel curves are not conic 
sections but rather algebraic curves of fourth degree. 
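
The claim that offsetting a PC ellipse keeps the centers and changes only the radii can be checked numerically. The following Python sketch is an illustration only: the `Arc` class, the helper names, and the particular PC ellipse (small arcs of radius 1 centered at (3, 0) and (-3, 0), large arcs of radius 6 centered at (0, -4) and (0, 4)) are assumptions chosen for the example and do not come from the article.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Arc:
    """A circular arc traversed counter-clockwise from angle `start` to `end`."""
    center: tuple
    radius: float
    start: float
    end: float

    def point(self, angle):
        cx, cy = self.center
        return (cx + self.radius * math.cos(angle),
                cy + self.radius * math.sin(angle))


def offset(curve, d):
    """Parallel curve at distance d: same centers and angles, radii changed by d."""
    return [Arc(a.center, a.radius + d, a.start, a.end) for a in curve]


def is_closed(curve, tol=1e-9):
    """True if each arc ends where the next one begins (cyclically)."""
    n = len(curve)
    for i in range(n):
        end_point = curve[i].point(curve[i].end)
        next_start = curve[(i + 1) % n].point(curve[(i + 1) % n].start)
        if math.dist(end_point, next_start) > tol:
            return False
    return True


# Hypothetical convex PC ellipse: the centers form a rhombus, and the nodes lie
# on the lines joining adjacent centers (here 6 - 1 = 5 = distance of centers).
alpha = math.atan2(4, 3)
PC_ELLIPSE = [
    Arc((3, 0), 1, -alpha, alpha),
    Arc((0, -4), 6, alpha, math.pi - alpha),
    Arc((-3, 0), 1, math.pi - alpha, math.pi + alpha),
    Arc((0, 4), 6, math.pi + alpha, 2 * math.pi - alpha),
]

outer = offset(PC_ELLIPSE, 2.0)   # outer parallel curve
inner = offset(PC_ELLIPSE, -0.5)  # inner parallel curve, still a PC ellipse
print(is_closed(PC_ELLIPSE), is_closed(outer), is_closed(inner))
```

In this example an offset of -1 would shrink the two small arcs to points, which corresponds to the lens case at distance equal to the smaller radius.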

Consideration of parallel curves is especially important in computer graphics, in 
particular in robotics, where it is necessary to find the centers of all discs of a fixed 
radius touching a given curve. In this subject, parallel curves are often called offset 
curves, obtained by moving away from the curve a given distance. For a curve 
defined by an algebraic equation, the offset curves are also algebraic, but the 
degree of the offset curves is in general much higher [R-R].

If instead of increasing all radii of arcs of a PC ellipse, we decrease all radii by 
the same amount, keeping the same centers, we obtain the family of inner parallel 
curves. As in the case of the ordinary ellipse, for sufficiently small radius, the inner 
parallel curves remain smooth, and in the PC case, they remain PC ellipses. For an 
ellipse, after a certain distance the inner parallel curve develops four cusps where 
the directed tangent line reverses direction. Similarly, at a distance equal to the 
smaller radius, the parallel PC curve degenerates into a lens, and just after this we 
obtain a four-arc PC curve with cusps, where two arcs come together at the same 
tangent line but with different directions. Beyond a certain distance, the parallel 
curve of an ellipse is again a convex curve (but not a conic section). For a convex 
PC ellipse, the inner parallel curve at distance equal to the larger radius is a lens, 
and after that, it is again a convex PC ellipse. 


EVOLUTE POLYGONS OF PC CURVES. In the case of an ellipse, the cusps of 
parallel curves trace out the evolute curve, consisting of the locus of centers of 
curvature of the ellipse. For a PC ellipse, the cusps of the parallel curves trace out 
the edges of a polygon with vertices at the centers of the arcs, called the evolute 
polygon of the PC curve [FIGURE 3]. If the radius of the inner parallel curve equals
the radius of one of the arcs of a PC curve, then that arc degenerates to a single
point. We say that a PC curve is non-degenerate if all arcs have non-zero length.
Almost all parallel curves of a PC curve are non-degenerate.


Figure 3

We can obtain smooth PC curves by starting with a sequence of circles, each 
one tangent to its successor. The points of tangency divide each circle into two 
arcs, and choosing one arc from each circle gives a PC curve with a well-defined 
tangent line at each node where two successive arcs meet. If we wish the resulting 
PC curve to be smooth, then once we have chosen an arc from the first circle, the 
arcs on all subsequent circles are uniquely determined. If the last circle is tangent 
to the first, then this construction gives a closed PC curve. If each of the circles is 
externally tangent to the next, then the resulting PC curve will be smooth if
the number of arcs is even [FIGURE 4], while if the number of arcs is odd, then we 
inevitably obtain a cusp when we return to the starting point. 


Figure 4 


If two successive circles are externally tangent, then the node of the PC curve 
will either be a smooth inflection point [FIGURE 5a] if the tangent lines have the
same direction or an ordinary cusp [FIGURE 5b] if the directions are different. In
each of these cases, the two arcs lie on opposite sides of their common tangent line
at the node. If two circles are internally tangent, then the two arcs at the node lie
on the same side of their common tangent line, and we obtain either a smooth
locally convex point [FIGURE 5c] if the tangent lines have the same direction or a
rhamphoid cusp [FIGURE 5d] if the directions are different.


(a) (b) (c) (d) 


Figure 5 


Once we begin to consider curves with cusps, we can obtain many of them from 
the same collection of successively tangent circles. Selecting one of the two 
possible arcs of each of the circles, we get $2^n$ such curves if there are $n$ circles. If $n$
is odd, we obtain two such curves with cusps at all nodes [FIGURE 6a-d].


(a) (b) (c) (d)


Figure 6


THREE-ARC PC CURVES. If two of the circles are inside a third, we can form 
eight PC curves in this way, leading to three distinct types, each of which appears 
in a classical guise. If the inner circles have half the radius of the outer one, then 
one such curve is the Yin-Yang curve, with one convex smooth node, one inflection
node, and one node which is a rhamphoid cusp [FIGURE 7a]. From the same set of
circles we can form the PC cardioid with two smooth convex points and one
ordinary cusp. This curve appears in the work of the eighteenth century Jesuit
geometer Roger Boscovich as an example of a non-centrally symmetric curve with
a center of length, so that every line through this center cuts the curve into two
pieces of equal length [FIGURE 7b]. A third type of PC curve determined by these
three circles is the arbelos, or shoemaker’s knife, with two rhamphoid cusps and
one ordinary cusp. This curve was originally studied by Archimedes and Pappus,
and it has inspired numerous articles in recreational mathematics, for example
[Ba], [Ga], and [H].


(a) (b)


Figure 7


EVOLUTES, INVOLUTES, AND OSCULATING CIRCLES. The sequence of cen- 
ters of the circular arcs of a PC curve determines the evolute polygon of the curve. 
For a PC ellipse, the evolute polygon is a rhombus, and we can find non-convex PC 
curves with the same rhombus as their evolute polygon [FIGURE 4 and FIGURE 8].
Any parallel curve of either of these PC curves will be a PC curve with the same 
evolute polygon. 


Figure 8 


If we start with a sequence of circles, each one internally tangent to its successor 
and contained within it, then we obtain a PC spiral. The evolute polygon of the 
spiral will be a locally convex polygonal arc, and we may recover the spiral by a 
“string construction”. We think of a string attached at one end of the polygonal
arc and pulled tightly along it. As we unwind the string, keeping it tightly along the
polygon at all times, the endpoint of the string traces out a PC curve with the
polygon as evolute polygon [FIGURE 9]. By using a longer string, we may construct
a parallel PC curve with the same evolute polygon. Such PC spirals have been used
by several authors in computer-aided design as a means of approximating curves
with increasing curvature [M-P].


Figure 9

For a smooth curve with continually increasing curvature, the best approximat- 
ing circle at a point, called the osculating circle at the point, is defined by the 
properties that it is tangent to the curve at the point and it crosses from one side of 
the curve to the other near that point. The evolute curve is then the locus of 
centers of osculating circles at the points of the curve. At a node of a convex PC 
curve, the circles which are tangent to the curve at the node and which cross from 
one side of the curve to the other near the point have their centers on the segment 
joining the centers of the two arcs that meet at the node [FIGURE 10]. For this
reason we may consider the evolute polygon as the locus of centers of “osculating
circles” of the PC curve. At an inflection node, the circles tangent to the curve that
cross from one side of the curve to the other have their centers on the line 
containing the centers of the arcs meeting at the node, but on the two rays that are 
the complement of the segment joining the two centers. In this case, the focal 
polygon is said to go to infinity. 


Figure 10 


For a smooth spiral with continually increasing curvature, the radii of the 
osculating circles continually decrease, and conversely. A point where the curva- 
ture stops increasing and begins decreasing or conversely is called a vertex. For a 
PC curve, a vertex arc is an arc such that both adjacent arcs either are inside the 
circle of the arc or outside it. For a convex PC ellipse, each arc is a vertex arc. 


FOUR-ARC PC CURVES. In the remainder of this article, we will discuss some 
results about closed four-arc PC curves, and point out an interesting connection 
between these and four-bar linkages in the plane. In effect, we show that all closed 
four-arc PC curves can be generated in a simple way from a very special class of 
“collapsed” quadrilaterals. 

Let us establish some notational conventions. Let $C_i$ be a circle with center $c_i$,
$i = 1, 2, 3, 4$, each $C_i$ being tangent to $C_{i+1}$. (We adopt the convention that all
subscripts are to be reduced modulo 4, so for example $C_5$ is the same as $C_1$.) Our
PC curve $C$ will be made from successive arcs of the $C_i$, so that the quadrilateral
$c_1c_2c_3c_4$ is the evolute polygon of $C$. The nodes of $C$ will be denoted $p_i$, with $p_i$ as
the node where $C_{i-1}$ meets $C_i$ [FIGURE 11]. Finally the side-lengths of the evolute
polygon will be denoted by $l_i$: this is the distance between $c_{i-1}$ and $c_i$. In the
example of FIGURE 11 it is clear, by splitting each $l_i$ into a sum of two radii, that
$l_1 + l_3 = l_2 + l_4$. Whenever we have a PC curve based on the above quadrilateral,
we must have some relation of the form

$$l_1 = \pm l_2 \pm l_3 \pm l_4 \tag{1}$$

for some choice of plus or minus signs.


Figure 11
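
As a small worked check of relation (1), the sketch below takes four circles, each externally tangent to the next, computes the side lengths of the evolute polygon as sums of adjacent radii, and lists the sign choices for which (1) holds. The function names and the radii 1, 2, 3, 4 are hypothetical values picked for illustration; the code assumes such a chain of tangent circles exists and only does the arithmetic.

```python
from itertools import product


def side_lengths_external(radii):
    """Side lengths l_i = r_(i-1) + r_i for circles externally tangent in a cycle.

    The list is ordered [l_1, l_2, ..., l_n], with l_1 joining the last and
    first centers (subscripts reduced cyclically).
    """
    n = len(radii)
    return [radii[i - 1] + radii[i] for i in range(n)]


def sign_patterns(lengths):
    """All sign choices (s_2, ..., s_n) with l_1 = s_2*l_2 + ... + s_n*l_n."""
    first, rest = lengths[0], lengths[1:]
    return [signs
            for signs in product((1, -1), repeat=len(rest))
            if sum(s * l for s, l in zip(signs, rest)) == first]


lengths = side_lengths_external([1, 2, 3, 4])
print(lengths)                 # side lengths l_1, l_2, l_3, l_4
print(sign_patterns(lengths))  # sign choices satisfying relation (1)
```

For these radii the only sign choice that works is (+, -, +), which is the relation $l_1 + l_3 = l_2 + l_4$ read off from the figure.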

Suppose we start with two circles, $C_1$ and $C_3$. What choice do we have for the
centers of the remaining two circles? If, for example, the circles $C_1$ and $C_3$ are
external to each other, as in FIGURE 12a, then a circle $C_2$ tangent to both has
$|l_2 - l_3|$ equal to the sum or difference of the radii of $C_1$ and $C_3$. Thus, by a
standard property of hyperbolas, the center of $C_2$ (and likewise of $C_4$) lies on one
of two hyperbolas with foci at $c_1$ and $c_3$. For one hyperbola, $C_1$ and $C_3$ are both
outside or both inside $C_2$; for the other hyperbola, one is outside and one is inside.
If the radii of $C_1$ and $C_3$ are equal, then one of the hyperbolas degenerates to a
straight line. If $C_1$ and $C_3$ are differently placed (for example if one is inside the
other), or if one of the circles becomes a straight line (a “circle of infinite radius”),
then there may be changes in the locus of possible centers for $C_2$. However, as the
reader may verify, this locus always consists of two conic sections, i.e. an ellipse, a
parabola, a hyperbola, or a straight line.
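
The hyperbola property can be verified numerically in the simplest case, where the two given circles are external to each other and the third circle is externally tangent to both. In the sketch below the coordinates (foci at (-f, 0) and (f, 0)), the function names, and the sample radii are all assumptions made for the illustration; the radii of the two given circles are taken to be different so that the hyperbola is not degenerate.

```python
import math


def center_of_tangent_circle(f, r1, r3, rho):
    """Upper center of a circle of radius rho externally tangent to both
    the circle (center (-f, 0), radius r1) and the circle (center (f, 0), radius r3)."""
    d1 = r1 + rho  # required distance to the first center
    d3 = r3 + rho  # required distance to the second center
    x = (d1 * d1 - d3 * d3) / (4 * f)
    y_squared = d1 * d1 - (x + f) ** 2
    if y_squared < 0:
        raise ValueError("rho is too small for the circle to reach both circles")
    return x, math.sqrt(y_squared)


def hyperbola_residual(f, r1, r3, point):
    """Value of x^2/a^2 - y^2/b^2 - 1 for the hyperbola with foci (-f, 0), (f, 0)
    and 2a = r1 - r3 (requires r1 != r3); zero means the point lies on it."""
    a = (r1 - r3) / 2
    b_squared = f * f - a * a
    x, y = point
    return x * x / (a * a) - y * y / b_squared - 1


f, r1, r3 = 2.0, 1.0, 0.5
for rho in (1.5, 2.0, 5.0):
    center = center_of_tangent_circle(f, r1, r3, rho)
    difference = math.dist(center, (-f, 0)) - math.dist(center, (f, 0))
    print(rho, round(difference, 12), abs(hyperbola_residual(f, r1, r3, center)) < 1e-9)
```

Whatever the radius of the tangent circle, the difference of the two center distances stays equal to the difference of the given radii, so all the centers fall on one branch of the same hyperbola.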

A particularly interesting construction which can be carried out for any PC
curve C is to consider the full locus of centers of bitangent circles, i.e. circles
tangent to C in at least two places. This locus, which necessarily includes the 
centers of the arcs making up C, is the symmetry set of C. By the above remarks, 
the symmetry set consists of arcs of conic sections; in fact, two consecutive arcs will 
always meet with a common tangent line. In the final section of this paper, we give 
an intriguing example of a four-arc PC curve which has an isolated point on its 
symmetry set; a full discussion of symmetry sets of PC curves appears in [Ba-G2]. 

Suppose now that we have four circles with each tangent to the next. As pointed 
out above, exactly two of the sixteen possible PC curves made up from arcs of 
these four circles are smooth.


(a) (b)
Panel labels: $l_2 - l_3 = r_1 - r_3$ and $l_2 - l_3 = r_1 + r_3$


Figure 12


In the case of four externally tangent circles (as in FIGURE 4), it is clear that the
arcs making up one of these smooth PC curves must alternate between clockwise 
and counter-clockwise orientation. With the usual convention that clockwise curves 
have negative curvature and counter-clockwise curves have positive curvature, this 
implies that each of the four arcs is a vertex arc, as defined previously. The reader 
may like to verify that, with configurations other than externally tangent circles, all 
four arcs remain vertex arcs so long as the PC curve remains smooth and does not 
intersect itself. This establishes a version of the Four Vertex Theorem for four-arc 
PC curves. More generally, one can show that any smooth closed PC curve which 
does not intersect itself has at least four vertex arcs. This can be proved using an 
argument analogous to that of Osserman, for the classical Four Vertex Theorem 
[O]. 


PC CURVES AND FOUR-BAR LINKAGES. It is instructive to regard the evolute 
polygon as a linkage in the plane. This amounts to thinking of the edges as rigid
rods connected at the endpoints $c_i$. We usually take $c_1$ and $c_4$ as fixed in position
in the plane and we allow the other three rods to turn about their endpoints. As
the quadrilateral changes in shape, so will any PC curve with this quadrilateral as
its evolute. We may note that a collection of four tangential circles centered at the
vertices $c_i$ will roll on each other without slipping as the linkage changes shape.

The theory of four-bar linkages has been studied extensively. In that theory, it is 
shown that a linkage can move continuously from any position into a collapsed 
position, where all the centers are on a straight line (and hence all nodes lie on the 
same line, too), provided that some relation of the form (1) holds (where $l_1$ is the
length of the fixed rod). In certain cases, namely


$l_1 = l_2 + l_3 + l_4$ and $l_1 = -l_2 + l_3 - l_4$,


there is only one possible position for the linkage and that is when it is in a 
collapsed position, so the result holds automatically in these cases. We state and 
prove the result below, using an elementary argument; for a more general setting 
of this result, see for example [G-N]. 

For us, the main significance of the collapsing lemma is a sort of converse 
construction: 


Proposition A. Every closed four-arc PC curve can be obtained by starting with one 
based on a collapsed quadrilateral and “uncollapsing” it, keeping the radii of the
circles unchanged as the quadrilateral moves away from the collapsed position. 


Note that since the radii remain unchanged, so do the edge lengths and the PC 
curve remains closed as the quadrilateral uncollapses. The proposition is an 
immediate consequence of the following lemma: 


Collapsing Lemma. Any quadrilateral satisfying (1) can be continuously collapsed 
so that its four vertices are collinear. 


Proof of Lemma: Let us fix $l_1 = 1$ and take $c_4 = (0, 0)$, $c_1 = (1, 0)$,
$c_3 = (l_4 \cos t, l_4 \sin t)$ as in FIGURE 13a. The condition for $c_2$ to exist for this
position of $c_3$ is $|l_2 - l_3| \le d \le l_2 + l_3$, where $d$ is the distance from $c_3$ to $c_1$.
This is equivalent to

$$l_4^2 + 1 - (l_2 + l_3)^2 \le 2 l_4 \cos t \le l_4^2 + 1 - (l_2 - l_3)^2.$$

The allowable values of $t$ are therefore those points on the unit circle between
two vertical lines, one or both of which may actually miss the circle [FIGURE 13b].
We need only check that

(i) if $l_4 \pm l_2 \pm l_3 = 1$, then $t = 0$ is an allowable value, that is,
$2 l_4 \le l_4^2 + 1 - (l_2 - l_3)^2$;

(ii) if $-l_4 \pm l_2 \pm l_3 = 1$, then $t = \pi$ is an allowable value, that is,
$l_4^2 + 1 - (l_2 + l_3)^2 \le -2 l_4$.

Since, in (i), $(l_4 - 1)^2 = (l_2 \pm l_3)^2$, and, in (ii), $(l_4 + 1)^2 = (l_2 \pm l_3)^2$, the results
are immediate.


(a) (b)


Figure 13
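
The inequality in the proof is easy to test on concrete numbers. The sketch below is an illustration under stated assumptions: it normalizes the fixed rod to length 1, treats the rod of length `l4` as the crank attached to the fixed rod, and uses exact rational arithmetic so that boundary cases are not blurred by rounding. The function names and the sample side lengths are hypothetical.

```python
from fractions import Fraction
from itertools import product


def cos_range(l2, l3, l4):
    """Interval [lo, hi] in which cos t must lie for the linkage to close,
    with the fixed rod of length 1 and the crank of length l4."""
    lo = (l4 * l4 + 1 - (l2 + l3) ** 2) / (2 * l4)
    hi = (l4 * l4 + 1 - (l2 - l3) ** 2) / (2 * l4)
    return lo, hi


def collapsed_cosines(l2, l3, l4):
    """Values of cos t among 1 (t = 0) and -1 (t = pi) that are allowable,
    i.e. the collapsed positions the linkage can actually reach."""
    lo, hi = cos_range(l2, l3, l4)
    return [c for c in (1, -1) if lo <= c <= hi]


def satisfies_relation(l2, l3, l4):
    """True if 1 = (+/-)l2 (+/-)l3 (+/-)l4 for some choice of signs."""
    return any(s2 * l2 + s3 * l3 + s4 * l4 == 1
               for s2, s3, s4 in product((1, -1), repeat=3))


F = Fraction
examples = [
    (F(9, 10), F(6, 10), F(7, 10)),  # l4 + l2 - l3 = 1, case (i)
    (F(9, 10), F(6, 10), F(1, 2)),   # -l4 + l2 + l3 = 1, case (ii)
    (F(1), F(1, 2), F(1)),           # no sign choice gives 1
]
for l2, l3, l4 in examples:
    print(satisfies_relation(l2, l3, l4), collapsed_cosines(l2, l3, l4))
```

In the first two examples a collapsed position is reachable, as the lemma predicts; in the third, where no relation of the form (1) holds, neither t = 0 nor t = pi is allowable.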

FIGURE 14a gives an example of a closed four-arc PC curve in which the four
centers have become collinear. Despite its ordinary appearance, however, this
construction is very special: in this case, the lengths satisfy $l_1 = l_3$ and $l_2 = l_4$,
which implies that the quadrilateral of centers, before collapsing, was a parallelogram.
As the quadrilateral unfolds, the PC curve evolves as shown in FIGURE 14b.
On the other hand, when no special relation holds among the $l_i$, besides (1), it
turns out that, when the quadrilateral has collapsed, the PC curve has become
degenerate. This is the content of the following result:


(a) (b)


Figure 14 


Proposition B. Suppose that the quadrilateral of centers collapses, with the four
centers (and the four nodes) along a line. Suppose also that the four nodes $p_i$ are all
distinct. Then the quadrilateral is a parallelogram (i.e. $l_1 = l_3$ and $l_2 = l_4$).


Remark. It is “usually” true that, in the non-parallelogram case, all four nodes
coincide. More precisely, if the centers $c_i$ are all distinct and there is no restriction
on the placing of the first node $p_1$, then either the quadrilateral is a parallelogram
or all four nodes $p_i$ coincide.


Proof of Proposition B: Since the centers and nodes are along a line, we can take
this line to be the x-axis and describe them by their x-coordinates. Since \(p_1 \ne p_2\),
we must have \(p_2 = 2c_1 - p_1\), since \(c_1\) is the center of the segment from \(p_1\) to \(p_2\).
Similarly \(p_3 = 2c_2 - 2c_1 + p_1\), and \(p_4 = 2c_3 - 2c_2 + 2c_1 - p_1\). Going one more
step brings us back to \(p_1\), that is, \(p_1 = 2c_4 - 2c_3 + 2c_2 - 2c_1 + p_1\). This gives
\(c_1 + c_3 = c_2 + c_4\), which implies both \(l_1 = l_3\) (i.e., \(|c_1 - c_2| = |c_3 - c_4|\)) and
\(l_2 = l_4\) (i.e., \(|c_1 - c_4| = |c_2 - c_3|\)). The remark is
proved by examining all possibilities for \(p_2, p_3, p_4\), given an initial \(p_1\).
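
The reflection argument in the proof can be tried out numerically. The following Python sketch is only an illustration: the function name `nodes_on_line` and the sample coordinates are assumptions made here, not part of the paper. It reflects a starting node through four collinear centers in turn (which is what distinct consecutive nodes force) and shows that the chain closes up exactly when \(c_1 + c_3 = c_2 + c_4\).

```python
def nodes_on_line(centers, p1):
    """Reflect p1 through each center in turn: p_{i+1} = 2*c_i - p_i.

    Returns [p1, p2, ..., p_{n+1}]; the curve closes when the last entry equals p1.
    """
    nodes = [p1]
    for c in centers:
        nodes.append(2 * c - nodes[-1])
    return nodes


# Hypothetical sample data. Here c1 + c3 == c2 + c4 (0 + 5 == 3 + 2),
# so the side lengths satisfy l1 = l3 = 3 and l2 = l4 = 2.
closed = nodes_on_line([0, 3, 5, 2], p1=-2)

# Here c1 + c3 != c2 + c4 (0 + 5 != 3 + 1), so the chain cannot close.
not_closed = nodes_on_line([0, 3, 5, 1], p1=-2)

print(closed)      # the first and last entries agree
print(not_closed)  # the first and last entries differ
```

In the second case the only way to close the curve is to let some consecutive nodes coincide, which is the degenerate situation described in the Remark.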

When two consecutive nodes of a PC curve coincide, we can take the arc joining 
them either as a complete circle or as a mere point. FIGURE 15a—c shows a PC 
curve growing out of a collapsed polygon of centers where all but one of the arcs is 
taken as a point. 


(a) (b) 


(c) 


Figure 15 


CONCLUDING REMARKS. Many of the topics we have introduced in this paper 
can be taken much further. Here we mention some natural extensions. 

Closed PC curves with n arcs and a given evolute polygon fall into two classes, 
depending on whether the number of cusps is even or odd (smooth PC curves have 
zero cusps, an even number). 

If the number of cusps is even, there is always a relation between the side-lengths 
of the polygon of the form 


\[ l_1 = \pm l_2 \pm l_3 \pm \cdots \pm l_n, \qquad (2) \]
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analogous to (1) above. Furthermore, in this case, the radii of the PC curve can be 
varied to give a family of parallel PC curves. Examples are given above in FIGURES 
1, 2b, 4, and 11. 

If the number of cusps is odd, there is no restriction on the sides. There is a 
unique closed curve with a given evolute polygon, and the radii cannot be varied. 
Examples are the three-arc PC curves in FIGURES 5 and 6. Note that three-arc PC
curves always have an odd number of cusps. 

The extension of Proposition A and the Collapsing Lemma to PC curves with 
more than four arcs is straightforward but more complicated. We know of no easy 
proof that a polygon satisfying (2) for some choice of signs necessarily collapses 
continuously to a position where all the nodes are collinear. It would appear that 
this should be easier to achieve as n becomes larger, since the polygon becomes 
“floppier” as it has more degrees of freedom. 

Finally we mention one remarkable example of a symmetry set of a PC curve. 
The four-arc PC curve C in FIGURE 16a has a biosculating circle S, i.e. S is an
osculating circle at two points p and q. By definition, the center of S is part of the 
symmetry set of C, but it is an isolated point of the symmetry set since there are no 
circles near S that are tangent to C at two points. If we perturb the curve C by 
moving the node at p slightly counter-clockwise round C,, and adjusting C, and 
C, accordingly, a family of bitangent circles appears to grow out from S. An 
enlarged picture of the locus of centers of curvature of these bitangent circles is 
shown in FIGURE 16b. If we move p the other way, all of these bitangent circles
come together and disappear. In the study of symmetry sets of one-parameter 
families of plane curves, this transition is called a moth. In [G-B] and [Br-G] there 
are extensive discussions of such transition phenomena for general smooth curves. 
Although many of the transitions for symmetry sets of general smooth curves 
already appear in the study of plane polygons, as in [Ba-G1], not all of them do, 
and part of our motivation for studying PC curves was an attempt to find an 
elementary class of curves for which all general phenomena were already present. 


(a) (b) 


Figure 16 


A full discussion of the symmetry sets of PC curves appears in [Ba-G2]. Many of 
the notions in this paper generalize to PC curves in space and in higher dimen- 
sions, and we intend to pursue these ideas in a subsequent paper. 
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Pascal’s Theorem 


The proof of Pascal’s Theorem mentioned in Professor van Yzeren’s article (MONTHLY 100, pp. 
930-931) is not my own but the proof I learned in 11th grade Descriptive Geometry at the
Mathematisches-Naturwissenschaftliches Gymnasium of Basel, Switzerland. When I wrote the 
book I therefore assumed that the proof was part of everybody’s general mathematical 
education. I am quite sure that this proof was absorbed by Swiss Type C (science, A is classical 
languages, B modern languages) students for at least 50 years. It also appears in what was the 
standard Swiss high school text of Descriptive Geometry (Flükiger). I did check in Italian and
German D.G. texts; the Italians do not have Pascal’s theorem and a German University text does 
not prove it. Unfortunately, I do not have Austrian high school D.G. texts but I assume that at 
least pre-World War I Austrian texts did present a similar proof. To find out whether Dr. van 
Yzeren’s proof was known somewhere one would have to comb through the school literature of 
the few countries that did require descriptive geometry in their high school curriculum. 


Unfortunately, New Math has succeeded in destroying much of European education almost as 
much as it did American education. 


HT. Guggenheimer 
P.O. Box 401 
West Hempstead, NY 11552 
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The Two Envelope Paradox 


Elliot Linzer 


In a question and answer column in Parade magazine [1], a reader asks: 


I am asked to select one of two envelopes and told that one contains twice as much money as the 
other. I find $100 in the envelope that I select. Should I switch to the other one to improve my 
worldly gains? 


The column’s author answers that while it appears that switching is a good 
idea—$100 buys even odds at getting $50 or $200, which will average $125—it 
“actually makes no difference at all” whether or not you switch. That answer is not 
quite correct. Indeed, the question seems to present a paradox: while it cannot 
always pay to switch, switching seems to increase the average take by 25%. This 
apparent paradox is explained in this note. 

To be somewhat more specific, let us assume that a “host” chooses a random
positive number and puts that many dollars in one envelope, say envelope A, and
twice that many dollars in envelope B. Denote by \(p_x\) the probability that \(x\) is the
number the host chose at random. (For simplicity, I am assuming a discrete
distribution.) We choose one envelope (which will be A with probability 1/2) and
look inside. Denote by \(q_x\) the probability that the chosen envelope is envelope A
given that we see \(x\) dollars in that envelope.

That probability is clearly the same as the probability that the host’s random 
number was x given that it was either x or x/2. Therefore, 


\[ q_x = \frac{p_x}{p_x + p_{x/2}}. \]

If the chosen envelope contains \(x\) dollars, the expected value of the amount of
money in the other envelope is \(2xq_x + 0.5x(1 - q_x)\), which is larger than \(x\) when
\(p_x > 0.5\,p_{x/2}\). Thus the answer to the above question is: it depends on \(p_{100}\) and
\(p_{50}\); because those parameters are not given, the question cannot be answered.

It must be the case that the average return from always switching is the same as
the average return from always staying. Consider the following. Suppose \(p_{50} = p_{100} = 1/2\),
and \(p_x = 0\) otherwise. There will be $50 in the chosen envelope only if the
host chooses $50 and we pick envelope A. Thus with probability 0.25 we will find
$50 in the chosen envelope, and in that case we will get $100 if we switch. Also
with probability 0.25, we will find $200 in the chosen envelope, and in that event
we will get $100 if we switch. We will find $100 in the chosen envelope with
probability 0.5 (because that happens if the host chooses $50 and we pick
envelope B or if the host chooses $100 and we pick envelope A), and in that event
we will average $125 by switching. Overall, if we always switch we average
\(0.25 \times 100 + 0.5 \times 125 + 0.25 \times 100 = 112.5\) dollars. If we always stay we will
average \(0.25 \times 50 + 0.5 \times 100 + 0.25 \times 200 = 112.5\) dollars. By contrast, if we
employ a smart strategy and switch if we find $50 or $100 but stay if we find $200,
we will average \(0.25 \times 100 + 0.5 \times 125 + 0.25 \times 200 = 137.5\) dollars.
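
The three averages can be recomputed exactly. The Python sketch below is an illustration under stated assumptions: the function `average_take` and its `switch_if` argument are names invented here, and the prior is the two-point distribution of the example (the host chooses 50 or 100 with probability 1/2 each).

```python
from fractions import Fraction


def average_take(prior, switch_if):
    """Exact expected payoff of a strategy.

    prior maps the host's number x to its probability p_x.
    switch_if(seen) returns True when we swap after seeing `seen` dollars.
    """
    total = Fraction(0)
    for x, p in prior.items():
        # Envelope A holds x and envelope B holds 2x; we open each with probability 1/2.
        for seen, other in ((x, 2 * x), (2 * x, x)):
            total += p * Fraction(1, 2) * (other if switch_if(seen) else seen)
    return total


prior = {50: Fraction(1, 2), 100: Fraction(1, 2)}
always_switch = average_take(prior, lambda seen: True)
always_stay = average_take(prior, lambda seen: False)
smart = average_take(prior, lambda seen: seen < 200)
print(float(always_switch), float(always_stay), float(smart))
```

Exact fractions are used so that the equality of the first two averages is not blurred by rounding.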

It is instructive to try to create a probability distribution for which it always
pays to switch. If it always pays to switch, then we must have \(p_x > 0.5\,p_{x/2}\)
whenever it is possible to choose an envelope and find \(x\) dollars inside. (That is,
whenever \(p_x > 0\) or \(p_{x/2} > 0\).) Let’s assume that that is the case and see what
happens. Choose \(x_0\) so that \(p_{x_0} > 0\). Envelope B contains \(2x_0\) dollars with
probability \(p_{x_0} > 0\), so it is possible to choose an envelope and find \(2x_0\) dollars
inside. Because we are assuming that it always pays to switch, we must have
\(p_{2x_0} > 0.5\,p_{x_0} > 0\). Envelope B contains \(4x_0\) dollars with probability \(p_{2x_0} > 0\), so it
is possible to choose an envelope and find \(4x_0\) dollars inside; therefore,
\(p_{4x_0} > 0.5\,p_{2x_0} > 0.25\,p_{x_0} > 0\). Continuing in this way, we see that

\[ p_{2^i x_0} > 2^{-i} p_{x_0}. \]

The average value of the random number chosen by the host is

\[ \sum_x x\,p_x \ge \sum_{i=0}^{\infty} 2^i x_0\,p_{2^i x_0} > \sum_{i=0}^{\infty} x_0\,p_{x_0} = \infty. \]

Thus we can (seemingly) always increase the average take by switching only if the
average amount of money that the host puts in envelope A (and, therefore, the
average that we will get by switching or staying) is infinite.

As an example of when it can seem to always pay to switch, consider the
following distribution. Let

\[ p_{2^i} = \frac{1}{3}\left(\frac{2}{3}\right)^{i} \]

if \(i\) is a non-negative integer and \(p_x = 0\) if \(x\) is not a power of 2. For positive
integers \(i\) the probability of having chosen envelope A given that the chosen
envelope has \(2^i\) dollars in it is \(q_{2^i} = 0.4\). If, for a positive integer \(i\), we find \(2^i\)
dollars in the chosen envelope, the average amount of money that we will find in
the other envelope is \(2 \times 2^i \times 0.4 + 0.5 \times 2^i \times 0.6 = 1.1 \times 2^i\) dollars. The only
other possible dollar amount that we can find in the chosen envelope is $1, and in
that case we will get $2 by switching envelopes. We therefore seem to always
increase the average return by switching. However, if we calculate the average
return we get by always switching it will be the same as the average return we get
by always staying; in both cases the average return is infinite.
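
The numbers in this example can be checked with a few lines of Python. This is a sketch under an explicit assumption: it takes the distribution to be \(p_{2^i} = (1/3)(2/3)^i\) for \(i \ge 0\), which is the choice that sums to 1 and yields the stated value 0.4 for the conditional probability. The function names are invented for the illustration.

```python
from fractions import Fraction


def p(i):
    """Assumed probability that the host chooses 2**i dollars (i = 0, 1, 2, ...)."""
    return Fraction(1, 3) * Fraction(2, 3) ** i


def q(i):
    """Probability that we hold envelope A given that we see 2**i dollars (i >= 1)."""
    return p(i) / (p(i) + p(i - 1))


def other_envelope_factor(i):
    """Expected contents of the other envelope, as a multiple of 2**i (i >= 1)."""
    return 2 * q(i) + Fraction(1, 2) * (1 - q(i))


def partial_mean(terms):
    """Partial sum of sum_i 2**i * p(i), the host's average number."""
    return sum(2 ** i * p(i) for i in range(terms))


print(q(3), other_envelope_factor(3))
print(float(partial_mean(50)), float(partial_mean(100)))
```

The partial sums grow like \((4/3)^N\), which is the numerical face of the statement that the average amount in envelope A is infinite.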

It is interesting that the explanation given is in accordance with the intuitive 
answer to the question of whether or not to switch envelopes. If most people were 
faced with the choice of switching envelopes or keeping what they saw in the first 
envelope, they would switch if they saw a small amount but keep a large amount. 
This is due not only to the fact that if someone is assured he will get a very large
sum of money he would not be very interested in risking half of it for larger gains
(even if the odds were in his favor), but also because he knows that the host only has a
finite amount of money (and is probably not planning on giving too much away).
So in one sense the answer may be that you’ve got to guess on the probability 
distribution that the host is using and decide accordingly. 
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Disorderly Currencies 


The article “Orderly Currencies” (Jan. 94, pp. 36-38) has a lovely topic and a charming style. 
Unfortunately, the one theorem is wrong. 


A coinage system is orderly if to give change with the fewest coins it suffices to use a greedy
algorithm: as many of the largest coin as possible, then of the next largest, etc. The article claims that
to be orderly it is necessary and sufficient for the denominations \(1 = d_1 < d_2 < \cdots < d_n\) to satisfy

\[ d_j \ge 2d_{j-1} - d_{j-2}, \qquad 3 \le j \le n. \qquad (1) \]

Consider a coinage with denominations 1, 6, 11, 17. These meet (1), yet the greedy algorithm on the
amount 22 gives one 17 coin and five 1’s, whereas only two 11’s are needed. So (1) is not sufficient.

Neither is (1) necessary. Consider the denominations 1, 5, 10, 20, 25, 40. With a little work, one can
show these form an orderly currency. However, for \(j = 5\), condition (1) fails, as \(25 < 2 \times 20 - 10\).
True, if we delete the 40 piece, then the currency is not orderly. And indeed, if we insist that every
initial sequence \(d_1, \ldots, d_j\) be orderly, then it turns out that (1) is necessary. But the article does not
require initial sequences to be orderly, and the proof is not trivial.
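
Both counterexamples are easy to test by machine. The following Python sketch is offered as an illustration, not as the method of the cited paper: it compares the greedy count with an optimal count found by dynamic programming, over all amounts up to an arbitrary limit of 200 (an assumption made here; a finite search like this supports, but does not by itself prove, orderliness). All function names are invented for the example.

```python
def greedy_count(amount, coins):
    """Number of coins the greedy algorithm hands out for `amount`."""
    count = 0
    for c in sorted(coins, reverse=True):
        count += amount // c
        amount %= c
    return count


def optimal_counts(limit, coins):
    """best[a] = fewest coins summing to a, for 0 <= a <= limit (coins must include 1)."""
    best = [0] * (limit + 1)
    for a in range(1, limit + 1):
        best[a] = 1 + min(best[a - c] for c in coins if c <= a)
    return best


def first_failure(coins, limit=200):
    """Smallest amount up to `limit` where greedy is not optimal, or None."""
    best = optimal_counts(limit, coins)
    for a in range(1, limit + 1):
        if greedy_count(a, coins) != best[a]:
            return a
    return None


def meets_condition_1(coins):
    """Condition (1): d_j >= 2*d_{j-1} - d_{j-2} for every j >= 3."""
    d = sorted(coins)
    return all(d[j] >= 2 * d[j - 1] - d[j - 2] for j in range(2, len(d)))


for coins in ([1, 6, 11, 17], [1, 5, 10, 20, 25, 40], [1, 5, 10, 20, 25]):
    print(coins, meets_condition_1(coins), first_failure(coins))
```

The first system satisfies (1) yet has a failing amount, while the second violates (1) and shows no failing amount in the range searched.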


There is a substantial literature on this subject, though it’s hard to look up because it’s hard to 
guess the right key words. My assertions and examples come from 


M. J. Magazine, G. L. Nemhauser and L. E. Trotter Jr., When the greedy solution solves
a class of knapsack problems, Operations Research 23 (1975) 207-217. 


See Theorem 1 and its corollaries. These authors actually discuss a more general problem, which in the
terminology of the Monthly article could be called coin changing with minimum mass. 


An alternative exposition of the result of Magazine et al. appears in Section 6.1 of “Combinatorial
Algorithms” by T. C. Hu (Addison-Wesley, 1982). Further references with brief annotations appear at
the end of the chapter. 


As far as I know, a good necessary and sufficient condition for orderly coinage has never been 
found. The article above gives a decent necessary and sufficient condition if each initial sequence of
the coinage must be orderly. 


Actually, before the days of electronic cash registers, which tell the clerk exactly how much change
to give, clerks did not use the greedy algorithm, because they counted up to make change. That is, if
you paid $1 for an 83¢ item, the clerk counted 84 (as he gave out a penny), 85, 90, a dollar. The
advantage for clerks of counting up was that they didn’t need to know how to subtract! For us, though, 
the interesting point is that clerks were not using the greedy algorithm. Yet they (almost) always gave 
out the fewest coins. So what algorithm were clerks using, and why was it optimal? Readers might like 
to ponder this. I heard this question raised by Peyton Young, in December 1975 at Princeton, at the 
end of a seminar talk by Les Trotter about his paper. In correspondence with Trotter (unpublished) I 
gave an answer to this question that seems satisfactory for real-world currencies. 


Stephen B. Maurer 

Department of Mathematics & Statistics 
Swarthmore College 

Swarthmore PA 19081-1397 

smaurer 1@cc.swarthmore.edu 
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Fourier Series of Polygons 


Alain Robert 


It is generally admitted (among mathematicians...) that the complex form of 
Fourier series is the easiest to discuss. It has two main advantages:

• it is economical on the notational side:

\[ \sum_{k=-\infty}^{\infty} c_k e^{ikt} \quad \text{instead of} \quad a_0 + \sum_{k \ge 1} (a_k \cos kt + b_k \sin kt), \]

• it is straightforward for the discussion of absolute convergence:

\[ |c_k e^{ikt}| = |c_k|. \]


For the visualization of periodic functions however, one usually tends to 
separate real and imaginary parts, and draw separately the graphs of these two 
periodic functions. 

But a complex Fourier series \(\sum_{k \in \mathbb{Z}} c_k e^{ikt}\) represents a \(2\pi\)-periodic map

\[ f : \mathbb{R} \to \mathbb{C} \]

and at least when it is continuous, can be viewed as a closed parametrized curve
\(t \mapsto f(t) \in \mathbb{C}\) in the complex plane. In general, this closed curve will have multiple
points (namely, \(f\) is not one-to-one). It is our purpose to illustrate this point of
view. In particular, we intend to show that for \(n \ge 2\)

\[ f_n(t) = \sum_{k \equiv 1\ (n)} e^{ikt}/k^2 \]

is the Fourier series of a regular \(n\)-gon in the complex plane.
In the first picture, the Fourier series of the pentagon is illustrated: partial sums
\(\sum_{k \equiv 1\ (5),\ |k| \le 5n+1} \cdots\) are plotted for \(n = 1, 2, 3, 4\) and 8.


1. THE FUNDAMENTAL FOURIER SERIES AND THE MAIN RESULT. Let
\(\mathcal{C}(\mathbb{R}/2\pi\mathbb{Z})\) be the space of continuous \(2\pi\)-periodic functions \(f : \mathbb{R} \to \mathbb{C}\). We shall
say that such a function \(f\) is a polygon if there is a finite subdivision

\[ 0 \le t_0 < \cdots < t_j < t_{j+1} < \cdots < t_{n-1} < t_n = t_0 + 2\pi \]

of \([0, 2\pi[\) such that \(f\) is affine linear in each subinterval \([t_j, t_{j+1}[\). In this case, the
image of \(f\) is a polygonal line in \(\mathbb{C}\) with vertices \(s_j = f(t_j)\), and \(f\) is a parametrization
with constant speed on each side. The set of polygons is obviously a subspace
of \(\mathcal{C}(\mathbb{R}/2\pi\mathbb{Z})\).
Let us now determine the Fourier series of a polygon \(f\). With the previous
notations, we compute the Fourier coefficients \(c_k(f)\) given by

\[ 2\pi c_k(f) = \int_0^{2\pi} f(t) e^{-ikt}\,dt = \sum_{0 \le j < n} \int_{t_j}^{t_{j+1}} f(t) e^{-ikt}\,dt, \]
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Figure 1 


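by integration by parts when \(k \ne 0\). For a typical term

\[ \int_{t_j}^{t_{j+1}} f(t) e^{-ikt}\,dt = \left[ f(t) \frac{e^{-ikt}}{-ik} \right]_{t_j}^{t_{j+1}} - \int_{t_j}^{t_{j+1}} f'(t) \frac{e^{-ikt}}{-ik}\,dt. \]

In the sum over \(j\), the integrated terms cancel out two by two (\(f\) is continuous and
periodic) and, writing \(v_j\) for the constant velocity \(f'\) on \([t_j, t_{j+1}[\), we obtain

\[ 2\pi c_k(f) = \sum_j \int_{t_j}^{t_{j+1}} v_j \frac{e^{-ikt}}{ik}\,dt = \frac{1}{ik} \sum_j v_j \int_{t_j}^{t_{j+1}} e^{-ikt}\,dt \]

\[ = k^{-2} \sum_j v_j \left[ e^{-ikt} \right]_{t_j}^{t_{j+1}} = k^{-2} \sum_j (v_{j-1} - v_j) e^{-ikt_j}. \]

With a velocity jump \(\sigma_j = v_j - v_{j-1}\) at time \(t_j\), we can write the last relation as

\[ c_k(f) = -\Big( \sum_j \sigma_j e^{-ikt_j} \Big) / (2\pi k^2). \]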


In spite of its appearance, this is a very simple expression indeed. Recall that a
translate \(\tau_a f\) of a function \(f \in \mathcal{C}(\mathbb{R}/2\pi\mathbb{Z})\) is a function of the form

\[ (\tau_a f)(t) = f(t - a). \]

As the integral formula shows, the Fourier coefficients of such a translate are
\(c_k(\tau_a f) = e^{-ika} c_k(f)\). We see now that a polygon \(f\) is a linear combination of
translates of the basic Fourier series

\[ f_1(t) := \sum_{k \ne 0} e^{ikt}/k^2 \]

and a constant term \(c_0 = c_0(f)\):

\[ f \quad \text{and} \quad c_0 + \sum_j a_j f_1(t - t_j) \]

have the same Fourier coefficients when \(a_j = -\sigma_j/(2\pi)\).

Conversely, if we consider a linear combination \(\sum_j a_j \tau_{t_j} f_1\) of translates of \(f_1\),
where \(\sum_j a_j = 0\), we can construct a parametrized polygon in \(\mathbb{C}\) with this Fourier
series. We can indeed compute the \(\sigma_j = -2\pi a_j\) and the \(t_j\), hence we can compute
the velocities \(v_j\) starting from \(v_0\) as follows:

\[ v_1 = v_0 + \sigma_1, \quad v_2 = v_0 + (\sigma_1 + \sigma_2), \quad \text{etc.} \]

We have \(v_n = v_0 + (\sigma_1 + \cdots + \sigma_n) = v_0\) by assumption. Moreover, the polygonal
line will be closed when

\[ (t_1 - t_0) v_0 + \cdots + (t_n - t_{n-1}) v_{n-1} = 0. \]

But \(t_n = 2\pi + t_0\) and reordering terms, we get

\[ 2\pi v_{n-1} = t_1(v_1 - v_0) + t_2(v_2 - v_1) + \cdots = \sum_{0 \le j < n} t_j \sigma_j \]

which fixes the value of \(v_0 = v_n = v_{n-1} + \sigma_n = v_{n-1} + \sigma_0 = \sigma_0 + \sum_{0 \le j < n} t_j \sigma_j / 2\pi\).
Hence the velocities are uniquely determined in all subintervals \([t_j, t_{j+1}[\) and the
polygon itself is determined up to a translation. Without loss of generality, we can
now restrict ourselves to centered polygons, namely those in

\[ \mathcal{C}_0 = \{ f \in \mathcal{C}(\mathbb{R}/2\pi\mathbb{Z}) : c_0(f) = 0 \}. \]


Theorem. The space of centered polygons \(V\) consists precisely of the linear combinations
\(\sum a_j \cdot \tau_{t_j}(f_1)\) of translates of

\[ f_1(t) = \sum_{k \ne 0} e^{ikt}/k^2 \]

where \(\sum a_j = 0\).


As we have just seen, the study of Fourier series of polygons can be based on
the basic expansion

\[ f_1(t) = \sum_{k \ne 0} e^{ikt}/k^2 = 2 \sum_{k \ge 1} \cos kt/k^2. \]

Its graph is easily pictured. Since the series of derivatives converges uniformly on
compact subsets of \(]0, 2\pi[\) by Abel’s criterion, it is legitimate to write

\[ f_1'(t) = \sum_{k \ne 0} i\,e^{ikt}/k = -2 \sum_{k \ge 1} \sin kt/k \quad \text{for } 0 < t < 2\pi. \]

As is known, this is the Fourier series of the \(2\pi\)-periodic function

\[ t - \pi \quad \text{for } 0 < t < 2\pi. \]

From this we infer

\[ f_1(t) = t^2/2 - \pi t + f_1(0) \quad \text{for } 0 \le t \le 2\pi. \]

Using the well-known \(\sum_{k \ge 1} 1/k^2 = \pi^2/6\), we deduce \(f_1(0) = \pi^2/3\):

\[ f_1(t) = t^2/2 - \pi t + \pi^2/3 \quad \text{for } 0 \le t \le 2\pi. \]

The graph of \(f_1\) is not piecewise linear (it is not a polygon) but the second
derivative of a linear combination of translates \(\sum a_j \cdot \tau_{t_j}(f_1)\) vanishes when \(\sum a_j = 0\)
because \(\tau_{t_j}(f_1)'' = 1\) except at finitely many points of \([0, 2\pi[\). This shows that
\(\sum a_j \cdot \tau_{t_j}(f_1)\) is piecewise linear and gives an independent algebraic proof of the
converse part of the theorem.
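
The closed form for \(f_1\) is easy to compare with a truncated series. The Python sketch below is illustrative only; the function names and the truncation at 20000 terms are choices made here. Since the discarded tail is bounded by \(2\sum_{k > N} 1/k^2 < 2/N\), the two values should agree to about four decimal places.

```python
import math


def f1_partial(t, terms=20000):
    """Truncated series 2 * sum_{k=1}^{terms} cos(k t) / k**2."""
    return 2.0 * sum(math.cos(k * t) / k ** 2 for k in range(1, terms + 1))


def f1_closed(t):
    """Closed form t**2/2 - pi*t + pi**2/3, extended 2*pi-periodically."""
    t = t % (2.0 * math.pi)
    return t * t / 2.0 - math.pi * t + math.pi ** 2 / 3.0


for t in (0.0, 1.0, math.pi, 5.0):
    print(t, f1_partial(t), f1_closed(t))
```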
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Figure 2 


Corollary. The subspace generated by the translates of \(f_1\) is dense in \(\mathcal{C}_0\).
Proof: Let \(f \in \mathcal{C}_0\). Since \(f\) is uniformly continuous, for each given \(\varepsilon > 0\), one can
find a subdivision \(\{t_j\}\) of \([0, 2\pi[\) such that

\[ |f(t) - f(t_j)| \le \varepsilon/4 \quad \text{for all } t_j \le t \le t_{j+1}. \]

The polygon \(g\) going through the points \(f(t_j)\) is a uniform approximation of \(f\):
choosing the index \(j\) suitably, we can write

\[ |f(t) - g(t)| \le |f(t) - f(t_j)| + |g(t_j) - g(t)| \le \varepsilon/2. \]

Now, the mean value of \(g\) will also be small:

\[ a = (1/2\pi) \int_0^{2\pi} g(t)\,dt = (1/2\pi) \int_0^{2\pi} (g(t) - f(t))\,dt \quad \text{satisfies } |a| \le \varepsilon/2. \]

The polygon \(g_0 = g - a\) is centered, i.e. belongs to \(\mathcal{C}_0\), and satisfies

\[ |f(t) - g_0(t)| \le |f(t) - g(t)| + |a| \le \varepsilon, \qquad \|f - g_0\| = \sup |f(t) - g_0(t)| \le \varepsilon. \]


2. FOURIER SERIES OF REGULAR N-GONS. We say that a continuous
\(2\pi\)-periodic function \(f : \mathbb{R} \to \mathbb{C}\) has a symmetry of order \(n \ge 2\) when there is an
nth root of unity \(\zeta\) such that

\[ f(t + 2\pi/n) = \zeta \cdot f(t). \]

We shall only be interested in the case \(\zeta = e^{2\pi i/n}\) in this section (in Sec. 4 below,
we shall give examples of the general case).


Theorem. Let \(f \in \mathcal{C}(\mathbb{R}/2\pi\mathbb{Z})\) be a continuous, \(2\pi\)-periodic function presenting a
symmetry of order \(n \ge 2\) in the sense \(f(t + 2\pi/n) = e^{2\pi i/n} f(t)\) \((t \in \mathbb{R})\). Then the
Fourier sequence of \(f\) satisfies

\[ c_k(f) = 0 \quad \text{if } k - 1 \text{ is not a multiple of } n, \]

\[ c_k(f) = (n/2\pi) \int_0^{2\pi/n} f(t) e^{-ikt}\,dt \quad \text{if } k - 1 \text{ is a multiple of } n. \]

Proof: The nth order symmetry simply means that the function

\[ t \mapsto e^{-it} \cdot f(t) \]

is periodic of period \(2\pi/n\). It can be expanded in a Fourier series according to the
system \((e^{imnt})_{m \in \mathbb{Z}}\) and

\[ f(t) = e^{it} \sum_{m \in \mathbb{Z}} c'_m e^{imnt} = \sum_{m \in \mathbb{Z}} c'_m e^{i(mn+1)t} = \sum_{k \equiv 1\ (n)} c_k e^{ikt}. \]

In particular, \(f \in \mathcal{C}_0\). Conversely, any exponential \(e^{ikt}\) where \(k \equiv 1 \bmod n\) has the
n-th order symmetry property required.

In other words, the basic exponentials \((e^{ikt})_{k \equiv 1\ (n)}\) constitute a Hilbert basis
for the space of \(L^2\)-functions with symmetry of order \(n\).

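We consider now more particularly the case of regular polygons in \(\mathbb{C}\), with
vertices at the nth roots \(s_j\) of 1. Put \(\zeta = \zeta_n = e^{2\pi i/n}\) so that

\[ s_j = e^{2\pi i j/n} = \zeta^j \quad (0 \le j < n). \]

Since \(\sum_{0 \le j < n} \zeta^j = 0\), the linear combination

\[ \sum_{0 \le j < n} \zeta^j f_1(t - j \cdot 2\pi/n) \]

represents a polygon. More precisely,

\[ \sum_{0 \le j < n} \zeta^j f_1(t - j \cdot 2\pi/n) = \sum_{0 \le j < n} \zeta^j \sum_{k \ne 0} e^{ik(t - 2\pi j/n)}/k^2 \]

\[ = \sum_{k \ne 0} \Big( \sum_{0 \le j < n} \zeta^{j(1-k)} \Big) e^{ikt}/k^2 = n \sum_{k \equiv 1\ (n)} e^{ikt}/k^2 = n f_n(t). \]

This proves that \(f_n\) represents a polygon (constant speed on each side) with an
n-th degree symmetry and only \(n\) vertices (linear combination of \(n\) translates of
\(f_1\)). The symmetry

\[ f_n(t + 2\pi/n) = \zeta \cdot f_n(t) \]

leads to

\[ f_n'(t + 2\pi/n) = \zeta \cdot f_n'(t) \quad \text{and} \quad |f_n'(t + 2\pi/n)| = |f_n'(t)| \]

and since we know a priori that the velocity \(f_n'\) has only \(n\) jumps, it proves that \(f_n\)
is a constant speed parametrization.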
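
The polygon can also be produced directly from the closed form of \(f_1\), without summing any series. The Python sketch below is an illustration with invented function names; it assumes the closed form \(f_1(t) = t^2/2 - \pi t + \pi^2/3\) on \([0, 2\pi]\) from Section 1 and evaluates \(f_n(t) = (1/n) \sum_j \zeta^j f_1(t - 2\pi j/n)\). It then checks, for the pentagon, that the values at \(t = 2\pi j/n\) are rotations of \(f_n(0)\) and that the curve is affine along a side.

```python
import cmath
import math


def f1(t):
    """Closed form of the basic series, extended 2*pi-periodically."""
    t = t % (2.0 * math.pi)
    return t * t / 2.0 - math.pi * t + math.pi ** 2 / 3.0


def fn(t, n):
    """f_n(t) = (1/n) * sum_j zeta**j * f1(t - 2*pi*j/n), with zeta = exp(2*pi*i/n)."""
    zeta = cmath.exp(2j * math.pi / n)
    return sum(zeta ** j * f1(t - 2.0 * math.pi * j / n) for j in range(n)) / n


n = 5
zeta = cmath.exp(2j * math.pi / n)
radius = fn(0.0, n)
vertices = [fn(2.0 * math.pi * j / n, n) for j in range(n)]
midpoint = fn(math.pi / n, n)  # parameter halfway along the first side

print(radius)
print([abs(v - zeta ** j * radius) for j, v in enumerate(vertices)])
print(abs(midpoint - (vertices[0] + vertices[1]) / 2.0))
```

The printed differences should be at the level of rounding error, and the printed radius can be compared with the value of \(f_n(0)\) that the article derives from the Parseval formula.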

We shall now use the Parseval formula which we recall.

If \(f\) and \(g \in L^2(0, 2\pi)\) have respective Fourier coefficients \(a_k = c_k(f)\), \(b_k = c_k(g)\), then

\[ \sum_k \overline{a_k}\, b_k = (f \mid g) = \frac{1}{2\pi} \int_0^{2\pi} \overline{f(t)} \cdot g(t)\,dt. \]

In particular, for square summable \(2\pi\)-periodic functions \(f\)

\[ \sum_k |c_k(f)|^2 = \|f\|_2^2 = \frac{1}{2\pi} \int_0^{2\pi} |f(t)|^2\,dt. \]

Using this identity for \(f_n'\) having Fourier coefficients

\[ c_k(f_n') = i/k \ \text{ if } k \equiv 1\ (n) \text{ and } 0 \text{ otherwise} \]

we deduce

\[ f_n(0) = \sum_{k \equiv 1\ (n)} 1/k^2 = \|f_n'\|_2^2 = v_n^2. \]

The last equality holds since we know that \(|f_n'| = v_n\) is constant. This speed can
easily be evaluated by geometric considerations

\[ 2\pi v_n = L_n = 2n f_n(0) \cdot \sin \pi/n \]

hence

\[ v_n = f_n(0) \cdot \frac{\sin \pi/n}{\pi/n}. \]

We have obtained

\[ f_n(0) = v_n^2 = f_n^2(0) \left( \frac{\sin \pi/n}{\pi/n} \right)^2 \]

from which we deduce

\[ f_n(0) = \left( \frac{\pi/n}{\sin \pi/n} \right)^2. \]

We have proved the following result. 


Theorem. The Fourier series of the regular \(n\)-gon having the \(s_j = e^{2\pi i j/n}\) as vertices,
uniformly parametrized, is

\[ C_n f_n(t) = C_n \sum_{k \equiv 1 \bmod n} e^{ikt}/k^2 = C_n \sum_{l \in \mathbb{Z}} e^{i(1+ln)t}/(1+ln)^2 \]

with normalization constant \(C_n = \sin^2(\pi/n)/(\pi/n)^2\).


Corollary. We have

\[ \sum_{l \in \mathbb{Z}} 1/(1+ln)^2 = \left( \frac{\pi/n}{\sin \pi/n} \right)^2. \]

In particular for \(n = 2\), we infer

\[ \sum_{k \text{ odd} \ge 1} 1/k^2 = (1/2) \sum_{l \in \mathbb{Z}} 1/(1+2l)^2 = \pi^2/8. \]

Let us observe at this point that \(C_n < 1\) tends monotonically to 1 for \(n \to \infty\) and
\(f_n \to e^{it}\) (uniform parametrization of the circle) uniformly for \(n \to \infty\).


3. QUADRATIC SPACE V. The vector space \(V\) carries a quite natural quadratic
form. Here it is.


Definition. For \(f \in V\), we define \(Q(f) = i\pi (f' \mid f) = \pi \big( \tfrac{1}{i} f' \mid f \big)\).


Theorem. The function \(Q\) takes real values only and defines a non-degenerate real
quadratic form on \(V\). Moreover

1. \(Q(f) = \pi \sum k |c_k(f)|^2\),
2. When \(f = \partial\Pi\) is the (positively oriented) boundary of a polygonal piece
\(\Pi \subset \mathbb{C}\), \(Q(f) = S = \mathrm{Area}(\Pi)\).

Proof: 1 results from the Parseval formula since the Fourier coefficients of \((1/i) f'\)
are the \(k c_k(f)\). To prove 2, we use Stokes’ theorem

\[ Q(f) = i\pi (f' \mid f) = \frac{i\pi}{2\pi} \int_0^{2\pi} \overline{f'(t)}\, f(t)\,dt = \frac{i}{2} \oint_{\partial\Pi} z\,d\bar z \]

\[ = \frac{i}{2} \iint_\Pi dz \wedge d\bar z = \frac{i}{2} \iint_\Pi (dx + i\,dy) \wedge (dx - i\,dy) = \iint_\Pi dx\,dy = S. \]

Corollary. \(\sum_{k \equiv 1 \bmod n} 1/k^3 = Q(f_n)/\pi = \big( (\pi/n)/\sin(\pi/n) \big)^3 \cos \pi/n\).


Proof: The area of the regular \(n\)-gon inscribed in a circle of radius \(f_n(0)\) is indeed
\(S = n \cdot f_n^2(0) \cdot \cos \pi/n \cdot \sin \pi/n\) as is easily seen.
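
The cubic sum in this corollary can be checked numerically. The Python sketch below is an illustration with invented names; it sums \(1/k^3\) symmetrically over \(k = 1 + ln\) with \(|l| \le 2000\) (a cutoff chosen here, and \(n \ge 2\) is assumed so that no term has \(k = 0\)) and compares the result with \(\big((\pi/n)/\sin(\pi/n)\big)^3 \cos(\pi/n)\).

```python
import math


def cubic_sum(n, terms=2000):
    """Symmetric partial sum of 1/k**3 over k = 1 + l*n with |l| <= terms (n >= 2)."""
    return sum(1.0 / (1 + l * n) ** 3 for l in range(-terms, terms + 1))


def cubic_closed_form(n):
    """((pi/n) / sin(pi/n))**3 * cos(pi/n)."""
    x = math.pi / n
    return (x / math.sin(x)) ** 3 * math.cos(x)


for n in (2, 3, 4, 5):
    print(n, cubic_sum(n), cubic_closed_form(n))
```

For \(n = 2\) both sides vanish, since the terms for \(k\) and \(-k\) cancel and \(\cos(\pi/2) = 0\); for \(n = 4\) the common value is \(\pi^3/32\).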


4. EXAMPLES. Fix the integer \(n > 1\) and choose another integer \(1 \le a < n\).
Then

\[ \sum_{k \equiv a \bmod n} e^{ikt}/k^2 = (1/n) \sum_{0 \le j < n} \zeta^{aj} f_1(t - j \cdot 2\pi/n) \]

is a star-shaped polygon with \(n/(n, a)\) sides (with \(n\) sides when \(a\) is relatively
prime to \(n\)). The first side links \(s_0 = 1\) to \(s_1 = \zeta^a\), the second links \(\zeta^a\) to \(\zeta^{2a}\), and so on.
This parametrization \(f\) has an nth-order symmetry of the type

\[ f(t + 2\pi/n) = \zeta^a \cdot f(t). \]

Here is an example treated with MATHEMATICA.

f[t_] := Sum[Exp[(2 + 5 k) I t]/(2 + 5 k)^2, {k, -4, 4}]
u[t_] := Re[f[t]]
v[t_] := Im[f[t]]
ParametricPlot[{u[t], v[t]}, {t, 1, 1 + 2 Pi}, AspectRatio -> Automatic]


Figure 3


Another type of stars (without double point) is obtained as follows. For two
constants \(A\) and \(B\) consider

\[ A \sum_{k \equiv 1 \bmod 2n} e^{ikt}/k^2 + B \sum_{k \equiv n+1 \bmod 2n} e^{ikt}/k^2. \]
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This is a star with n vertices. The choice 


A=n-—Vvn ; B=n+vn 
leads to the classical stars where the sides \([s_j, s_{j+1}]\) and \([s_{j+3}, s_{j+4}]\) are on the
same line when \(j\) is even. We give two more examples with MATHEMATICA.


f[t_] := Sum[Exp[(1 + 20 k) I t]/(1 + 20 k)^2, {k, -2, 2}]
g[t_] := Sum[Exp[(11 + 20 k) I t]/(11 + 20 k)^2, {k, -2, 2}]
h[t_] := f[t] - 6 g[t]
u[t_] := Re[h[t]]
v[t_] := Im[h[t]]
ParametricPlot[{u[t], v[t]}, {t, 1, 1 + 2 Pi}, AspectRatio -> Automatic]


Figure 4


f[t_] := Sum[Exp[(1 + 16 k) I t]/(1 + 16 k)^2, {k, -4, 4}]
g[t_] := Sum[Exp[(9 + 16 k) I t]/(9 + 16 k)^2, {k, -4, 4}]
h[t_] := f[t] + 3 g[t]
u[t_] := Re[h[t]]
v[t_] := Im[h[t]]
ParametricPlot[{u[t], v[t]}, {t, 1, 1 + 2 Pi}, AspectRatio -> Automatic]


Figure 5
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5. CONCLUDING COMMENTS. The interested reader may continue by pro- 
gramming the Fourier series of a cross (degenerate star with four sides) and check 
that \(Q(f) = 0\) for these functions.

The formula \(Q(f) = \pi \sum k |c_k(f)|^2\) is reminiscent of the Sylvester decomposition
of the quadratic form \(Q\). Indeed, \(Q\) is defined on the Sobolev space

\[ H^{1/2} = \{ (c_k)_{k \in \mathbb{Z}} : \sum |k|\,|c_k|^2 < \infty \} \]

which decomposes as a direct sum \(H^{1/2} = H_+ \oplus \mathbb{C} \oplus H_-\) with

\[ H_+ = \{ (c_k) : c_k = 0 \text{ for } k \le 0 \}, \quad H_- \text{ similarly defined.} \]

With respect to this decomposition, \(Q = Q_+ \oplus 0 \oplus -Q_-\), where \(Q_\pm\) are positive
non degenerate on \(H_\pm\).

In Fig. 3, \(Q(f)\) counts twice the area of the portion of the star which is the
inner pentagon. In general, when \(f\) is self intersecting, one could introduce its
algebraic area as being \(Q(f)\). Geometrically, in the computation of this area, each
connected component of \(\mathbb{C} - \mathrm{Image}(f)\) is affected by a rational integer, the index
of \(f\) with respect to the points in this component. Observe that the method of
determining \(f(0)\) also applies to stars since we still have a constant speed
parametrization:

\[ f(0) = \|f'\|_2^2 = v^2 = (\text{length}/2\pi)^2. \]
Several generalizations of the preceding considerations can be developed: 
• \(\mathcal{C}^1\) curves in \(\mathbb{C}\) which are piecewise quadratic,

• polygons in \(\mathbb{C}^n\) or \(\mathbb{R}^n\) (in particular in \(\mathbb{R}^3\)!).


Question. Replacing the circle group by the rotation group, is it possible to 
describe in a simple way the Platonic solids by means of spherical harmonics? 


Finally, let me thank R. S. Strichartz whose comments helped me to write the final version of this 
paper. 
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The Paradox of Nontransitive Dice 


Richard P. Savage, Jr. 


Suppose one takes three cubes and writes a number from 1 to 18 on each face of 
each cube using each number once and only once. In this way one constructs some 
unusual dice we denote by A, B, and C. Is it possible to arrange the numbers on 
the dice such that if the dice are rolled, the probability that A beats B is greater 
than 1/2, the probability that B beats C is greater than 1/2, yet the probability 
that C beats A is also greater than 1/2? The answer is yes. For example, put 18, 
9, 8, 7, 6, and 5 on A; 17, 16, 15, 4, 3, and 2 on B; and 14, 13, 12, 11, 10, and 1 on
C. It is easily checked that the probability that A beats B is 21/36, that B beats C 
is 21/36, and that C beats A is 25/36. This phenomenon of “nontransitive dice” 
was popularized in Martin Gardner’s column in Scientific American [7]. He gives 
an example due to Bradley Efron with four dice in which A beats B, B beats C, C 
beats D, and D beats A each with probability 2/3. 
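
The three probabilities quoted for the dice A, B, and C can be confirmed by enumerating all 36 pairs of faces. The short Python sketch below does this; the helper name `beats` is an invention for the illustration.

```python
from fractions import Fraction
from itertools import product


def beats(x, y):
    """Probability that a roll of die x shows a larger number than a roll of die y."""
    wins = sum(1 for a, b in product(x, y) if a > b)
    return Fraction(wins, len(x) * len(y))


A = [18, 9, 8, 7, 6, 5]
B = [17, 16, 15, 4, 3, 2]
C = [14, 13, 12, 11, 10, 1]

print(beats(A, B), beats(B, C), beats(C, A))
```

Since the eighteen numbers are distinct, ties cannot occur, and each probability is a count of winning pairs divided by 36.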

Nontransitive dice fall into the general category of nontransitivity paradoxes 
about which there has been considerable study. The best known nontransitivity 
paradox is the voting paradox the study of which was begun by Condorcet [4]. In 
the case of three candidates this is the observation that a majority of voters may 
prefer candidate A to candidate B, B to C, and C to A. For a history of the 
voting paradox see Black [1]. Other examples of nontransitivity are discussed in [2], 
[5], and [8]. 

Let us consider the following game. A casino has constructed three dice with 
the numbers arranged on them. A patron chooses whichever die he wishes and the 
house then picks one of the remaining dice (the one which beats the patron’s 
choice) and they roll the dice with a wager on the outcome. Assuming that the 
players will play to their own advantage, the goal that the casino has in arranging 
the numbers originally must be to make the smallest of the three probabilities as 
large as possible. In the following discussion the fact that a standard die has 6 
faces is not important to the analysis, thus we consider the problem for n-faced 
dice. 

Let X be a random variable taking as values the numbers on die A each with 
probability 1/n. Define random variables Y and Z for dice B and C similarly. 
Clearly X, Y, and Z are independent. It was noted by Steinhaus and Trybula [9], 
[11] that if X, Y, and Z are random variables it is possible that P(X > Y), 
P(Y > Z), and P(Z > X) can all exceed 1/2, a result which is known as the 
Steinhaus-Trybula paradox. Further, they announced that if X, Y, and Z are 
independent then at least one of the probabilities is no more than \((\sqrt{5} - 1)/2\).
(The golden ratio again!) Consequently we have the following result.


Theorem 1. Suppose numbers \(1, 2, \ldots, 3n\) are arranged on n-sided dice A, B, and C.
Then at least one of the probabilities (A beats B, B beats C, and C beats A) is less
than \((\sqrt{5} - 1)/2\).




A proof of this result in the present context is included in this paper. 

More generally suppose we have \(m\) independent random variables (or \(m\)
n-sided dice). Let \(p_{m-1}\) be given by the unique solution of the system
\(p_1 = p_2(1 - p_1) = \cdots = p_{m-1}(1 - p_{m-2}) = 1 - p_{m-1}\) where for each \(i\), \(1/4 < p_i < 3/4\). Then
Trybula [12] showed (see also [13]) that

\[ \min\{P(X_1 > X_2), P(X_2 > X_3), \ldots, P(X_m > X_1)\} \le p_{m-1}. \]

Further, the sequence \(\{p_m\}\) is increasing and converges to 3/4. For example, if
\(m = 4\) it is easy to determine that \(p_3 = 2/3\). Hence Efron’s example is optimal.
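
The bound can be computed numerically for any number of dice. The Python sketch below is one possible approach, not the method of the cited papers: it reads the system as \(p_{k+1} = p_1/(1 - p_k)\) together with \(p_{m-1} = 1 - p_1\), and finds \(p_1\) by bisection on the interval \((1/4, 1/2)\). The function names and the iteration count are choices made here.

```python
import math


def chain_last(p1, m):
    """Return p_{m-1} obtained from p_{k+1} = p1 / (1 - p_k), or inf if some p_k >= 1."""
    p = p1
    for _ in range(m - 2):
        if p >= 1.0:
            return math.inf
        p = p1 / (1.0 - p)
    return p


def trybula_bound(m, iterations=200):
    """Approximate p_{m-1} for m >= 3 by bisection on p_1 in (1/4, 1/2)."""
    lo, hi = 0.25, 0.5
    for _ in range(iterations):
        mid = (lo + hi) / 2.0
        # The closing equation of the system is p_{m-1} = 1 - p_1.
        if chain_last(mid, m) + mid - 1.0 < 0.0:
            lo = mid
        else:
            hi = mid
    return 1.0 - (lo + hi) / 2.0


for m in (3, 4, 5, 10, 50):
    print(m, trybula_bound(m))
```

Bisection is valid here because every \(p_k\) in the chain increases with \(p_1\), so the closing equation has a single root in the interval.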

Returning to the problem of three dice, let us now consider placing the numbers 
on the n-sided dice in a systematic way. One first puts one or more of the highest 
numbers on A. Then one puts the highest numbers that remain (one or more) on 
die B. Then one goes to die C (or perhaps back to A) and puts the highest 
numbers that remain on die C (or A). This continues until all the numbers are 
placed. Form a sequence $x_1, x_2, \ldots, x_{3k}$ by letting $x_1$ be the number of numbers
placed on die A initially, $x_2$ the number on die B initially, $x_3$ the number on die
C initially, $x_4$ the number on die A on the second go round and so on. Eventually
the process stops. In the example already given $x_1 = 1$, $x_2 = 3$, $x_3 = 5$, $x_4 = 5$,
$x_5 = 3$, and $x_6 = 1$. Note that the sequence has the following properties.

$$\sum_{i=1}^{k} x_{3i-2} = \sum_{i=1}^{k} x_{3i-1} = \sum_{i=1}^{k} x_{3i} = n. \tag{1}$$

Each $x_i$ is a nonnegative integer. (2)


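Also note that the probabilities in question are given by

$$\frac{1}{n^2}\sum_{i=1}^{k} x_{3i-2}\Bigl(\sum_{j \ge i} x_{3j-1}\Bigr), \quad \frac{1}{n^2}\sum_{i=1}^{k} x_{3i-1}\Bigl(\sum_{j \ge i} x_{3j}\Bigr), \quad \text{and} \quad \frac{1}{n^2}\sum_{i=1}^{k} x_{3i}\Bigl(\sum_{j \ge i} x_{3j+1}\Bigr), \tag{3}$$

where terms with index beyond 3k are taken to be zero.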
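The three probabilities can be checked against a direct count. The sketch below assumes the dealing rule described above (the largest remaining numbers go to A, then B, then C, in blocks of sizes $x_1, x_2, \ldots$) and takes the three probabilities to be those of A beating B, B beating C, and C beating A, each with denominator $n^2$. The function names are hypothetical.

```python
from itertools import product

def dice_from_blocks(blocks):
    """Deal 3n, 3n-1, ..., 1 to dice A, B, C, A, ... in blocks of sizes x_1, x_2, ..."""
    dice = ([], [], [])
    value = sum(blocks)
    for idx, size in enumerate(blocks):
        for _ in range(size):
            dice[idx % 3].append(value)
            value -= 1
    return dice

def win_counts_formula(blocks):
    """Numerators of the three probabilities: A beats B, B beats C, C beats A."""
    x = list(blocks) + [0] * (-len(blocks) % 3)  # pad to a multiple of 3
    k = len(x) // 3
    a, b, c = x[0::3], x[1::3], x[2::3]
    ab = sum(a[i] * sum(b[i:]) for i in range(k))
    bc = sum(b[i] * sum(c[i:]) for i in range(k))
    ca = sum(c[i] * sum(a[i + 1:]) for i in range(k))
    return ab, bc, ca

def win_counts_brute(blocks):
    """The same numerators, counted face against face."""
    A, B, C = dice_from_blocks(blocks)
    beats = lambda P, Q: sum(p > q for p, q in product(P, Q))
    return beats(A, B), beats(B, C), beats(C, A)
```

For the example sequence 1, 3, 5, 5, 3, 1 both functions give the numerators 21, 21, and 25 over 36.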


Now allow the numbers in the sequence $\{x_i\}$ to be nonnegative real numbers
instead of just nonnegative integers. The formulas for the probabilities are then
continuous functions of the variables and therefore if we define F by letting F be
the minimum of the three probabilities, F is continuous. As F is defined on a
compact set, F must achieve a maximum. If $x_j \ne 0$ but $x_m = 0$ whenever m > j
we will say that the length of the sequence is j. Also let a denote the number
$(\sqrt{5} - 1)/2$.


The proof of Theorem 1 may be divided into three steps. 


Step 1. We show that if $\{x_i\}$ is a sequence of length 6 or more then there is a
sequence $\{z_i\}$ of the same length in which one of the terms is zero and for which
each of the probabilities in (3) is at least as large as for $\{x_i\}$.


Step 2. We show that from a sequence in which one term is zero a new sequence 
can be constructed which is shorter and for which the minimum of the probabilities 
is at least as large as the minimum for the original sequence. 


Step 3. Steps 1 and 2 combine to show that to maximize the probabilities we need
only consider sequences of length 5 or less. This maximum is calculated in this step
by showing that $x_1 = a^2 n$, $x_2 = an$, $x_3 = n$, $x_4 = an$, and $x_5 = a^2 n$ maximizes F.


Remark. The proof of the theorem for m n-sided dice is analogous. For example,
in Step 3 the sequences to be considered are of length 2m - 1.


Proof of Theorem 1. 


Proof of Step 1: Note that by relabelling the dice if necessary we may assume that
$x_1 \ne 0$ and $x_2 \ne 0$. If some $x_m = 0$ where $m \ge 3$ and m is less than the length of
the sequence we may proceed to Step 2. If not, we will find a new sequence for
which some term is zero. To get the desired sequence $\{z_i\}$ we will only change the
first 6 terms of $\{x_i\}$. Therefore if $i \ge 7$ define $z_i = x_i$. Define $z_3 = \lambda x_3$ where
$\lambda > 0$. By (1) this forces $z_6 = x_6 + (1 - \lambda)x_3$. We will choose $z_4$ such that the
third of the probabilities in (3) is unchanged. It is easy to check that this forces
$z_4 = (1/\lambda)x_4$. Then by (1) again we must have $z_1 = x_1 + (1 - (1/\lambda))x_4$. We would
also like the first probability in (3) to be unchanged. A short computation shows
that this forces $z_2 = \lambda x_2$ and therefore $z_5 = x_5 + (1 - \lambda)x_2$.

Now comparing the second of the probabilities for $\{z_i\}$ with that for $\{x_i\}$ we get

$$\sum_{i} z_{3i-1}\Bigl(\sum_{j \ge i} z_{3j}\Bigr) - \sum_{i} x_{3i-1}\Bigl(\sum_{j \ge i} x_{3j}\Bigr)
= (z_2 - x_2)n + z_5(n - z_3) - x_5(n - x_3)
= (1 - \lambda)x_3(x_5 - \lambda x_2). \tag{*}$$

First suppose $x_2 \ge x_5$. The expression (*) is then increasing in $\lambda$ for $\lambda > 1$.
Hence by choosing $\lambda$ appropriately we can form a sequence of nonnegative reals
with either $z_5 = 0$ or $z_6 = 0$ and for which $F(\{z_i\}) \ge F(\{x_i\})$. On the other hand, if
$x_2 < x_5$ the expression (*) is decreasing for $\lambda < 1$. Choose $\lambda$ such that $z_1 = 0$.
Again we have $F(\{z_i\}) \ge F(\{x_i\})$.


Proof of Step 2: If $x_2 < x_5$ so that $z_1 = 0$ it is trivial that the sequence $\{y_i\}$ defined
by $y_i = z_{i+1}$ is shorter and has the same probabilities. In the other case, we have
$z_m = 0$ where m > 3. If the length of the original sequence was 6 and m = 6 we
have found our desired sequence. If not, define a new sequence $\{y_j\}$ by $y_j = z_j$ if
$j < m - 2$, $y_{m-2} = z_{m-2} + z_{m+1}$, $y_{m-1} = z_{m-1} + z_{m+2}$, $y_m = z_m + z_{m+3}$, and
$y_j = z_{j+3}$ if j > m. It can be checked that two of the three sums in the definition
of F are the same for $\{z_i\}$ as for $\{y_i\}$. The sum involving
$y_{m-2}y_{m-1} = (z_{m-2} + z_{m+1})(z_{m-1} + z_{m+2})$ has the additional term $z_{m+1}z_{m-1}$ so is at least as
large as the sum for $\{z_i\}$. Hence in this case we get a sequence $\{y_i\}$ shorter than
$\{x_i\}$ with $F(\{y_i\}) \ge F(\{x_i\})$. As an illustration of this idea let $\{x_i\}$ be a sequence
with $x_3 = 0$ and form the sequence $x_1 + x_4, x_2 + x_5, x_6, x_7, \ldots$. Clearly the
probability that A beats B has increased while the other probabilities are unchanged.


Proof of Step 3: The three expressions under consideration are then
$[x_1 n + (n - x_1)(n - x_2)]/n^2$, $x_2/n$, and $(n - x_1)/n$. The first expression is increasing as a
function of $x_1$ while the third is decreasing. The first expression is decreasing as a
function of $x_2$ while the second is increasing. Thus for a sequence which maximizes
F the three expressions must be equal, for if they were not we could increase
the smallest (or two smaller) while decreasing the larger while preserving the order
of the three expressions. Setting them equal we immediately get $x_2 = n - x_1$ and
completing the algebra we have

$$x_1 = \frac{3 - \sqrt{5}}{2}\,n = a^2 n \quad \text{and} \quad x_2 = an.$$

The formulas for $x_4$ and $x_5$ follow immediately.
It is immediate that all three probabilities yield a for this sequence, completing
the proof.


It is interesting to ask how close we can come to achieving the upper bound on
the probabilities in Theorem 1. This question does not appear to have been
considered in the literature; however, see [6], [10], and [14] for other results
concerning nontransitive dice. Of course, since a is irrational at least one of the
probabilities must be strictly less than a. In terms of rational numbers, the result
could be stated as saying that at least one of the probabilities does not exceed
$\lfloor an^2 \rfloor / n^2$ where $\lfloor\ \rfloor$ denotes the greatest integer function. Recall that if $F_k$
denotes the kth Fibonacci number then $\lim_{k \to \infty} F_k/F_{k+1} = a$. The proof of
Theorem 1 suggests that if n is a Fibonacci number then we should be able to
come close to the upper bound. The following theorem shows that in this case it is
possible to arrange the numbers so that all of the probabilities are at least
$\lfloor an^2 \rfloor / n^2$. Hence, in this case we can achieve the (rational) upper bound.


Theorem 2. If n is a Fibonacci number it is always possible to arrange the numbers
1,2,...,3n on three n-sided dice such that each of the probabilities is greater than
$(1 - 1/n^2)a$. Each of the probabilities is then at least $\lfloor an^2 \rfloor / n^2$.


Proof: Let $n = F_k$. Let $x_1 = F_{k-2}$, $x_2 = F_{k-1}$, $x_3 = F_k$, $x_4 = F_{k-1}$, $x_5 = F_{k-2}$,
and $x_i = 0$ if $i \ge 6$. The three probabilities are
$(F_{k-2}F_k + F_{k-1}F_{k-2})/F_k^2$, $F_{k-1}/F_k$, and $F_{k-1}/F_k$. With the aid of the easily checked identity
$F_{k+1}F_{k-1} - F_k^2 = (-1)^k$ one can check that the sequence $F_{2k}/F_{2k+1}$ is increasing,
that $F_{2k-1}/F_{2k}$ is decreasing, and that the sequences have a common limit of a.
Then $F_{k-1}/F_k < a$ if k is odd and $F_{k-1}/F_k > a$ if k is even. Now considering the
three probabilities it is trivial that $F_{k-1}/F_k > (1 - 1/F_k^2)a$ if k is even and
follows from easy manipulations if k is odd. Now

$$\frac{F_{k-2}F_k + F_{k-1}F_{k-2}}{F_k^2} = \frac{F_k^2 - F_{k-1}^2}{F_k^2}$$

and

$$\frac{F_k^2 - F_{k-1}^2}{F_k^2} > \Bigl(1 - \frac{1}{F_k^2}\Bigr)a \quad \text{if and only if} \quad \frac{F_k^2 - F_{k-1}^2}{F_k^2 - 1} > a.$$

The result now follows from the identity

$$\frac{F_k^2 - F_{k-1}^2}{F_k^2 - 1} =
\begin{cases}
F_{k+1}/F_{k+2} & \text{if k is even} \\
F_{k-2}/F_{k-1} & \text{if k is odd.}
\end{cases}$$

The last claim follows from noting that $a - (1 - 1/n^2)a = a/n^2 < 1/n^2$. Thus,
interpreting the result in terms of rational numbers we see that each probability is
at least $\lfloor an^2 \rfloor / n^2$.
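The Fibonacci arrangement of Theorem 2 is easy to test with exact integer arithmetic. The sketch below assumes the indexing $F_1 = F_2 = 1$ and uses the three numerators stated in the proof; the helper `floor_a_n2` (a hypothetical name) computes $\lfloor an^2 \rfloor$ as `(isqrt(5*n**4) - n*n) // 2`, which avoids floating point.

```python
from math import isqrt

def fibonacci_dice_numerators(k):
    """Return n = F_k and the three numerators (over n^2) for the blocks
    F_{k-2}, F_{k-1}, F_k, F_{k-1}, F_{k-2}."""
    F = [0, 1]
    while len(F) <= k:
        F.append(F[-1] + F[-2])
    n = F[k]
    a_beats_b = F[k - 2] * F[k] + F[k - 1] * F[k - 2]
    b_beats_c = F[k - 1] * F[k]
    c_beats_a = F[k - 1] * F[k]
    return n, (a_beats_b, b_beats_c, c_beats_a)

def floor_a_n2(n):
    """floor(a * n^2) for a = (sqrt(5) - 1)/2, in exact integer arithmetic."""
    return (isqrt(5 * n ** 4) - n * n) // 2
```

For instance, k = 5 gives n = 5 with numerators 16, 15, 15, and $\lfloor 25a \rfloor = 15$, so the smallest numerator meets the rational upper bound.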


In general, we cannot always achieve the upper bound of Theorem 1. In what
follows we simplify the notation by denoting $x_1, x_2, x_3$ by x, y, z and we will let
$x_4 = n - x$, $x_5 = n - y$, and $x_6 = n - z$. Motivated by the proofs of Theorems 1
and 2, in order for the probabilities to be close to the upper bound of Theorem 1
we would expect y to be close to an. Try choosing y to be the integer in
$(an + \delta - 1, an + \delta)$ where $\delta$ is a number between 0 and 1 which will be
specified later. Then set z = n and x = n - y. The numerators in the three
probabilities are then $n^2 - y^2$, ny, and ny. Now $n^2 - y^2$ is smallest at the right
hand endpoint while ny is smallest at the left. Choose $\delta$ such that
$n^2 - (an + \delta)^2 = n(an + \delta - 1)$. This simplifies to $n = (2a + 1)n\delta + \delta^2$ so $\delta < 1/(2a + 1)$. It
is now easily checked that the smallest of the numerators on the interval is

$$n^2 - (an + \delta)^2 > an^2 - \frac{2a}{2a + 1}\,n - \frac{1}{(2a + 1)^2}.$$

Hence we have shown that it is always possible to arrange the numbers on the dice
such that each of the probabilities exceeds

$$a - \frac{2a}{2a + 1}\,\frac{1}{n} - \frac{1}{(2a + 1)^2}\,\frac{1}{n^2}.$$

For 6-sided dice this construction yields x = 2, y = 4, and z = 6 for which the
probabilities can be calculated to be 20/36, 24/36, and 24/36. This is not as good
as the example given at the beginning. In fact, we can obtain a stronger result if
more freedom is allowed in specifying x and z.


Theorem 3. It is always possible to arrange the numbers 1,2,...,3n on the dice such
that each of the probabilities exceeds

$$a - \frac{2a^2}{2a + 1}\,\frac{1}{n} + \frac{a^2}{2a + 1}\,\frac{1}{n^2} - \frac{1}{2a + 1}\,\frac{1}{n^3}.$$


Proof: Let $(an + \delta - 1, an + \delta)$ be an interval containing an where $\delta$ is to be
determined. Let y be the unique integer in this interval. Then we will choose x
and z in one of the following three ways.


(1) x = n - y and z = n
(2) x = n - y - 1 and z = n - 1
(3) x = n - y - 2 and z = n - 1


The numerators in the three probabilities are then


(1) $n^2 - y^2$, $ny$, $ny$
(2) $n^2 - y^2 - y$, $ny + (n - y)$, $ny + (n - y) - 1$
(3) $n^2 - y^2 - 2y$, $ny + (n - y)$, $ny + (2n - y) - 2$.


First set $ny = n^2 - y^2 - y$. Writing y = an - t and simplifying we get
$an = (2a + 1)nt - t^2 + t$. Clearly this equation has a solution $t_1$ with $0 < t_1 < 1$. Then
$an < (2a + 1)nt_1 + t_1$ and $an > (2a + 1)nt_1$ from which we obtain

$$\frac{a}{(2a + 1) + \frac{1}{n}} < t_1 < \frac{a}{2a + 1}.$$

If y is in the interval

$$I_1 = \Bigl[\,an - \frac{a}{2a + 1},\ an + \delta\Bigr)$$

choose x and z as in (1). Note that to the left of an the smallest of these
numerators is ny and in $I_1$,

$$ny > n\Bigl(an - \frac{a}{2a + 1}\Bigr) = an^2 - \frac{an}{2a + 1}$$

which exceeds the bound in the statement of the theorem.

Now set $ny + (n - y) - 1 = n^2 - y^2 - 2y$ and write y = an - t. This yields
$(a + 1)n - 1 = (2a + 1)nt + t - t^2$. Again, there is a solution with $0 < t_2 < 1$. In
the same way as before we find

$$\frac{a + 1 - \frac{1}{n}}{(2a + 1) + \frac{1}{n}} < t_2 < \frac{a + 1 - \frac{1}{n}}{2a + 1}.$$

If y is in the interval

$$I_2 = \Bigl[\,an - \frac{a + 1 - \frac{1}{n}}{2a + 1},\ an - \frac{a}{2a + 1}\Bigr)$$

choose x and z as in (2). We see from the definition of $t_1$ that

$$n^2 - y^2 - y > an^2 - \frac{an}{2a + 1}$$

in this interval, and in $I_2$

$$ny + (n - y) - 1 > \Bigl[an - \frac{a + 1 - \frac{1}{n}}{2a + 1}\Bigr]n + \Bigl[n - an + \frac{a + 1 - \frac{1}{n}}{2a + 1}\Bigr] - 1
= an^2 - \frac{2a^2}{2a + 1}\,n + \frac{a^2}{2a + 1} - \frac{1}{2a + 1}\,\frac{1}{n}$$

which gives the bound in the theorem.
Finally if y is in the interval

$$I_3 = \Bigl(an + \delta - 1,\ an - \frac{a + 1 - \frac{1}{n}}{2a + 1}\Bigr)$$

choose x and z as in (3). Now from the definition of $t_2$, $n^2 - y^2 - 2y$ exceeds the
bound of the theorem in $I_3$.

In the part of $I_1$ to the right of an the smallest numerator in (1) is $n^2 - y^2$ and
$n^2 - y^2 > n^2 - (an + \delta)^2$ on $I_1$. Now consider the numerator $ny + (n - y)$ from
(3). This exceeds $n(an + \delta - 1) + (n - (an + \delta - 1))$ on $I_3$. Define $\delta_1$ to be the
number between 0 and 1 for which these expressions are equal, that is, for which
$an - 1 = (2a + 1)n\delta_1 - \delta_1 + \delta_1^2$. Making estimates as before we find

$$\frac{a - \frac{1}{n}}{2a + 1} < \delta_1 < \frac{a - \frac{1}{n}}{(2a + 1) - \frac{1}{n}}.$$

Now define

$$\delta = \frac{a - \frac{1}{n}}{2a + 1}.$$

It can be routinely checked that $n^2 - y^2$ exceeds the bound in the theorem at the
right hand endpoint as does $ny + (n - y)$ at the left. This completes the proof.


In the following table only the numerators are given for the various probabilities.
The denominators in each case are $n^2$. The upper bound in column 2 is the
upper bound on the smallest of the probabilities given by Theorem 1 truncated to
two places to the right of the decimal. The lower bound is the bound of Theorem
3. The values of x, y and z are those from the proof of Theorem 3. With x, y and
z as given, the column “highest” gives the numerator in the highest of the three
probabilities and the column “lowest” gives the smallest.


n Upper bound Lower bound x y z Highest Lowest
3 5.56 4.55 0 1 2 6 5 
4 9.88 8.58 1 2 3 10 9 
5 15.45 13.82 2 3 5 16 15 
6 22.24 20.29 1 3 5 25 21 
7 30.28 27.99 2 4 6 31 29 
8 39.55 36.93 3 5 8 40 39 
9 50.06 47.10 3 5 8 51 48 
10 61.80 58.51 4 6 10 64 60 
11 74.78 71.15 4 7 11 77 72 
12 88.99 85.03 4 7 11 89 88 
13 104.44 100.14 5 8 13 105 104 
14 121.13 116.49 5 8 13 124 117 
15 139.05 134.07 6 9 15 144 135
16 158.21 152.89 6 10 16 160 156 
17 178.61 172.94 6 10 16 179 176 
18 200.24 194.23 7 11 18 203 198 
19 223.11 216.76 6 11 18 234 217 
20 247.21 240.52 7 12 19 248 244 
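The x, y, z columns of the table can be regenerated from the proof of Theorem 3. The sketch below is an illustration under stated assumptions: it takes $\delta = (a - 1/n)/(2a + 1)$, takes the left endpoints of $I_1$ and $I_2$ to be $an - a/(2a+1)$ and $an - (a + 1 - 1/n)/(2a+1)$, and uses floating point, which is adequate for small n only. The function names are hypothetical.

```python
from math import sqrt, floor

A = (sqrt(5) - 1) / 2  # the constant a of the text

def theorem3_arrangement(n):
    """Return (x, y, z) and the three numerators chosen as in the proof of Theorem 3."""
    delta = (A - 1 / n) / (2 * A + 1)
    y = floor(A * n + delta)                          # integer in (an + delta - 1, an + delta)
    left_i1 = A * n - A / (2 * A + 1)                 # assumed left end of I_1
    left_i2 = A * n - (A + 1 - 1 / n) / (2 * A + 1)   # assumed left end of I_2
    if y >= left_i1:        # case (1)
        x, z = n - y, n
    elif y >= left_i2:      # case (2)
        x, z = n - y - 1, n - 1
    else:                   # case (3)
        x, z = n - y - 2, n - 1
    # Blocks x_1..x_6 = x, y, z, n-x, n-y, n-z; numerators from formula (3).
    ab = x * n + (n - x) * (n - y)
    bc = y * n + (n - y) * (n - z)
    ca = z * (n - x)
    return (x, y, z), (ab, bc, ca)

def theorem3_lower_bound(n):
    """Numerator form (bound times n^2) of the lower bound in Theorem 3."""
    c = 2 * A + 1
    return A * n * n - 2 * A * A * n / c + A * A / c - 1 / (c * n)
```

For n = 6 this gives (x, y, z) = (1, 3, 5) with numerators 21, 21, 25, and for n = 19 it gives (6, 11, 18) with numerators 218, 217, 234.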
Cookeville, TN 38505


Hardy and Pólya


Enclosed are contradictory quotations from two well-known mathematicians. They 
may be of some interest to your readers. 


Hardy believed that, in mathematics, the Cambridge examinations
called “triposes” were nonsensical. As a demonstration, he persuaded
George Pólya (who, if anything, was a master of computation and
manipulation in classical analysis) to take the mathematics tripos without
previous coaching. Pólya supposedly failed miserably.—Stanislaw M.
Ulam, Adventures of a Mathematician (Scribners, 1970), page 60.


I wrote to Professor Pólya to ask him about this reference but never received a
reply. He died a few years later. Then I came across the following note. 


While Pólya was at Cambridge, Hardy was in the midst of his
campaign to reform the mathematical Tripos and asked Pólya to take the
exam unofficially. Hardy expected Pólya’s poor showing would demonstrate
that most of the questions on the Tripos were irrelevant to
“modern continental mathematics”. Unfortunately for Hardy’s plan,
Pólya’s performance was the best on the examination, and he would have
been named Senior Wrangler if he had been a student.—Halsey Royden,
“A History of Mathematics at Stanford,” A Century of Mathematics in
America, vol. 2 (American Mathematical Society, 1989), pages 250-251.


Perhaps some of the readers of the Monthly could clear up this apparent
discrepancy. Since Professor Royden was Pólya’s colleague, and Pólya was Pólya, I
suspect the second quotation is correct. Incidentally, Hardy was fourth in the
mathematical tripos of 1898.


Michael Stueben 

4651 Brentleigh Court 
Annandale, VA 22003 
mstueben@tihsst.vak12ed.edu
Squares Expressible as Sum of
Consecutive Squares 


Laurent Beeckmans 


1. INTRODUCTION. The following problem was proposed in 1875 by Lucas [2]:
when does a square pyramid of cannon-balls contain a number of cannon-balls which
is a perfect square? This problem is clearly equivalent to solving the Diophantine
equation $1^2 + \cdots + k^2 = \square$. Lucas claimed that the only solutions were k = 1
and k = 24, but this was not proved correctly until 1918 by Watson [5]. A more
general problem is to determine the set S of values of k for which there exists a
square equal to the sum of k consecutive squares. For example, $600 \in S$ since

$$25^2 + 26^2 + \cdots + 623^2 + 624^2 = 9010^2.$$

Little has been published about this problem but a good account of it can be found
in [6], where it is shown that S is infinite and has density zero. Moreover, [6]
contains a table giving all the elements of S less than 73.

In the present paper, we extend this table to include all the elements of S less 
than 1000. But our main purpose is first to give necessary conditions that k must 
satisfy in order to belong to S. Then we will describe a general method to find the 
squares which are sums of k consecutive squares, for any given k, this method 
being an application of the theory of Pell’s equation. Furthermore, we will show 
that if k belongs to S, then there exist infinitely many squares that can be written 
as the sum of k consecutive squares if and only if k itself is not a square. 

As usual, we will write $p^\alpha \| k$ iff $p^\alpha \mid k$ but $p^{\alpha+1} \nmid k$; $\alpha$ always will denote a
strictly positive integer.


2. SOME NECESSARY CONDITIONS ON k 
Lemma 0. If $x^2 + y^2 \equiv 0 \pmod{2^{2n}}$, then $2^n \mid x$ and $2^n \mid y$.


Proof: The assertion is trivial for n = 1. Suppose it is true for n - 1. Then
$x^2 + y^2 \equiv 0 \pmod{2^{2n}}$ implies $x^2 + y^2 \equiv 0 \pmod{2^{2n-2}}$. By the induction hypothesis,
$x = 2^{n-1}a$ and $y = 2^{n-1}b$, and so $2^{2n-2}a^2 + 2^{2n-2}b^2 \equiv 0 \pmod{2^{2n}}$. Dividing by
$2^{2n-2}$ we get $a^2 + b^2 \equiv 0 \pmod 4$ from which we deduce that a and b are even.
Thus $2^n \mid x$ and $2^n \mid y$.


Proposition. Let k be an element of S.
(1) If $2^\alpha \| k$, then $\alpha$ is odd.
(2) If $3^\alpha \| k$, then $\alpha$ is odd.
(3) If p is a prime, p > 3 and $p^\alpha \| k$ with $\alpha$ odd, then $p \equiv \pm 1 \pmod{12}$.
(4) If p is a prime, p > 3 and $p^\alpha \| k + 1$ with $p \equiv 3 \pmod 4$, then $\alpha$ is even.
(5) If $3^\alpha \| k + 1$, then $\alpha$ is odd.
(6) $k \not\equiv 3 \pmod 9$.
(7) $k \not\equiv 2^\alpha - 1$ or $2^\alpha \pmod{2^{\alpha+2}}$ for any $\alpha \ge 2$.
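The seven conditions are mechanical to test. The following sketch is one way to code them, assuming plain trial-division factoring (adequate for k < 1000); the function names are hypothetical. Passing the test is necessary for membership in S but not sufficient.

```python
def factorize(m):
    """Prime factorization of m >= 1 as {prime: exponent}, by trial division."""
    factors, d = {}, 2
    while d * d <= m:
        while m % d == 0:
            factors[d] = factors.get(d, 0) + 1
            m //= d
        d += 1
    if m > 1:
        factors[m] = factors.get(m, 0) + 1
    return factors

def passes_necessary_conditions(k):
    """True if k satisfies conditions (1)-(7) of the Proposition."""
    fk, fk1 = factorize(k), factorize(k + 1)
    if fk.get(2, 1) % 2 == 0:                        # (1)
        return False
    if fk.get(3, 1) % 2 == 0:                        # (2)
        return False
    for p, e in fk.items():                          # (3)
        if p > 3 and e % 2 == 1 and p % 12 not in (1, 11):
            return False
    for p, e in fk1.items():                         # (4)
        if p > 3 and p % 4 == 3 and e % 2 == 1:
            return False
    if fk1.get(3, 1) % 2 == 0:                       # (5)
        return False
    if k % 9 == 3:                                   # (6)
        return False
    alpha = 2
    while 2 ** alpha - 1 <= k:                       # (7); larger alpha cannot match
        if k % 2 ** (alpha + 2) in (2 ** alpha - 1, 2 ** alpha):
            return False
        alpha += 1
    return True
```

A usage example: `[k for k in range(1, 30) if passes_necessary_conditions(k)]` lists the small candidates that survive all seven conditions.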
Proof: Since $k \in S$, there exists $n \in N$ such that $\sum_{i=1}^{k} (n + i)^2 = \square$. This can be
rewritten in several different ways:

$$kn^2 + k(k + 1)n + \frac{k(k + 1)(2k + 1)}{6} = \square \tag{i}$$

$$kn(n + k + 1) + \frac{k(k + 1)(2k + 1)}{6} = \square \tag{ii}$$

$$9k(2n + k + 1)^2 + 3k(k^2 - 1) = \square. \tag{iii}$$

To prove (1), suppose that $k = 2^\alpha r$ with $\alpha$ even ($\alpha \ge 2$) and r odd. Then, by (iii),
$9 \cdot 2^\alpha r(2n + 2^\alpha r + 1)^2 + 3 \cdot 2^\alpha r(2^{2\alpha}r^2 - 1) = \square$ and since $2^\alpha$ is a square,
$9r(2n + 2^\alpha r + 1)^2 + 3r(2^{2\alpha}r^2 - 1) = \square$. Reducing modulo 4 we get $2 \equiv \square \pmod 4$, a
contradiction.

The proof of (2) is similar: suppose that $k = 3^\alpha r$ with $\alpha$ even ($\alpha \ge 2$) and $3 \nmid r$.
Then, by (iii), $3^{\alpha+2}r(2n + 3^\alpha r + 1)^2 + 3^{\alpha+1}r(3^{2\alpha}r^2 - 1) = \square$ and since $3^\alpha$ is a
square, $9r(2n + 3^\alpha r + 1)^2 + 3r(3^{2\alpha}r^2 - 1) = \square$. Reducing modulo 9 we get
$6r \equiv \square \pmod 9$, a contradiction since $3 \nmid r$.

(3) Let p be a prime > 3 such that $p^\alpha \| k$ with $\alpha$ odd. Since $\alpha$ is odd and
$k[9(2n + k + 1)^2 + 3(k^2 - 1)] = \square$, p must divide the second factor on the left
hand side, and so $9(2n + 1)^2 \equiv 3 \pmod p$. Since $p \ne 3$, $3(2n + 1)^2 \equiv 1 \pmod p$
and so the Legendre symbol (3/p) = 1, that is $p \equiv \pm 1 \pmod{12}$.

(4) Let p be a prime > 3 such that $p^\alpha \| k + 1$ with $p \equiv 3 \pmod 4$. Then
$k + 1 = ap^\alpha$ with $p \nmid a$, and so $(ap^\alpha - 1)[9(2n + ap^\alpha)^2 + 3ap^\alpha(ap^\alpha - 2)] = \square$
from which it follows that $-(6n)^2 \equiv \square \pmod p$. If $p \nmid n$, then (-1/p) = 1 (since
$p \nmid 6$), contradicting $p \equiv 3 \pmod 4$. Therefore $n = bp^\beta$ with $p \nmid b$ and $\beta > 0$ and
we have

$$(ap^\alpha - 1)\bigl[9(2bp^\beta + ap^\alpha)^2 + 3ap^\alpha(ap^\alpha - 2)\bigr] = \square. \tag{*}$$

If $\alpha < 2\beta$, then $(ap^\alpha - 1)p^\alpha[9(4b^2p^{2\beta-\alpha} + 4abp^\beta + a^2p^\alpha) + 3a(ap^\alpha - 2)] = \square$;
in this product, the only factor divisible by p is $p^\alpha$, and so $\alpha$ is even. If $\alpha = 2\beta$,
then $\alpha$ is obviously even. Suppose now $\alpha > 2\beta$; dividing (*) by $p^{2\beta}$, we get
$(ap^\alpha - 1)[9(2b + ap^{\alpha-\beta})^2 + 3ap^{\alpha-2\beta}(ap^\alpha - 2)] = \square$ and reducing modulo
p, $-(6b)^2 \equiv \square \pmod p$. Since $p \nmid 6b$, we have (-1/p) = 1, contradicting again
$p \equiv 3 \pmod 4$, and so $\alpha > 2\beta$ is impossible.

(5) Let $k + 1 = 3^\alpha a$ where $3 \nmid a$ and $\alpha \ge 2$. Then
$(3^\alpha a - 1)[(2n + 3^\alpha a)^2 + 3^{\alpha-1}a(3^\alpha a - 2)] = \square$, from which we deduce that $-(2n)^2 \equiv \square \pmod 3$, and so
$n = 3^\beta b$, where $3 \nmid b$ and $\beta > 0$. Therefore

$$(3^\alpha a - 1)\bigl[(2 \cdot 3^\beta b + 3^\alpha a)^2 + 3^{\alpha-1}a(3^\alpha a - 2)\bigr] = \square.$$

Putting $\alpha' = \alpha - 1$, the same argument as in (4) (comparing $\alpha'$ and $2\beta$) shows that
$\alpha'$ is even, and so $\alpha$ is odd.

(6) Starting from equality (ii), suppose that
$4kn(n + k + 1) + 2k(k + 1)(2k + 1)/3 = \square$ where k = 9u + 3. Then, reducing modulo 3, we get
$2(3u + 1)(9u + 4)(18u + 7) \equiv \square \pmod 3$, that is $2 \equiv \square \pmod 3$, a contradiction.

(7) Suppose that

$$9kn(n + k + 1) + \frac{3k(k + 1)(2k + 1)}{2} = \square$$

where $k = 2^\alpha + u2^{\alpha+2}$ ($\alpha \ge 2$). Reducing modulo $2^{\alpha+1}$, we get
$3 \cdot 2^{\alpha-1} \equiv \square \pmod{2^{\alpha+1}}$. If $\alpha$ is odd, then $\alpha - 1$ is even and $3 \equiv \square \pmod 4$. If $\alpha$ is even, then
division by $2^{\alpha-2}$ yields $6 \equiv \square \pmod 8$. In each case we have a contradiction. The
second case in (7), i.e. $k = 2^\alpha - 1 + u2^{\alpha+2}$, yields $3 \cdot 2^{\alpha-1} - 9n^2 \equiv \square \pmod{2^{\alpha+1}}$.
Suppose then that $m^2 + 9n^2 \equiv 3 \cdot 2^{\alpha-1} \pmod{2^{\alpha+1}}$, which implies
$m^2 + 9n^2 \equiv 0 \pmod{2^{\alpha-1}}$. If $\alpha$ is odd, Lemma 0 forces $m = 2^{(\alpha-1)/2}a$ and $n = 2^{(\alpha-1)/2}b$, and so
$2^{\alpha-1}a^2 + 9 \cdot 2^{\alpha-1}b^2 \equiv 3 \cdot 2^{\alpha-1} \pmod{2^{\alpha+1}}$, from which we deduce that $a^2 + b^2 \equiv 3 \pmod 4$,
a contradiction. If $\alpha = 2$, then $m^2 + n^2 \equiv 6 \pmod 8$, a contradiction. If $\alpha$
is even and $\alpha > 2$, we have also $m^2 + 9n^2 \equiv 0 \pmod{2^{\alpha-2}}$, and so $m = 2^{(\alpha-2)/2}a$
and $n = 2^{(\alpha-2)/2}b$ which gives $2^{\alpha-2}a^2 + 9 \cdot 2^{\alpha-2}b^2 \equiv 3 \cdot 2^{\alpha-1} \pmod{2^{\alpha+1}}$ and
$a^2 + b^2 \equiv 6 \pmod 8$, another contradiction.


3. THE ELEMENTS OF S LESS THAN 1000. The following table gives the values
of k less than 1000 for which there exists a sum of k consecutive squares equal to
a square and, for each of these values of k, the smallest value of n + 1 for which
$(n + 1)^2 + \cdots + (n + k)^2 = \square$.


9985 301 
802 38 
183 177 

71752 126031 
10123 322 
16806 131 
94707486 197 

1081 6665193 

7989 516232 

1663 1122 

1678 

8617 

23172625 

65 

12116 

413 

454 

167065 688013 
3 3480 
310097726 3 
2927 51958 
25 9977 


4. SOME COMMENTS. The 7 necessary conditions given in the proposition
exclude immediately 910 values of k < 1000. The 90 remaining values are the 88
ones given in the preceding table (for which there is a solution of the equation
$(n + 1)^2 + \cdots + (n + k)^2 = \square$), as well as two sporadic values k = 25 and k =
842 for which there is no solution, but which nevertheless satisfy the 7 conditions.
Thus for k < 1000, $k \ne 25$ and $k \ne 842$, the elements of S and the values of k
satisfying the 7 conditions coincide. Are there other simple necessary conditions to
be added to those seven in order to obtain a set of necessary and sufficient
conditions? I could not find quick arguments showing that 25 and 842 $\notin S$, but
this will be proved in the next section as an application of a method of resolution
of Pell’s equation $x^2 - ky^2 = a$ (see examples 2 and 4 below), the equation
$(n + 1)^2 + \cdots + (n + k)^2 = \square$ being a special case.
5. PELL’S EQUATION. Let

$$x^2 - ky^2 = a \tag{P}$$

where $k \in N$ and $a \in Z$. We will consider only the solutions of (P) for which x
and y are positive integers. Consider first the case where k is a square. Then
$(x - y\sqrt{k})(x + y\sqrt{k}) = a$, which has only a finite number of solutions: each pair
u, v of integers such that uv = a, u > v and $2\sqrt{k} \mid (u - v)$ yields a solution
$x = (u + v)/2$ and $y = (u - v)/2\sqrt{k}$.


Example 1. To find the squares which can be expressed as a sum of 49 consecutive
squares, we must solve the Diophantine equation $49n^2 + 2450n + 40425 = m^2$
(see equality (i)). Putting m = 7x, we get $n^2 + 50n + 825 = x^2$, which we reduce
to Pell’s equation $x^2 - y^2 = 200$ by putting y = n + 25. Its solutions are (x, y) =
(15, 5), (27, 23), (51, 49). Since n must be positive, we have y > 25 and so the only
solution is n = 24 and m = 357; this gives the only sum of 49 consecutive squares
which is a square, namely $25^2 + \cdots + 73^2 = 357^2$.


Example 2. To show that there is no square which is a sum of 25 consecutive
squares, we write (as in Example 1) the equation $25n^2 + 650n + 5525 = m^2$, and
we reduce it to $x^2 - y^2 = 52$ by putting m = 5x and y = n + 13. Then it is easy
to check that the latter equation has no solution with $n \ge 0$.
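The factor-pair argument used in Examples 1 and 2 can be sketched in a few lines. This is an illustration for the reduced equation $x^2 - y^2 = a$ with a > 0 only (the case k = 1 after the substitutions above); the function name is hypothetical.

```python
def difference_of_squares_solutions(a):
    """All positive-integer (x, y) with x^2 - y^2 = a, for a > 0."""
    solutions = []
    v = 1
    while v * v < a:
        if a % v == 0:
            u = a // v                     # u * v = a with u > v
            if (u - v) % 2 == 0:           # same parity, so x and y are integers
                solutions.append(((u + v) // 2, (u - v) // 2))
        v += 1
    return sorted(solutions)
```

With a = 200 this returns the three pairs of Example 1, and with a = 52 it returns only (14, 12), which corresponds to n = -1 in Example 2.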


Suppose now that k is not a square. If $(x_1, y_1)$ and $(x_2, y_2)$ are two different
solutions of (P), we will write $(x_1, y_1) < (x_2, y_2)$ if $x_1 < x_2$ and $y_1 < y_2$.


Lemma 1. The solutions of (P) can be ordered. 


Proof: Let $(x_1, y_1)$ and $(x_2, y_2)$ be two different solutions of (P). Then
$x_1^2 - ky_1^2 = x_2^2 - ky_2^2 = a$, which implies $x_1^2 - x_2^2 = k(y_1^2 - y_2^2)$, and so $x_1 - x_2$ and $y_1 - y_2$
are both positive or both negative. So either $(x_1, y_1) < (x_2, y_2)$ or
$(x_2, y_2) < (x_1, y_1)$.


Lemma 2. The equation $x^2 - ky^2 = 1$ always has a solution different from (1, 0).
For a proof of this well known result, we refer the reader to [3].


Lemma 3. Let $(\lambda, \mu)$ be the smallest solution of $x^2 - ky^2 = 1$ different from (1,0)
and let M denote the matrix

$$M = \begin{pmatrix} \lambda & k\mu \\ \mu & \lambda \end{pmatrix}.$$

Then, if (x, y) is a solution of (P), $M^\alpha(x, y)$ is also a solution for every $\alpha \in Z$,
where $M^0$ is the 2 × 2 unit matrix. (Note: for notational convenience, we sometimes
write row vectors instead of column vectors.)


Proof: Since $M(x, y) = (\lambda x + k\mu y, \mu x + \lambda y)$ and

$$M^{-1}(x, y) = (\lambda x - k\mu y, -\mu x + \lambda y),$$

we only have to show that $(\lambda x + \varepsilon k\mu y, \varepsilon\mu x + \lambda y)$ with $\varepsilon = \pm 1$ is a solution of
(P), which is quite easy.
The proofs of the following two lemmas are straightforward:


Lemma 4. If $(x_1, y_1) < (x_2, y_2)$, then $M(x_1, y_1) < M(x_2, y_2)$ and
$M^{-1}(x_1, y_1) < M^{-1}(x_2, y_2)$.


Lemma 5. $M^{-1}(x, y) < (x, y) < M(x, y)$.


CONCLUSION. The preceding results suggest the following method to solve (P).
First, if a > 0, the smallest (real) solution of (P) is $(\sqrt{a}, 0)$. Since
$M(\sqrt{a}, 0) = \sqrt{a}\,(\lambda, \mu)$, if we can find all the integer solutions of (P) lying between $(\sqrt{a}, 0)$ and
$\sqrt{a}\,(\lambda, \mu)$, say $(x_1, y_1), \ldots, (x_r, y_r)$, then the set of all solutions of (P) is given by
$\{M^\alpha(x_i, y_i) \mid \alpha \in N, i = 1, \ldots, r\}$. Likewise, if a < 0, it suffices to determine all the
solutions $(x_i, y_i)$ between

$$\bigl(0, \sqrt{-a/k}\bigr) \quad \text{and} \quad M\bigl(0, \sqrt{-a/k}\bigr) = \sqrt{-a/k}\,(k\mu, \lambda).$$

In order to apply this method, we have to determine the smallest nontrivial
solution $(\lambda, \mu)$ of $x^2 - ky^2 = 1$, which can be found by using continued fractions
([4]). The table of Section 6 below shows that the distribution of $(\lambda, \mu)$ seems to be
rather chaotic.
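As an illustration of the continued-fraction approach, here is a minimal sketch using the standard recurrence for the expansion of $\sqrt{k}$ and its convergents. It is not taken from [4]; the function name `fundamental_solution` is hypothetical.

```python
from math import isqrt

def fundamental_solution(k):
    """Smallest (x, y) != (1, 0) with x^2 - k*y^2 = 1, for non-square k > 0."""
    a0 = isqrt(k)
    if a0 * a0 == k:
        raise ValueError("k must not be a perfect square")
    m, d, a = 0, 1, a0
    # h/q runs through the convergents of the continued fraction of sqrt(k).
    h_prev, h = 1, a0
    q_prev, q = 0, 1
    while h * h - k * q * q != 1:
        m = d * a - m
        d = (k - m * m) // d
        a = (a0 + m) // d
        h_prev, h = h, a * h + h_prev
        q_prev, q = q, a * q + q_prev
    return h, q
```

For k = 11 this returns (10, 3), and for k = 842 it returns (1683, 58), the values used in Examples 3 and 4.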


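Example 3. To find the squares which can be expressed as a sum of 11 consecutive
squares, we have to solve the equation $11n^2 + 132n + 506 = m^2$. Putting m = 11y
and x = n + 6, we get $x^2 - 11y^2 = -10$. For k = 11, we have $\lambda = 10$ and
$\mu = 3$, and so it suffices to find all the solutions between

$$\sqrt{10/11}\,(0, 1) \quad \text{and} \quad \sqrt{10/11}\,(33, 10)$$

i.e. the solutions (x, y) with $1 \le y \le 9$; these are (1,1) and (23,7). All the other
solutions are given by

$$\begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 10 & 33 \\ 3 & 10 \end{pmatrix}^{\alpha} \begin{pmatrix} 1 \\ 1 \end{pmatrix}
= \frac{1}{2}\begin{pmatrix} (10 + 3\sqrt{11})^\alpha (1 + \sqrt{11}) + (10 - 3\sqrt{11})^\alpha (1 - \sqrt{11}) \\[4pt] \frac{1}{\sqrt{11}}\bigl[(10 + 3\sqrt{11})^\alpha (1 + \sqrt{11}) - (10 - 3\sqrt{11})^\alpha (1 - \sqrt{11})\bigr] \end{pmatrix}$$

and

$$\begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 10 & 33 \\ 3 & 10 \end{pmatrix}^{\alpha} \begin{pmatrix} 23 \\ 7 \end{pmatrix}
= \frac{1}{2}\begin{pmatrix} (10 + 3\sqrt{11})^\alpha (23 + 7\sqrt{11}) + (10 - 3\sqrt{11})^\alpha (23 - 7\sqrt{11}) \\[4pt] \frac{1}{\sqrt{11}}\bigl[(10 + 3\sqrt{11})^\alpha (23 + 7\sqrt{11}) - (10 - 3\sqrt{11})^\alpha (23 - 7\sqrt{11})\bigr] \end{pmatrix}.$$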

Example 4. To show that there is no square which is a sum of 842 consecutive
squares, we must solve $842n^2 + 709806n + 199337185 = m^2$. Putting x = 2n +
843, m = 421y, we get $x^2 - 842y^2 = -236321$. For k = 842, we have $\lambda = 1683$
and $\mu = 58$; so it suffices to find the solutions (x, y) with 0 < x < 818154 and
$17 \le y \le 28195$, but an easy computer check shows that no solution exists in this
interval (this check can be made very fast by noticing that $x^2 \equiv 281 \pmod{842}$, and
so $x \equiv \pm 165 \pmod{842}$, from which it follows that only the integers x of the form
$842t \pm 165$ with $0 \le t \le 971$ must be considered).
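The computer check mentioned in Example 4 might look like the following sketch. It assumes the candidates are $x = 842t \pm 165$ with $0 \le t \le 971$ and 0 < x < 818154, and tests whether $(x^2 + 236321)/842$ is a perfect square; the function name is hypothetical.

```python
from math import isqrt

def search_842():
    """Look for x^2 - 842 y^2 = -236321 with 0 < x < 818154, using x = 842 t +/- 165."""
    hits = []
    for t in range(0, 972):
        for x in (842 * t + 165, 842 * t - 165):
            if 0 < x < 818154:
                q, r = divmod(x * x + 236321, 842)
                if r == 0 and isqrt(q) ** 2 == q:
                    hits.append((x, isqrt(q)))
    return hits
```

An empty result is what the example reports for this interval.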


6. THE SMALLEST NONTRIVIAL SOLUTION $(\lambda, \mu)$ OF $x^2 - ky^2 = 1$




15 2 
151 20 
19603 2574 
530 69 
31 4 
1766319049 226153980 
63 
8 
129 16 
65 
48842 5967 
33 4 
7775 936 
251 30 
3480 413 
17 2 
2281249 267000 
430 


2 
1 
4 
2 
3 
1 
6 
3 
2 
0 
4 
1 
8 
4 
9 
2 
2 
2 
5 
1 
0 
5 




9801 1820 
Square Roots mod p


Stephen M. Turner 


We would like, given an odd prime p, and an integer y, to solve

$$x^2 \equiv y \pmod p. \tag{1}$$

There are fast ways of determining whether there is a solution: if p does not
divide y, Euler’s criterion says that there is a solution precisely when

$$y^{(p-1)/2} \equiv 1 \pmod p \tag{2}$$

and the calculation can be made more manageable using the so-called law of
quadratic reciprocity (see almost any book on elementary number theory, like
Rose’s [1], in the list at the end). If p does divide y, then we have a unique
solution $x \equiv 0$ mod p.

Suppose then, that we have the equation (1) as above, where p is an odd prime 
that does not divide y, and that there is known to be a solution. How do we go 
about finding one? Finding one enables the equation to be solved completely, as 
we can find the only other solution by changing sign (mod p). Let us call solutions 
of (1) square roots (mod p). Values of y (mod p) for which (1) has solutions are 
usually called quadratic residues (mod p). 

From now on, all congruences are taken to be (mod p). In some cases, solutions
are easy to find: if we can find a positive odd integer C such that

$$y^C \equiv 1$$

then we can just take either of

$$x \equiv \pm y^{(C+1)/2} \tag{3}$$

for then we have

$$x^2 \equiv y^{C+1} \equiv y$$

and we are done. This is always possible if $p \equiv 3 \pmod 4$, for then we can take
C = (p - 1)/2, using Euler’s criterion again. This choice for C is no good if
$p \equiv 1 \pmod 4$, but at least we have got a solution which works half of the time. We
may be able to find a different C, though; see later.

Note that we can only find such a C if there is a solution of the form 
x = (power of y), which need not be the case in general. (Examples in a moment).
This point is made by Gauss in his investigation of this problem (see [5]), and he 
goes on to give a procedure which sometimes simplifies it in other cases. 

The first general method of solution appears to be due to Shanks (1972). This 
method (see [2]) starts with a simple rule for choosing an odd number C, just as 
above, which either gives a solution immediately or can be used to generate one. 

Shanks suggests choosing C to be the largest odd divisor of p - 1: i.e., choose
C by writing

$$p - 1 = C \cdot 2^s \tag{4}$$

where C is odd and s is positive. When $p \equiv 3 \pmod 4$, this gives the same value
for C as before, so we have generalized Gauss’s construction.


Example. Take p = 13, so that C = 3. As $13 \equiv 1 \pmod 4$, Gauss’s method is
inapplicable. Euler’s criterion shows that both 3 and -1 have square roots mod 13.
Trying the above method:

(i) y = 3: then $y^C = 3^3 \equiv 1 \pmod{13}$. Take $x = y^2 = 9$. And we have found the
solutions $x = \pm 9$.

(ii) y = -1: then $y^C = (-1)^3 = -1 \pmod{13}$ and the method fails, although
there are solutions $x = \pm 5$.
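The first step described above can be sketched directly. This is only the immediate test with Shanks's choice of C, not his full algorithm; the function name is hypothetical, and y is assumed to be a quadratic residue not divisible by the odd prime p.

```python
def shanks_first_step(y, p):
    """Try x = y^((C+1)/2) with p - 1 = C * 2^s, C odd.
    Returns x, or None when y^C is not 1 (mod p) and the step fails."""
    C = p - 1
    while C % 2 == 0:
        C //= 2
    if pow(y, C, p) != 1:
        return None
    return pow(y, (C + 1) // 2, p)
```

With p = 13 this returns 9 for y = 3 and None for y = 12 (that is, y = -1), matching the two parts of the example.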


The second part of the example shows that it is not always possible to find an
odd C with the required property, since we always have square roots of -1 when
$p \equiv 1 \pmod 4$ but $(-1)^C = -1$ mod p for any odd C (and $-1 \not\equiv 1$ as p is odd!).

As mentioned already, Shanks uses this C in the first step of an algorithm which 
solves (1) completely in all cases. In his article, he claims that his choice for C 
solves the congruence immediately in 2/3 of all cases. This note makes this claim 
precise, and proves it. The proof requires very little previous knowledge: any 
results needed are quoted, and they seem quite interesting in their own right 
anyway. 

We certainly expect that the method solves (1) immediately in at least 1/2 of all
cases, as half of the primes are congruent to 3 mod 4.

Let p and y be as above, and let C and s be defined as in (4). Let E denote the 
event ‘congruence solved immediately’ and write ‘Prob’ for probability. The proof 
of Shanks’s claim goes like this: 

(a) we find Prob(E, for some fixed odd p, but randomly chosen y);

(b) we choose a number M > 0, and find an expression for Prob(E, given odd
p < M, but randomly chosen y), given that y has square roots (mod p);

(c) we let M approach infinity, and show that we can define what we mean by
Prob(E) in an unambiguous way (where p and y are chosen randomly subject to
the usual conditions) and that the limiting value is 2/3, establishing Shanks’s claim.


PART (A). In this section we use the following theorem about cyclic groups (a 
cyclic group is a group consisting entirely of the powers of one of its elements, the 
element being a generator of the group.) 


Theorem 1. Let G be a cyclic group with m elements, and suppose that n divides m. 
Then there are exactly n elements in G satisfying $x^n$ = identity of G.


Given a fixed odd prime p, we know that there are p — 1 residue classes which 
are not divisible by p: in fact these form a cyclic group G (say). Since each 
quadratic residue has exactly two distinct square roots, there are exactly (p — 1)/2 
quadratic residues (mod p), and it is not hard to see that these form a subgroup H 
(say) of G. Then the probability we require is simply given by (say) 

#{y in H: y© = identity} 


Ip) = #{y in H} 
Noting that H is itself a cyclic group of (p — 1)/2 elements, we apply Theorem 
1 (noting that C divides #H) to get 


f(p) = (5) 


C 7 2 
(p-1)/2 2°" 
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Then define the function f on integers m (m # 1) by 
f(m) =2A/(m 1) (Sa) 


where A is the largest odd divisor of m — 1, so that we have generalized f; this 
will be useful later on. This concludes part (a). 
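
The counting argument of part (a) can be checked by brute force for small primes. The following is a minimal sketch, not part of the original argument; the helper names `largest_odd_divisor`, `immediate_fraction` and `f` are illustrative choices. It lists the quadratic residues mod p, counts those y with y^C = 1, and compares the ratio with formula (5a).

```python
from fractions import Fraction


def largest_odd_divisor(n):
    """Strip all factors of 2 from a positive integer n."""
    while n % 2 == 0:
        n //= 2
    return n


def immediate_fraction(p):
    """Brute force: share of quadratic residues y (mod p) with y^C = 1,
    where C is the largest odd divisor of p - 1 (p an odd prime)."""
    C = largest_odd_divisor(p - 1)
    residues = {x * x % p for x in range(1, p)}  # the subgroup H
    hits = sum(1 for y in residues if pow(y, C, p) == 1)
    return Fraction(hits, len(residues))


def f(m):
    """Formula (5a): f(m) = 2A/(m - 1), A the largest odd divisor of m - 1."""
    return Fraction(2 * largest_odd_divisor(m - 1), m - 1)


for p in (3, 5, 7, 13, 17, 41, 97):
    print(p, immediate_fraction(p), f(p))
```

For each prime listed, the two printed fractions should coincide, as Theorem 1 predicts.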


PART (B). Fix some M > 3, and write P(M) = {odd primes < M}. If we choose p 
randomly from P(M), and then a random quadratic residue y (mod p), we have 
(say) 


\[ T(M) = \mathrm{Prob}(E \text{ given } p \in P(M)) = \sum_{q \in P(M)} \mathrm{Prob}(E \text{ and } p = q) \tag{6} \]


where the probabilities add as the events are exclusive, and using the definition of 
conditional probability, namely that for any two events A and B, 


Prob( A given B) = Prob( A and B)/Prob(B), 


we can write 


\[ T(M) = \sum_{q \in P(M)} \mathrm{Prob}(E \text{ given } p = q)\,\mathrm{Prob}(p = q) \]


and Prob(E given p = q) is just f(q) in the notation of part (a), so we have 


\[ T(M) = \sum_{q \in P(M)} f(q)\,\mathrm{Prob}(p = q). \tag{7} \]


For any q in P(M), we have f(q) = 2/2^t for some integer t ≥ 1; now we can 
rewrite the sum in (7) by summing not over the elements in P(M), but over the 
values attained by the function f—i.e. by grouping together m and n (say) 
precisely when f(m) = f(n). So we get 

\[ T(M) = \sum_{t=1}^{\infty} \frac{2}{2^t}\,\mathrm{Prob}\!\left(p \in P(M) \text{ and } f(p) = \frac{2}{2^t}\right) \tag{8} \]

which is a finite series, as 2^t > M for large enough t. We want now to find what 
proportion of the odd primes q in P(M) satisfy f(q) = 2/2^t for any given t. 
Using the definition (5a) of the function f, we see that for any integer m (except 
1), and for any integer t > 0, 

\[ f(m) = \frac{2}{2^t} \text{ precisely when } m \equiv 1 + 2^t \pmod{2^{t+1}}, \tag{9} \]

since we must be able to write m − 1 = A·2^t when this is so, for some odd A. In 
general there is no way of calculating the proportion of primes in P(M), which 
have this property except by counting them, so we need something a bit more 
sophisticated if we are to make any progress—that is if we hope to be able to write 
down an expression for such a quantity. We can however find an approximation 
which is good for sufficiently large M, which is all that we need anyway. This way 
forward is used in the next section. 


PART (C). We need the following well-known result, which is Dirichlet’s theorem 
on primes in arithmetic progression. 


Theorem 2. Let a and b be coprime, with b > 0. Then there are infinitely many 
primes congruent to a mod b (i.e. the arithmetic progression a, a + b, a + 2b, ... 
contains infinitely many primes). 

In fact, a stronger version of Theorem 2 will be useful, to state which we use the 
following definition. Euler’s phi-function is defined as follows: if n > 0, phi(n) is 
defined to be the number of congruence classes coprime to n—so that phi(10) is 4, 
since {1, 3, 7, 9} are coprime to 10, but {0, 2, 4,5, 6, 8} are not. We will use the facts 
that 

(i) phi(2^(t+1)) = 2^t; and 

(ii) in Theorem 2, given b, there are phi(b) choices for a giving different 
progressions. 

The result we need strengthens Theorem 2 by stating that the primes which are 
coprime to b are distributed roughly equally among the progressions. 


Theorem 3. With the same a and b, let R be positive. Then 

\[ \frac{\#\{p : p \text{ prime},\ p < R,\ p \equiv a \bmod b\}}{\#\{p : p \text{ prime},\ p < R\}} \longrightarrow \frac{1}{\mathrm{phi}(b)} \]

as R goes to infinity. (See [3] for a proof.) 


We can think of the quantity on the left in Theorem 3 as the probability that a 
randomly chosen prime which does not exceed R is congruent to a mod b. In fact, 
we can suppose that we exclude from the denominator any primes which divide b 
—we will want to do this in practice—and the limiting value on the right in 
Theorem 3 is unaffected, since b has only finitely many prime divisors. Such 
primes are already excluded from the numerator, as a and b are coprime. We will 
write 


\[ G(a, b, R) = \frac{\#\{p : p \text{ prime},\ p < R,\ p \equiv a \bmod b\}}{\#\{p : p \text{ prime},\ p < R,\ p \text{ coprime to } b\}} \tag{10} \]

Putting b = 2^(t+1), a = 1 + 2^t, R = M, we get, using (9), 

\[ G(1 + 2^t, 2^{t+1}, M) = \frac{\#\{p : p \in P(M),\ p \equiv 1 + 2^t \bmod 2^{t+1}\}}{\#\{p : p \in P(M)\}} \]

as only the prime 2 needs to be excluded, and this is just the same as 

\[ \mathrm{Prob}\!\left(p \in P(M) \text{ and } f(p) = 2/2^t\right), \]

the quantity which appears in (8). So we can write (8) as 

\[ T(M) = \sum_{t=1}^{\infty} \frac{2}{2^t}\, G(1 + 2^t, 2^{t+1}, M), \tag{11} \]

recalling that T(M) = Prob(E given p in P(M)). What we would like to do is to 
evaluate Lim_{M→∞} T(M), and call this Prob(E). There is a snag however, because 
we have to interchange the two limiting processes of “M → ∞” and the infinite 
summation in (11) to perform the calculation, as we cannot find T(M) directly. To 
express things more succinctly, we can write (11) itself as a limit, i.e. write 
T(M) = Lim_{N→∞} A(M, N), where we define 

\[ A(M, N) = \sum_{t=1}^{N} \frac{2}{2^t}\, G(1 + 2^t, 2^{t+1}, M). \tag{12} \]

A(M, N) is an example of a double sequence, that is, it is indexed by two sets 
like {1, 2, 3, ...}. Note that since the primes themselves form a discrete set in the 
reals (a set consisting of isolated points), we can suppose that M is an integer, 
without affecting the limiting behaviour as M → ∞. 
Thus we want to define Prob(E) = Lim_{M→∞}(Lim_{N→∞} A(M, N)), but cannot 
evaluate this directly. We can, however, easily find Lim_{N→∞}(Lim_{M→∞} A(M, N)), 
the iterated limit in the other order, since we have 

\[ \lim_{M \to \infty} A(M, N) = \sum_{t=1}^{N} \frac{2}{2^t} \cdot \frac{1}{2^t} = \frac{2}{3}\left(1 - \frac{1}{4^N}\right) \]

and so 

\[ \lim_{N \to \infty} \left( \lim_{M \to \infty} A(M, N) \right) = 2/3. \tag{13} \]


Unfortunately, we cannot interchange the order of iteration without justification, 
since, for example 


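\[ \lim_{M \to \infty} \left( \lim_{N \to \infty} \frac{M}{M + N} \right) = 0 \]

but 

\[ \lim_{N \to \infty} \left( \lim_{M \to \infty} \frac{M}{M + N} \right) = 1. \]

We will see that in our case, the interchange can be justified. Firstly, we define, 
for a double sequence B(M, N), the double limit (if it exists) to be 

\[ \lim_{K \to \infty} B(M, N), \text{ where } K = \min(M, N). \tag{14} \]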


Note that a double limit may exist even if neither of the iterated limits does: to 
see this, consider the example 


B(M,N) = 


In our case, we will have, for the sequence A(M, N) defined above, that the 
double limit and the two iterated limits all exist and agree, with the common value 2/3, 
so proving Shanks’s claim. We will need the following (see [4]). 


Theorem 4. Let B(M, N) be any double sequence of complex numbers. Suppose that 
(i) Lim_{M→∞} B(M, N) exists for all N; 
(ii) Lim_{N→∞} B(M, N) exists for all M; and 
(iii) the double limit exists. 
Then the two iterated limits exist and agree. 


From this, it follows quite readily that the iterated limits also agree with the 
double limit. In our case, this means that Prob(E) exists and is defined unambiguously, 
with the value 2/3 (since this is the value found in (13)). Hence, Shanks’s claim 
will follow once we have shown that A(M, N) satisfies the hypotheses of Theorem 
4. Checking this: 

(i) already shown (in deriving (13)); 

(ii) for each fixed M, A(M, N) is just a finite sum as G(1 + 2^t, 2^(t+1), M) = 0 for 
all large enough t (see (8)); 

(iii) we will actually show directly that the double limit is 2/3. This is done as 
follows: 


It is enough to show that by choosing both M and N to be sufficiently large, we 
can make A(M, N) − 2/3 arbitrarily small in absolute value. Fix an ε > 0, and 
choose N_0 so that 

\[ N \ge N_0 \implies 2^N > 6/\varepsilon \tag{15} \]

where this comes from the two conditions which we will (with the benefit of 
hindsight!) require, namely: 1 − 1/4^N > 1 − ε/2, whenever N ≥ N_0, and 

\[ \sum_{t=N_0+1}^{\infty} \frac{2}{2^t} < \frac{\varepsilon}{3}. \]

We know that, for each t, Lim_{M→∞} G(1 + 2^t, 2^(t+1), M) = 2^(−t), so for each t in 
{1, 2, 3, ..., N_0}, there exists an M_t such that 

\[ M \ge M_t \implies \frac{1 - \varepsilon}{2^t} < G(1 + 2^t, 2^{t+1}, M) < \frac{1 + \varepsilon}{2^t}. \tag{16} \]

Then write M_0 = max(M_1, M_2, ..., M_{N_0}), so that, whenever M ≥ M_0, substituting 
(16) into (12), we get (after summing the geometric series which results), that 

\[ (1 - \varepsilon)\,\frac{2}{3}\left(1 - \frac{1}{4^{N_0}}\right) < A(M, N_0) < (1 + \varepsilon)\,\frac{2}{3}\left(1 - \frac{1}{4^{N_0}}\right). \tag{17} \]

By construction of N_0, we have 

\[ 1 - \frac{\varepsilon}{2} < 1 - \frac{1}{4^{N_0}} < 1, \]

and substituting this into (17) gives (since ε²/3 > 0) 

\[ -\varepsilon + \frac{2}{3} < A(M, N_0) < \frac{2}{3} + \frac{2}{3}\varepsilon. \tag{18} \]

Now suppose that N > N_0, and write A(M, N) = A(M, N_0) + Z, where 

\[ Z = \sum_{t=N_0+1}^{N} \frac{2}{2^t}\, G(1 + 2^t, 2^{t+1}, M). \tag{19} \]

Since the G’s are probabilities, and by our choice of N_0, 

\[ 0 \le Z \le \sum_{t=N_0+1}^{N} \frac{2}{2^t} < \frac{\varepsilon}{3}. \]

Finally, then, if M ≥ M_0 and N ≥ N_0, and so certainly whenever min(M, N) ≥ 
max(M_0, N_0), 

\[ -\varepsilon + \frac{2}{3} \le Z - \varepsilon + \frac{2}{3} < A(M, N) < Z + \frac{2}{3} + \frac{2}{3}\varepsilon \le \varepsilon + \frac{2}{3} \]

and hence the double limit is 2/3. This checks condition (iii) of Theorem 4 for the 
sequence A(M, N) and so establishes Shanks’s claim. 


SUMMARY. We have proved the following: given an odd prime p, chosen at 
random, and a random quadratic residue y (mod p), the square roots of y (mod p) 
are, in 2/3 of all cases, the values of x given by Equation (3), where C is the largest 
odd divisor of p − 1. (It is also true that we can generate the square roots from 
these starting values in other cases, as Shanks shows.) 
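
The limiting value 2/3 can also be observed numerically. The sketch below is only an illustration under the assumption that p is drawn uniformly from P(M), so that T(M) in (7) is the plain average of f over the odd primes below M; the function names are hypothetical and the sieve is a standard one, not something taken from the text.

```python
from fractions import Fraction


def odd_primes_below(M):
    """Sieve of Eratosthenes; returns the odd primes < M, i.e. the set P(M)."""
    sieve = [True] * max(M, 2)
    sieve[0:2] = [False, False]
    for n in range(2, int(M ** 0.5) + 1):
        if sieve[n]:
            sieve[n * n::n] = [False] * len(sieve[n * n::n])
    return [n for n in range(3, M) if sieve[n]]


def f(p):
    """Formula (5a): 2A/(p - 1), with A the largest odd divisor of p - 1."""
    A = p - 1
    while A % 2 == 0:
        A //= 2
    return Fraction(2 * A, p - 1)


def T(M):
    """Average of f over P(M), assuming p is uniform on P(M); needs M > 3."""
    primes = odd_primes_below(M)
    return sum(f(p) for p in primes) / len(primes)


for M in (100, 1000, 20000):
    print(M, float(T(M)))
```

The printed averages should drift towards 2/3 as M grows, in line with the double-limit argument.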
PICTURE PUZZLE 
(from the collection of Paul Halmos) 


A son more famous than his father? 
(see page 458.) 
NOTES 


Edited by: John Duncan 


Kummer’s Test Gives Characterizations 
for Convergence or Divergence of all 
Positive Series 


Jingcheng Tong 


One of the basic facts about a positive series is that the series converges if and only 
if its partial sums are bounded. Equivalently, a positive series diverges if and only 
if its partial sums are unbounded. These observations, however, do not provide a 
practical way of determining convergence or divergence of positive series. Thus, it 
is surprising that Kummer’s test, which is a source of many useful convergence and 
divergence tests, is equivalent to the boundedness of the partial sums, and it is also 
surprising that this fact is not well publicized and possibly unknown until now. 
Kummer’s test is the sufficient part of our main result, which will be stated shortly. 

There is a short introduction on the history of Kummer’s test in [9]. The test was 
given by the German mathematician Ernst E. Kummer as early as 1835 (Journ. 
reine angew. Math. Vol 13, p. 172), though with a restrictive condition which was 
first recognized by U. Dini in 1867. Later it was rediscovered several times and 
gave rise as late as 1888, to a violent contention on questions of priority. 

Kummer’s test gives very powerful sufficient conditions for convergence or 
divergence of a positive series. As mentioned above it is the source of many other 
tests ([6], [10]). For instance, D’Alembert’s test, Raabe’s test, Bertrand’s test, and 
Gauss’ test are all special cases of Kummer’s test obtained by choosing specific 
“parameters” p_n. 


Theorem. Let Σa_n be a positive series. 

(1) Σa_n is convergent if and only if there is a positive series Σp_n and a real 
number c > 0, such that p_n(a_n/a_{n+1}) − p_{n+1} ≥ c. 

(2) Σa_n is divergent if and only if there is a positive series Σp_n, such that Σ1/p_n 
diverges and p_n(a_n/a_{n+1}) − p_{n+1} ≤ 0. 


Proof: Although proofs of the sufficiencies of (1), (2) can be found in many books 
on advanced calculus, we feel their simplicity warrants their inclusion for com- 
pleteness. 


Sufficiency. 
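(1) Suppose that Σp_n is a positive series and there is a real number c > 0, such 
that p_k(a_k/a_{k+1}) − p_{k+1} ≥ c. This implies that 

\begin{align*}
a_1 + \sum_{k=2}^{N} a_k &\le a_1 + (1/c) \sum_{k=1}^{N} (p_k a_k - p_{k+1} a_{k+1}) \\
&= a_1 + (1/c)\left[ \sum_{k=1}^{N} p_k a_k - \sum_{k=2}^{N+1} p_k a_k \right] \\
&= a_1 + (1/c)(p_1 a_1 - p_{N+1} a_{N+1}) \le a_1 + p_1 a_1/c,
\end{align*}

so the partial sums of Σa_n are bounded and the series converges. 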
(2) Suppose that Σp_n is a positive series for which Σ1/p_n diverges and 
p_k(a_k/a_{k+1}) − p_{k+1} ≤ 0. This implies that 

\[ a_2 \ge \frac{p_1 a_1}{p_2}, \quad a_3 \ge \frac{p_2 a_2}{p_3} \ge \frac{p_1 a_1}{p_3}, \ \ldots, \]

and consequently, 

\[ \sum_{k=1}^{N} a_k \ge a_1 + (p_1 a_1) \sum_{k=2}^{N} (1/p_k). \]


Necessity. 
(1) Suppose that Σa_n is a convergent positive series. We are going to construct 
a positive series Σp_n for which p_k(a_k/a_{k+1}) − p_{k+1} = 1. 

Since Σa_n is convergent, it converges to a real number M, M = Σ_{i=1}^∞ a_i. Let 
p_k = (M − Σ_{i=1}^k a_i)/a_k. It is easily seen that p_k > 0 for any positive integer k and 

\begin{align*}
p_k(a_k/a_{k+1}) - p_{k+1} &= \Big(M - \sum_{i=1}^{k} a_i\Big)\Big/a_{k+1} - \Big(M - \sum_{i=1}^{k+1} a_i\Big)\Big/a_{k+1} \\
&= a_{k+1}/a_{k+1} = 1.
\end{align*}
(2) Suppose that Σa_n is a divergent positive series. Let p_k = Σ_{i=1}^k a_i/a_k. It is 
easily seen that p_k > 0 and 

\[ p_k(a_k/a_{k+1}) - p_{k+1} = \sum_{i=1}^{k} a_i/a_{k+1} - \sum_{i=1}^{k+1} a_i/a_{k+1} = -a_{k+1}/a_{k+1} = -1 < 0. \]

Now we prove that Σ1/p_k diverges. We are going to show that for any given 
positive integer m, there is an n > m such that Σ_{i=m}^n 1/p_i > 1/2. 
Since Σa_n diverges and a_n > 0, for any given positive integer m, we can always 
find an n > m such that a_m + ··· + a_n > a_1 + ··· + a_{m−1}. Hence 

\begin{align*}
\sum_{i=m}^{n} 1/p_i &= a_m/(a_1 + \cdots + a_m) + \cdots + a_n/(a_1 + \cdots + a_n) \\
&\ge (a_m + \cdots + a_n)/(a_1 + \cdots + a_n) \\
&= 1/\big(1 + (a_1 + \cdots + a_{m-1})/(a_m + \cdots + a_n)\big) \\
&> 1/(1 + 1) = 1/2.
\end{align*}

Therefore Σ1/p_k diverges by Cauchy’s criterion. 
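
The two constructions in the necessity part can be tried out with exact rational arithmetic. The following sketch is only an illustration; the choice of the geometric series 1/2^k (with sum M = 1) and of the harmonic series as test cases, and the helper name `kummer_differences`, are assumptions made here and do not come from the note.

```python
from fractions import Fraction
from itertools import accumulate


def kummer_differences(a, p):
    """Return p_k*(a_k/a_{k+1}) - p_{k+1} for consecutive list entries."""
    return [p[k] * a[k] / a[k + 1] - p[k + 1] for k in range(len(a) - 1)]


n = 12

# Convergent case: a_k = 1/2^k, whose sum is M = 1; p_k = (M - S_k)/a_k.
a_conv = [Fraction(1, 2 ** k) for k in range(1, n + 1)]
M = Fraction(1)
p_conv = [(M - s) / a for s, a in zip(accumulate(a_conv), a_conv)]

# Divergent case: harmonic series a_k = 1/k; p_k = S_k/a_k.
a_div = [Fraction(1, k) for k in range(1, n + 1)]
p_div = [s / a for s, a in zip(accumulate(a_div), a_div)]

print(kummer_differences(a_conv, p_conv))  # every entry should be 1
print(kummer_differences(a_div, p_div))    # every entry should be -1
```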
Isometries of ℓ_p-norm 


Chi-Kwong Li and Wasin So 


The goal of this note is to provide a short proof of the fact that the isometries of 
ℓ_p-norm (p ≠ 2) on R^n are generalized permutation matrices, those matrices that 
can be written as a product of a diagonal orthogonal matrix and a permutation 
matrix. As usual, the ℓ_p-norm on R^n is defined by ℓ_p(x) = (Σ_{i=1}^n |x_i|^p)^{1/p} if 
1 ≤ p < ∞, and ℓ_∞(x) = max_{1≤i≤n} |x_i| if p = ∞. An isometry of ℓ_p-norm is an 
n × n matrix A satisfying 

\[ \ell_p(Ax) = \ell_p(x) \quad \text{for all } x \in \mathbb{R}^n. \]

It is well-known that the isometries of the ℓ_2-norm are orthogonal matrices. A less 
well-known fact is the following theorem. 


Theorem. Suppose 1 ≤ p ≤ ∞ and p ≠ 2. An n × n matrix A is an isometry of 
ℓ_p-norm if and only if A is a generalized permutation matrix. 


This result can be deduced from stronger statements about more general norms 
[2] [3] [7] or can be viewed as a special case of its infinite dimensional version [1, 
p. 119] [5, p. 112]. However, there is a direct proof which requires only a basic fact 
from the theory of norms on R^n [4, Ch. 5]: an n × n matrix A is an isometry of 
ℓ_p-norm if and only if its transpose A^t is an isometry of ℓ_q-norm, where 1/p + 
1/q = 1. 


Proof: The sufficiency is easy to check. For necessity, suppose A = (a_ij) is an 
isometry of the ℓ_p-norm. We may assume 1 ≤ p < 2, otherwise consider the 
ℓ_q-norm and A^t. First consider the case that p ≠ 1. For i = 1, ..., n, let e_i denote 
the ith column of the n × n identity matrix. Then ℓ_p(Ae_i) = 1 for all i = 1, ..., n. 
Thus |a_ij| ≤ 1 and Σ_{i,j=1}^n |a_ij|^p = n. Since A^t is an isometry of the ℓ_q-norm, the 
same argument gives Σ_{i,j=1}^n |a_ij|^q = n. Notice that |a_ij|^q ≤ |a_ij|^p and equality 
holds if and only if |a_ij| = 0 or 1. Since every column of A (respectively, A^t) has 
ℓ_p-norm (respectively, ℓ_q-norm) equal to one, the equality Σ_{i,j=1}^n |a_ij|^p = n = 
Σ_{i,j=1}^n |a_ij|^q implies that each row and each column of A has exactly one nonzero 
entry whose magnitude equals one, and the result follows. For the case p = 1, the 
above argument also yields Σ_{i,j=1}^n |a_ij| = n = Σ_{i=1}^n α_i, where α_i = max_{1≤j≤n} |a_ij|. 
Thus each row and hence each column of A has only one nonzero entry with 
magnitude equal to one, and the result follows. 
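
As a small numerical illustration of the theorem (a sketch only, with matrices and vectors chosen here for the example, not taken from the note), one can compare a generalized permutation matrix with a rotation, which is orthogonal but has no zero entries.

```python
import math


def lp_norm(x, p):
    """The l_p norm of a vector, with p = math.inf giving the max norm."""
    if p == math.inf:
        return max(abs(t) for t in x)
    return sum(abs(t) ** p for t in x) ** (1.0 / p)


def matvec(A, x):
    """Plain matrix-vector product for lists of rows."""
    return [sum(a * t for a, t in zip(row, x)) for row in A]


# A generalized permutation matrix: one entry of magnitude 1 per row and column.
G = [[0, -1, 0],
     [0, 0, 1],
     [1, 0, 0]]

# Rotation by 45 degrees: an l_2 isometry that is not a generalized permutation.
c = math.sqrt(0.5)
R = [[c, -c],
     [c, c]]

x = [3.0, -1.5, 2.0]
for p in (1, 1.5, 3, math.inf):
    print(p, lp_norm(x, p), lp_norm(matvec(G, x), p))

e1 = [1.0, 0.0]
print(lp_norm(e1, 1), lp_norm(matvec(R, e1), 1))  # second value is sqrt(2), not 1
```

The matrix G preserves every ℓ_p norm tried, while the rotation R already fails for p = 1 on a single basis vector.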


After this note was finished, the authors were informed that R. Mathias [6] had 
obtained the same proof for the complex case previously and independently. 


ACKNOWLEDGMENT. The authors wish to thank Professor R. Horn for bringing the references [6] 
[7] to their attention. 
A Trace Inequality for Unitary Matrices 


Bo-Ying Wang and Fuzhen Zhang 


The aim of this note is to present a trace inequality for complex unitary matrices, 
relating the average of the eigenvalues of each of two unitary matrices to that of 
their product. 

tr X, as usual, denotes the trace of the n-square matrix X, which is equal to the 
sum of the eigenvalues of X, and m(X) designates the algebraic mean of the 
eigenvalues of X. We show that for any unitary matrices U and V 

\[ \sqrt{1 - |m(UV)|^2} \le \sqrt{1 - |m(U)|^2} + \sqrt{1 - |m(V)|^2} \tag{1} \]

with equality if and only if U or V is a unitary scalar matrix. 

(1) follows from an inequality in an inner product space V with inner product 
(·,·), which is of interest itself. 

For any unit vectors u, v and w in V, we claim that 

\[ \sqrt{1 - |(u,v)|^2} \le \sqrt{1 - |(u,w)|^2} + \sqrt{1 - |(v,w)|^2} \tag{2} \]

with equality if and only if w is a multiple of u or v. 
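To prove this, we first notice that any component of w that is orthogonal to the 
span of u and v plays no role in (2), i.e., we really have a problem in which u and 
v are arbitrary unit vectors, w is in the span of u and v, and (w, w) ≤ 1. The case 
w = 0 is trivial. If w ≠ 0, scaling up w to have length 1 diminishes the right-hand 
side of (2), so we are done if we can prove inequality (2) for arbitrary unit vectors 
u, v and w with w in the span of u and v. The case in which u and v are 
dependent is trivial. Suppose u and v are independent, and let {u, z} be an 
orthonormal basis of Span{u, v}, so that v = μu + λz and w = αu + βz for some 
complex numbers μ, λ, α and β; then 

\[ |\lambda|^2 + |\mu|^2 = 1 \quad \text{and} \quad |\alpha|^2 + |\beta|^2 = 1. \]

Now use these relations and the arithmetic-geometric mean inequality, together 
with the fact |c| ≥ Re(c) for any complex number c, to compute 

\begin{align*}
|\lambda\beta| &= |\lambda\beta|\,(|\mu|^2 + |\lambda|^2 + |\alpha|^2 + |\beta|^2)/2 \\
&\ge |\lambda\beta|\,(|\lambda\beta| + |\alpha\mu|) \\
&= |\lambda\beta|^2 + |\lambda\beta\alpha\mu| \\
&= |\lambda\beta|^2 + |\lambda\bar\beta\alpha\bar\mu| \\
&\ge |\lambda\beta|^2 + \operatorname{Re}(\lambda\bar\beta\alpha\bar\mu),
\end{align*}

so that −2|λβ| ≤ −2|λβ|² − 2Re(λβ̄αμ̄). Thus we have 

\begin{align*}
(|\lambda| - |\beta|)^2 &= |\lambda|^2 - 2|\lambda\beta| + |\beta|^2 \\
&\le |\lambda|^2 + |\beta|^2 - 2|\lambda\beta|^2 - 2\operatorname{Re}(\lambda\bar\beta\alpha\bar\mu) \\
&= |\lambda|^2 + |\beta|^2(1 - |\lambda|^2) - |\lambda\beta|^2 - 2\operatorname{Re}(\lambda\bar\beta\alpha\bar\mu) \\
&= (1 - |\mu|^2) + |\beta|^2|\mu|^2 - |\lambda\beta|^2 - 2\operatorname{Re}(\lambda\bar\beta\alpha\bar\mu) \\
&= 1 - |\mu|^2(1 - |\beta|^2) - |\lambda\beta|^2 - 2\operatorname{Re}(\lambda\bar\beta\alpha\bar\mu) \\
&= 1 - |\mu\alpha|^2 - |\lambda\beta|^2 - 2\operatorname{Re}(\lambda\bar\beta\alpha\bar\mu) \\
&= 1 - |\alpha\bar\mu + \beta\bar\lambda|^2.
\end{align*}

This gives 

\[ |\lambda| - |\beta| \le \sqrt{1 - |\alpha\bar\mu + \beta\bar\lambda|^2}, \quad \text{or} \quad |\lambda| \le |\beta| + \sqrt{1 - |\alpha\bar\mu + \beta\bar\lambda|^2}, \]

which is the same as 

\[ \sqrt{1 - |\mu|^2} \le \sqrt{1 - |\alpha|^2} + \sqrt{1 - |\alpha\bar\mu + \beta\bar\lambda|^2}. \]

Since |μ|² = |(u, v)|², |α|² = |(u, w)|², and |αμ̄ + βλ̄|² = |(αu + βz, μu + 
λz)|² = |(w, v)|², the inequality (2) is proved. 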
Equality holds for the overall inequality if and only if equality holds at the two 
points in our derivation where we invoked the arithmetic-geometric mean inequality 
and |c| ≥ Re(c). Thus, equality holds if and only if |λ| = |β| and |α| = |μ|, as 
well as Re(λβ̄αμ̄) = |λβ̄αμ̄|. The former is equivalent to having λ = e^{iθ}β and 
μ = e^{iφ}α for some real numbers θ and φ, while the latter is then equivalent to 
Re{|αβ|²(e^{i(θ−φ)} − 1)} = 0. Thus, α = 0, β = 0, or e^{iθ} = e^{iφ}, so equality in (2) 
holds if and only if either w is a multiple of u (β = 0) or w is a multiple of v 
(α = 0 or e^{iθ} = e^{iφ}). 

It is readily seen that for unit vectors u, v and w in V, 

\[ \left| \sqrt{1 - |(w,v)|^2} - \sqrt{1 - |(u,w)|^2} \right| \le \sqrt{1 - |(u,v)|^2}. \]

(2) can be rewritten as 

\[ \sqrt{s - \|w\|^2 |(u,v)|^2} \le \sqrt{s - \|v\|^2 |(u,w)|^2} + \sqrt{s - \|u\|^2 |(v,w)|^2}, \]

for any u, v and w in V, where s = (‖u‖ ‖v‖ ‖w‖)². 

Now we return to the inequality (1) by considering the vector space of all n-by-n 
complex matrices with the inner product (A, B) = tr B*A for every pair of 
matrices A and B, where * means the conjugate transpose. 

Putting 

\[ w = \frac{1}{\sqrt{n}} I, \quad u = \frac{1}{\sqrt{n}} V \quad \text{and} \quad v = \frac{1}{\sqrt{n}} U^* \]

in (2), where I is the n-by-n identity matrix and U and V are any n-by-n unitary 
matrices, we have 

\[ \sqrt{1 - \left|\tfrac{1}{n}\operatorname{tr} UV\right|^2} \le \sqrt{1 - \left|\tfrac{1}{n}\operatorname{tr} U\right|^2} + \sqrt{1 - \left|\tfrac{1}{n}\operatorname{tr} V\right|^2}; \]

equality occurs if and only if U or V is a unitary scalar matrix, that is 

\[ \sqrt{1 - |m(UV)|^2} \le \sqrt{1 - |m(U)|^2} + \sqrt{1 - |m(V)|^2} \]

with equality if and only if U or V is a unitary scalar matrix. 
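
Inequality (1) is easy to test numerically in the special case of diagonal unitary matrices, where the eigenvalues of U, V and UV can be read off directly. The sketch below makes that simplifying assumption (it does not cover general unitary matrices), and the function names are illustrative only.

```python
import cmath
import math
import random


def mean_eigenvalue(angles):
    """m(X) for the diagonal unitary matrix X = diag(exp(i*t)), t in angles."""
    return sum(cmath.exp(1j * t) for t in angles) / len(angles)


def defect(m):
    """sqrt(1 - |m|^2), clamped at 0 to absorb rounding error."""
    return math.sqrt(max(0.0, 1.0 - abs(m) ** 2))


def check_inequality(theta, phi, tol=1e-12):
    """Test (1) for U = diag(exp(i*theta_j)) and V = diag(exp(i*phi_j))."""
    m_u = mean_eigenvalue(theta)
    m_v = mean_eigenvalue(phi)
    m_uv = mean_eigenvalue([s + t for s, t in zip(theta, phi)])
    return defect(m_uv) <= defect(m_u) + defect(m_v) + tol


random.seed(0)
trials = [([random.uniform(0, 2 * math.pi) for _ in range(4)],
           [random.uniform(0, 2 * math.pi) for _ in range(4)])
          for _ in range(1000)]
print(all(check_inequality(theta, phi) for theta, phi in trials))
```

When V is a scalar matrix (all angles in phi equal), the two sides agree up to rounding, which matches the equality condition.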

We end this note by recalling the well-known fact that if A and B are close to 
the identity I in norm, then so is the product AB. (2) gives some information about 
the spectrum of the product; if the spectra of two unitary matrices are close to 1 
in the sense that the mean of the eigenvalues is near 1, then this “‘closeness”’ is 
essentially preserved under products. 


ACKNOWLEDGMENT. We thank the referee for helpful suggestions. 
An Elementary Proof of the Square 
Summability of the Discrete 
Hilbert Transform 


Loukas Grafakos 


We would like to give an elementary proof of Hilbert’s inequality 

\[ \left( \sum_{j \in \mathbb{Z}} \Big| \sum_{\substack{n \in \mathbb{Z} \\ n \ne j}} \frac{a_n}{j - n} \Big|^2 \right)^{1/2} \le \pi \left( \sum_{n \in \mathbb{Z}} a_n^2 \right)^{1/2} \tag{1} \]

where the a_n’s are real and square summable, and also prove that π cannot be 
replaced by any smaller number. 

Hilbert first proved a weaker version of inequality (1), where π was replaced by 
a larger constant. The original proof used trigonometric series and first appeared 
in Weyl’s [W] doctoral dissertation in 1908. Three years later, Schur [S] obtained a 
proof of (1), showing that π was the best possible constant. In his proof, he used a 
version of what we today refer to as Schur’s Lemma. This proof can be found in 
the book [HLP]. Many other proofs and generalizations have been given since 
then. 

The purpose of this note is to give an elementary proof of inequality (1). The 
proof uses convergence of sequences; remarkably, only one inequality, 2ab ≤ a² + 
b²; and the identity 

\[ \sum_{j \ne 0} \frac{1}{j^2} = \frac{\pi^2}{3}. \]
Before we present the proof, we would like to clarify a point about (1). If (a_n) is 
square summable, it isn’t automatic that the left-hand side of (1) converges. Part of 
the inequality is the statement that the left-hand side of (1) is finite whenever the 
right-hand side is. 

Assume first that (a_n) is compactly supported, i.e. a_n = 0 except for finitely 
many n. We show below that the left-hand side of (1) is finite and we prove the 
required inequality for such sequences. Expand the square of the left-hand side of 
(1). All indices m, n, j below run from −∞ to ∞ unless there is some restriction 
stated. We obtain 

\[ \sum_{j} \sum_{n \ne j} \sum_{m \ne j} a_n a_m \frac{1}{(j - n)(j - m)} = \sum_{n} \sum_{m} a_n a_m \sum_{j \ne n, m} \frac{1}{(j - n)(j - m)}. \tag{2} \]



Two out of the three sums above are over finite sets of indices and the interchange 
of summations is justified. The sum over all m = n in (2) is clearly equal to 

\[ \sum_{n} a_n^2 \sum_{j \ne n} \frac{1}{(j - n)^2} = \frac{\pi^2}{3} \sum_{n} a_n^2. \tag{3} \]


Assume below that m ≠ n. We calculate the sum over j in (2). We have 

\[ \sum_{j \ne m, n} \frac{1}{(j - n)(j - m)} = \frac{1}{m - n} \sum_{j \ne m, n} \left[ \frac{1}{j - m} - \frac{1}{j - n} \right] = \cdots = \frac{2}{(m - n)^2} \tag{4} \]

since the expression inside brackets above has limit 0 as k → ∞. Because of (4), the 
off-diagonal terms in (2) are exactly equal to: 

\[ \sum_{n} \sum_{m \ne n} a_m a_n \frac{2}{(m - n)^2}. \tag{5} \]

Using the inequality 2a_m a_n ≤ a_m² + a_n², we bound (5) by: 

\[ \sum_{n} \sum_{m \ne n} \frac{a_n^2}{(m - n)^2} + \sum_{m} \sum_{n \ne m} \frac{a_m^2}{(m - n)^2} = \frac{2\pi^2}{3} \sum_{n} a_n^2. \tag{6} \]


Combining (3) and the estimate (6) for (5), we obtain inequality (1) for compactly 
supported sequences. A simple limiting argument gives (1) for general square 
summable sequences. 
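
Both the value 2/(m − n)² of the inner sum over j and inequality (1) itself can be sanity-checked numerically for a compactly supported sequence. The following is a rough sketch under stated assumptions: the infinite sums over j are truncated to |j| ≤ J (so the computed left-hand side is only a lower bound for the true one), and the sample sequence and function names are invented for the illustration.

```python
import math


def hilbert_lhs(a, J):
    """Lower bound for the left-hand side of (1): the outer sum over j is
    truncated to |j| <= J. `a` maps an index n to a nonzero value a_n."""
    total = 0.0
    for j in range(-J, J + 1):
        inner = sum(v / (j - n) for n, v in a.items() if n != j)
        total += inner * inner
    return math.sqrt(total)


def kernel_sum(m, n, K):
    """Partial sum over |j| <= K, j != m, n, of 1/((j - n)(j - m))."""
    return sum(1.0 / ((j - n) * (j - m))
               for j in range(-K, K + 1) if j not in (m, n))


a = {-2: 1.0, 0: -3.0, 1: 2.0, 5: 0.5}  # a sample finitely supported sequence
rhs = math.pi * math.sqrt(sum(v * v for v in a.values()))
print(hilbert_lhs(a, 2000), rhs)        # first value should not exceed the second
print(kernel_sum(3, 0, 20000), 2 / 9)   # partial sum versus 2/(m - n)^2
```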

We now turn to the fact that π is the best possible constant. We define b_N to be 
(5) divided by Σ_n a_n², where (a_n) is the sequence 1 for |n| ≤ N and 0 otherwise. 
Estimate (6) shows that b_N ≤ 2π²/3. A simple calculation gives 


4N [N+1 ] 4(.N — 1) 
N= ING La “IN+1~ 
1 1 1 1 1 1 
peer pt et tio 
N-1 


Applying the squeeze law, we obtain that b_N tends to 2π²/3 as N → ∞. Using (3) 
and (5), we obtain that for this choice of (a_n), the ratio of the left-hand side of (1) 
and (Σ_n a_n²)^{1/2} converges to π as N → ∞. This proves that π is the best possible 
constant in (1), Q.E.D. 


Inequality (1) is known to be strict if (a_n) is nonzero. To see this for compactly 
supported nonzero sequences, observe that (6) is a strict bound for (5) since for 
some m and n, 2a_m a_n < a_m² + a_n². For general sequences, a further argument is 
needed since the passage to the limit will destroy the strict inequality. See [HLP] 
for details on this. 

To the best of my knowledge, the determination of the best constant for the l^p 
inequality, 1 < p ≠ 2 < ∞, remains unresolved. Pichorides [P] solves this problem 
for the corresponding continuous operator. 

We end this note by asking a question: Is there a constant C such that for all 
square summable sequences (a_n) and all bounded sequences (λ_n), the following 
inequality holds? 


9\ 1/2 
a, )\1/? 
r) EY Aue < Csuplal( > 2," | (7) 
jeEZ neZ jan J nEZ 
n#j 


If λ_n = 1 for all n, then (7) reduces to (1) and one can take C = π. 


REFERENCES 


[HLP] G. H. Hardy, J. E. Littlewood and G. Pólya, Inequalities, Cambridge University Press (1934). 

[P] S. Pichorides, On the Best Values of the Constants in the Theorems of M. Riesz, Zygmund and 
Kolmogorov, Studia Mathematica 46 (1972), 164-179. 

[S] I. Schur, Bemerkungen zur Theorie der beschränkten Bilinearformen mit unendlich vielen 
Veränderlichen, Journal f. Math. 140 (1911), 1-28. 

[W] H. Weyl, Singuläre Integralgleichungen mit besonderer Berücksichtigung des Fourierschen 
Integraltheorems, Doctoral Dissertation, University of Göttingen (1908). 


Department of Mathematics 

Washington University, Campus Box 1146 

One Brookings Drive, St Louis, MO 63130-4899 
grafakos@math.wustl.edu 


Answer to Picture Puzzle 
(p. 449) 


Henri Cartan, the son of Élie Cartan. 
THE EVOLUTION OF... 
Edited by Abe Shenitzer 
Mathematics, York University, North York, Ontario M3J 1P3, Canada 


How Hyperbolic Geometry Became 
Respectable 


Abe Shenitzer 


Many accounts of the evolution of hyperbolic geometry mention the names of 
Beltrami and Klein but give few details of their contributions in this area. The 
present paper focuses on some of these very details. The reader is assumed to have 
some familiarity with basic concepts of differential geometry. 


HISTORICAL INTRODUCTION. The discovery of hyperbolic geometry* led to 
the realization—at the end of the 19th century—that the truths of mathematics 
are relative rather than absolute, and to the resolution of the millennial doubts 
about Euclid’s parallel postulate. J. Coolidge describes its effect as follows: 


The point which I wish to insist on... is that it is to the doubts about Euclid’s parallel postulate, 
and efforts of such thinkers as Saccheri, Lobachevski, Bolyai, Beltrami, Riemann and Pasch to 
settle these doubts, that we owe the whole modern abstract conception of mathematical science. 
([1], p. 87.) 


Another way of assigning to the discovery of hyperbolic geometry its due place is to 
note that it was one of the two greatest geometric discoveries (or inventions, if you 
are not a Platonist) of the 19th century, the other being the discovery (by 
Riemann) of the concept of an n-dimensional manifold. Of course, while the first 
of these discoveries marked a profound intellectual discontinuity, the second did 
not; in fact, one can safely say that an n-dimensional manifold and its geometry 
are direct “descendants” of the notions of a surface and of Gauss’ notion of the 
intrinsic geometry of a surface. 

The story of the discovery of hyperbolic geometry can be conveniently divided 
into three parts. The first was largely negative in the sense that it consisted of 
doomed attempts to deduce the Euclidean parallel postulate from the other 
postulates of Euclidean geometry. The first of these attempts was presumably due 


*The terms “hyperbolic geometry” and “hyperbolic plane” refer to the system of plane geometry 
based on Euclid’s axioms with his parallel axiom replaced by the “hyperbolic parallel axiom” which 
asserts the existence of a line a and a point A not on a such that there are at least two lines passing 
through A that are parallel to a. An isometric replica of the hyperbolic plane is called a model of that 
plane. 
to Archimedes. Other such attempts were due to Arab mathematicians, and, more 
recently, to Saccheri (1733) and Legendre (1794). 

The second part of our story involved attempts in the first third of the 19th 
century to deduce basic logical consequences of the axioms of Euclidean geometry 
with its parallel axiom replaced by the hyperbolic parallel axiom. The mathemati- 
cians involved were Schweikart, Lambert, and, more importantly, Gauss, Bolyai, 
and Lobachevski. Gauss and Lobachevski considered the possibility that the 
geometry of the universe is hyperbolic rather than Euclidean. Bolyai’s chief 
concern was about the consistency of the new geometry. This is where matters 
stood in the 1830s. During the next 30 years the effect on mathematics of these 
investigations and insights was virtually nil. 

The third stage of the evolution of hyperbolic geometry began in the 1860s and 
ended in the 1880s. Here the key contributions—the first models of the hyperbolic 
plane—were due to Beltrami and, in part, to Klein. Their work provided a proof 
of the relative consistency of hyperbolic geometry and put it logically on a par with 
Euclidean geometry. 


TECHNICAL ACCOUNT. What follows is a description, in four parts, of the 
genesis and nature of the early models of the hyperbolic plane due to Beltrami 
and, in part, to Klein. The approach in the first part is based on an essay by A. M. 
Lopshits in volume II of [4]. 

(a) Beltrami studied mappings of surfaces in E³ into the Euclidean plane that 
preserve geodesics, that is, map geodesics to (straight) lines in the plane. 

A familiar example of such a mapping is the central projection of a sphere to 
the plane. We give a few highly relevant details of this mapping. 

Consider (FIGURE 1) the projection of the lower hemisphere onto the plane. If 
m(X, Y) is the plane image of M on the sphere, then we assign (X, Y) to M as its 
curvilinear coordinates. If the radius of the sphere is a, then its (Gaussian) 
curvature is 1/a² and its length element ds² turns out to be 

\[ ds^2 = \frac{(1 + KY^2)\,dX^2 + (1 + KX^2)\,dY^2 - 2KXY\,dX\,dY}{[1 + K(X^2 + Y^2)]^2}, \tag{1} \]

with K the (Gaussian) curvature. 


Figure 1 


In 1866 Beltrami showed, quite generally, that if a surface in E³ can be mapped 
in a one-one geodesic manner into the plane, then, after transfer of the coordinates 
(X, Y) of the image points to the points of the surface, its length element ds² 
can be reduced to the form (1) with K its constant curvature. Furthermore, the 
only surfaces that admit such mappings are surfaces with constant curvature. Since 
surfaces with the same ds² are locally isometric, it follows that a surface with 
constant curvature K > 0 is locally isometric to a sphere, and a surface with 
constant curvature K = 0 is locally isometric to a plane. The case of greatest 
interest to Beltrami was that of pseudospherical surfaces, that is, surfaces in E³ 
with constant curvature K < 0. Beltrami studied such surfaces in “Saggio...”, the 
first of the two papers he published in 1868. What follows is a brief summary of his 
findings. 

In (1) put $K = -1/k^2$, $X = kx$, and $Y = ky$. Then the length element of a 
pseudospherical surface takes the form 

\[
ds^2 = k^2\,\frac{(1 - y^2)\,dx^2 + (1 - x^2)\,dy^2 + 2xy\,dx\,dy}{(1 - x^2 - y^2)^2} \tag{2}
\]

where $k$ is a positive constant. Since the necessarily positive discriminant of the 
form (2) is equal to 

\[
\frac{k^4}{(1 - x^2 - y^2)^3}, \tag{3}
\]

it follows that 

\[
x^2 + y^2 < 1.
\]
This means that a geodesic mapping of a pseudospherical surface into a Euclidean 
plane carries each point of the surface to a point of the unit disk (see Figure 2). 

Figure 2 

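As a quick numerical sanity check on (3), the following Python sketch evaluates the discriminant $EG - F^2$ of the form (2) at a few points of the unit disk and compares it with $k^4/(1 - x^2 - y^2)^3$. The function names, the value of $k$, and the sample points are illustrative assumptions, not part of the original argument.

```python
import math

def form2_coefficients(x, y, k):
    """Coefficients E, F, G of ds^2 = E dx^2 + 2F dx dy + G dy^2 for form (2)."""
    w = 1.0 - x * x - y * y
    scale = k * k / (w * w)
    E = scale * (1.0 - y * y)
    F = scale * (x * y)          # the cross term in (2) is 2xy dx dy
    G = scale * (1.0 - x * x)
    return E, F, G

def discriminant(x, y, k):
    """Discriminant EG - F^2 of the quadratic form (2)."""
    E, F, G = form2_coefficients(x, y, k)
    return E * G - F * F

def expression3(x, y, k):
    """Right-hand side of (3): k^4 / (1 - x^2 - y^2)^3."""
    return k ** 4 / (1.0 - x * x - y * y) ** 3

if __name__ == "__main__":
    k = 2.0  # arbitrary positive constant chosen for the check
    for (x, y) in [(0.0, 0.0), (0.3, 0.4), (-0.6, 0.2), (0.1, -0.9)]:
        print((x, y), discriminant(x, y, k), expression3(x, y, k))
```

Outside the unit disk the quantity in (3) becomes negative, which is why positivity of the discriminant forces $x^2 + y^2 < 1$.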

Now there is a 1-1 geodesic mapping of the hyperbolic plane onto the unit disk 
(see [2]; also see Figure 3, where the “dish” represents a unit disk that is part of a 
horosphere tangent to a hyperbolic plane, and the curved lines represent hyperbolic 
straight lines belonging to the mapping pencil of parallel lines). If (as in the case of 
the stereographic projection of the hemisphere) we transfer the coordinates $(x, y)$ 
of a point in the unit disk to its preimage in the hyperbolic plane, then the length 
element $ds^2$ of the latter takes the form (2). The sameness of the length elements of 
the pseudospherical surface and of the hyperbolic plane proves that these two surfaces 
are locally isometric. 

Figure 3 

Beltrami’s realization of the geometry of (part of) the hyperbolic plane as the 
intrinsic geometry of a pseudospherical surface had a tremendous impact on his 
contemporaries. It was seen as changing what was thought to be a figment of the 
imagination into a mathematical fact. 

If a pseudospherical surface is to be an isometric replica of the full hyperbolic 
plane, then it must have the basic property that the geodesics through any of its 
points are infinite in both directions. This turns out to be equivalent to the 
assumption that there exists a pseudospherical surface that admits a one-one 
geodesic mapping onto the unit disk. Beltrami thought that this is a defensible 
assumption. It is not, at least for smooth surfaces in $E^3$. In 1902 Hilbert showed 
that there is no such surface in $E^3$ that is an isometric replica of the hyperbolic 
plane. 

(β) While the main emphasis in Beltrami’s “Saggio...” is on showing that a 
pseudospherical surface is locally isometric to the hyperbolic plane, it also points 
to the possibility of turning the unit disk into a model of the hyperbolic plane by 
exploiting various features of the geodesic mapping linking it to a pseudospherical 
surface. Hence the term “the Beltrami disk model.” The relevant definitions are: a 
hyperbolic (straight) line is a chord of the unit disk, hyperbolic parallel lines are 
chords that meet at a point of the unit circle, and hyperbolic diverging lines are 
chords that don’t meet in the closed unit disk. The source of quantitative relations 
is the length element $ds^2$ given by equation (2). (Thus the length of a curve 
$x = x(t)$, $y = y(t)$ is $\int ds$, with suitable limits in the integral.) The key missing 
element in Beltrami’s disk model is the definition of a motion. 

(γ) In 1868 Dedekind published Riemann’s inaugural lecture of 1854. After 
reading it, Beltrami published the second of his 1868 papers in which he once 
more obtained his earlier disk model as well as three conformal models of the 
hyperbolic plane. The following description of these models is taken from the 
introduction in J. Stillwell’s translation of the second of Beltrami’s 1868 papers. 

The starting point is a hemisphere model consisting of the open hemisphere 

\[
x^2 + x_1^2 + x_2^2 = a^2, \qquad x > 0 \tag{4}
\]

in 3-dimensional $(x_1, x_2, x)$-space, with the line element 

\[
ds = R\,\frac{\sqrt{dx^2 + dx_1^2 + dx_2^2}}{x}. \tag{5}
\]

The geodesics for this metric are vertical sections of the hemisphere. Perpendicular 
projection of this model onto the plane $x = 0$, which we take to be horizontal 
(Figure 4), yields Beltrami’s earlier disk model. 
The third model is obtained by stereographic projection of the hemisphere onto 
its horizontal tangent plane (Figure 5), and the line element is computed to be 

\[
ds = \frac{R}{a}\,\frac{\sqrt{d\xi_1^2 + d\xi_2^2}}{1 - \dfrac{1}{4a^2}(\xi_1^2 + \xi_2^2)}, \tag{6}
\]

a metric which was stated by Riemann (1854) to be of constant curvature. We 
observe that (6) is a multiple of the Euclidean line element $\sqrt{d\xi_1^2 + d\xi_2^2}$ in the 
$(\xi_1, \xi_2)$-plane; the multiple of course varies with position but not with direction, 
hence angles are preserved. Thus we have the conformal, or “Poincaré disk” 
model, 14 years before its appearance in Poincaré (1882). 

Figure 4 

Figure 5 

Figure 6 

The fourth model is obtained by stereographic projection of the hemisphere 
onto (the upper half of) a vertical plane (Figure 6). This gives the line element 

\[
ds = R\,\frac{\sqrt{d\eta_1^2 + d\eta^2}}{\eta} \tag{7}
\]

for the “Poincaré half plane.” 

(δ) In 1871 Klein used the unit disk and its chords to create his so-called 
projective model of the hyperbolic plane. His points, lines, parallel lines, and 
diverging lines were the same as Beltrami’s, but he based his definitions of 
distance, angle, and motion on the ideas of Cayley. He defined the distance 
between points $M_1$ and $M_2$ (see Figure 7) as, essentially, the logarithm of the 
cross ratio $\{P_1P_2, M_1M_2\}$, 
defined the angle between two lines by, essentially, a projective dual of his 
definition of distance, and defined a motion of his model as a projective transformation 
of the plane of the disk that mapped its boundary onto itself. Klein’s 
version of the unit-disk model of the hyperbolic plane was the first instance of a 
geometry in the sense of his famous Erlangen Program of 1872. 

Figure 7 


In sum we can say that the works of Beltrami and Klein that appeared between 
1866 and 1871 put hyperbolic geometry on a par with Euclidean geometry. From 
then on it became ever clearer that the truths of mathematics are relative rather than 
absolute. 

The Spirit is Willing but the Ham is Rotten 
John Kinloch and Rick Norwood 


What is wrong with the following “proof” of the Ham Sandwich 
Theorem? 


Theorem: Given any sandwich composed of bread, ham, and 
cheese, there is a single plane which cuts the sandwich into two parts, 
such that the two parts contain equal amounts of bread, equal 
amounts of ham, and equal amounts of cheese. 


Proof: Consider the bread. It has a center of mass. Call this point 
p. Let q be the center of mass of the ham. Let r be the center of mass 
of the cheese. There is a plane containing p, q, and r. This plane 
divides the sandwich into two parts containing equal amounts of 
bread, equal amounts of ham, and equal amounts of cheese. 


The theorem is true. The “proof” is fallacious. Why? 

PROBLEMS AND SOLUTIONS 


Edited by: 
Richard T. Bumby, Fred Kochman and Douglas B. West 


Proposed problems should be sent to the MONTHLY PROBLEMS address given on 
the inside front cover. Please include solutions, relevant references, etc. Three copies 
are requested. 


Solutions of published problems should arrive before October 31, 1994 at the 
MONTHLY PROBLEMS address given on the inside front cover. Solutions should be 
typed with double spacing, including the problem number and the solver’s name and 
mailing address. Two copies suffice. A self-addressed postcard or label should be 
included if an acknowledgment is desired. 


An asterisk (*) after the number of a problem, or part of a problem, indicates that 
no solution is currently available. Partial solutions will be useful in such cases. 
Otherwise, the published solution is likely to be based on a solution which is complete 
and correct. Of course, an elegant partial solution or a method leading to a more 
general result is always useful and welcome. In addition, references to other 
appearances of MONTHLY problems or to solutions of these problems in the 
literature are also solicited. 


PROBLEMS 


10382. Proposed by Richard K. Guy, University of Calgary, Calgary, Alberta, 
Canada. 


Which integers are represented by 

\[
\frac{(x + y + z)^2}{xyz}
\]


where x, y, and z are positive integers? 
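For readers who want to experiment before attempting a proof, here is a small brute-force Python sketch. It reads the displayed expression as $(x + y + z)^2/(xyz)$ and lists the integer values that occur for $1 \le x \le y \le z \le$ `bound`. The function name and the search bound are illustrative assumptions, and a finite search of this kind only suggests an answer; it proves nothing about larger triples.

```python
def represented_values(bound):
    """Map each integer n = (x+y+z)^2 / (x*y*z) found with
    1 <= x <= y <= z <= bound to the first triple that produces it."""
    found = {}
    for x in range(1, bound + 1):
        for y in range(x, bound + 1):
            for z in range(y, bound + 1):
                numerator = (x + y + z) ** 2
                denominator = x * y * z
                if numerator % denominator == 0:
                    # keep only the first (lexicographically smallest) witness
                    found.setdefault(numerator // denominator, (x, y, z))
    return found

if __name__ == "__main__":
    for n, triple in sorted(represented_values(30).items()):
        print(n, triple)
```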


10383. Proposed by Kevin Ford (student), University of Illinois, Urbana, IL. 


Let $B_1, B_2, \ldots, B_s$ denote subsets of a finite set $B$, and let $\lambda_i = \#(B_i)/\#(B)$ 
and $\lambda = \lambda_1 + \cdots + \lambda_s$. Show that, for every integer $t$ satisfying $1 \le t \le \lambda$, there 
exist $r_1, r_2, \ldots, r_t$ with $r_1 < r_2 < \cdots < r_t$ and 

\[
\#(B_{r_1} \cap B_{r_2} \cap \cdots \cap B_{r_t}) \ge (\lambda - t + 1)\binom{s}{t}^{-1}\#(B).
\]

10384. Proposed by Franklin Kemp, East Texas State University, Commerce, TX. 


Suppose $x_1 < x_2 < \cdots < x_n$ and $y_1 < y_2 < \cdots < y_n$. Define the correlation 
coefficient $r$ in the usual way: 

\[
r = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum (x_i - \bar{x})^2}\,\sqrt{\sum (y_i - \bar{y})^2}},
\]

where $\bar{x}$ and $\bar{y}$ are the average values of the $x_i$ and $y_i$, respectively, and the sums 
run from 1 to $n$. Show that $r \ge 1/(n - 1)$. 


10385. Proposed by Nandor Sieben, Arizona State University, Tempe, AZ. 


Let X be a topological space. It is easy to see that if X is a Hausdorff space, 
then fixed-point sets are closed. That is, for any continuous function $f: X \to X$, the 
set $F = \{x \in X : f(x) = x\}$ is closed. Is the converse true? More precisely, if X has 
the property that all fixed-point sets are closed, must X be a Hausdorff space? 


10386. Proposed by Jordan Tabov, Bulgarian Academy of Sciences, Sofia, Bulgaria. 


Let a tetrahedron with vertices $A_1, A_2, A_3, A_4$ have altitudes that meet in a 
point $H$. For any point $P$, let $P_1, P_2, P_3$ and $P_4$ be the feet of the perpendiculars 
from $P$ to the faces $A_2A_3A_4$, $A_3A_4A_1$, $A_4A_1A_2$ and $A_1A_2A_3$, respectively. 
Prove that there exist constants $a_1, a_2, a_3$ and $a_4$ such that one has 

\[
a_1\overrightarrow{PP_1} + a_2\overrightarrow{PP_2} + a_3\overrightarrow{PP_3} + a_4\overrightarrow{PP_4} = \overrightarrow{PH}
\]


for all points P. 


10387*. Proposed by Stanley Rabinowitz, Westford, MA, and Peter J. Costa, Univer- 
sity of St. Thomas, St. Paul, MN. 


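Let $T_n = (t_{i,j})$ be the $n$ by $n$ matrix with $t_{i,j} = \tan(i + j - 1)x$, i.e., 

\[
T_n = \begin{bmatrix}
\tan x & \tan 2x & \tan 3x & \cdots & \tan nx \\
\tan 2x & \tan 3x & \tan 4x & \cdots & \tan(n+1)x \\
\tan 3x & \tan 4x & \tan 5x & \cdots & \tan(n+2)x \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
\tan nx & \tan(n+1)x & \tan(n+2)x & \cdots & \tan(2n-1)x
\end{bmatrix}.
\]

Computer experiments suggest that 

\[
\det(T_n) = (-1)^{\lfloor n/2 \rfloor} \sec^n nx \prod_{r=1}^{n-1} \bigl(\sin^2(n-r)x \,\sec rx \,\sec(2n-r)x\bigr)^r
\times \begin{cases} \sin n^2x & \text{if $n$ odd,} \\ \cos n^2x & \text{if $n$ even.} \end{cases}
\]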


Prove or disprove this conjecture. 
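In the spirit of the computer experiments mentioned in the problem, the following Python sketch compares a floating-point determinant of $T_n$ with the conjectured closed form, read here as $(-1)^{\lfloor n/2 \rfloor} \sec^n nx \prod_{r=1}^{n-1} (\sin^2(n-r)x \, \sec rx \, \sec(2n-r)x)^r$ times $\sin n^2x$ ($n$ odd) or $\cos n^2x$ ($n$ even). That reading of the formula, the helper names, and the sample value $x = 1$ are assumptions made for illustration; agreement at a few points is evidence, not a proof.

```python
import math

def hankel_tan(n, x):
    """The n-by-n matrix with (i, j) entry tan((i + j - 1) x), 1-based indices."""
    return [[math.tan((i + j + 1) * x) for j in range(n)] for i in range(n)]

def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if a[pivot][col] == 0.0:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det

def conjectured_det(n, x):
    """Closed form as read from the problem statement (an assumption)."""
    value = (-1) ** (n // 2) / math.cos(n * x) ** n
    for r in range(1, n):
        factor = math.sin((n - r) * x) ** 2 / (
            math.cos(r * x) * math.cos((2 * n - r) * x)
        )
        value *= factor ** r
    value *= math.sin(n * n * x) if n % 2 == 1 else math.cos(n * n * x)
    return value

if __name__ == "__main__":
    x = 1.0  # sample point; avoid x where some cos(kx) vanishes
    for n in range(1, 6):
        print(n, determinant(hankel_tan(n, x)), conjectured_det(n, x))
```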


10388. Proposed by E. Sparre Andersen and Mogens Esrom Larsen, Københavns 
Universitet, København, Denmark. 


Find 
Cn k 
n\{| ——_--=tTD 
= 2p 


where n and p are positive integers. 


NOTES 


Notes (10386) Although the altitudes of any triangle are concurrent, only special 
tetrahedra will have altitudes that meet in a point, as hypothesized in this problem. 
A characterization of such tetrahedra might be a useful supplement to the stated 
problem. Also note that the equation to be proved is a vector equation. (10387) 
The determinant of $T_n$ tests whether the sequence $(\tan nx)$ satisfies a linear 
recurrence. The conjectured value of $\det(T_n)$ has been verified symbolically for 
$n \le 5$, and tested numerically for several values of $x$ for $6 \le n \le 20$. 


SOLUTIONS 


Some Integrals With a Common Value 


10241 [1992, 675]. Proposed by Roger W. Johnson, Carleton College, Northfield, 
MN. 


Let $m$ and $n$ be positive integers with $m \ge n$. Show that 

\[
\int_0^\infty \left(\frac{\sin x}{x}\right)^n \frac{\sin mx}{x}\,dx = \frac{\pi}{2}.
\]


Solution I by C. Georghiou, University of Patras, Patras, Greece. Let $I(m, n)$ 
denote the given integral. Then an integration by parts gives 

\[
\begin{aligned}
\int_0^\infty \frac{\sin^n x \,\sin(mx)}{x^{n+1}}\,dx
&= -\frac{1}{n}\int_0^\infty (\sin x)^n \sin(mx)\, d\!\left(\frac{1}{x^n}\right) \\
&= -\left.\frac{\sin^n x \,\sin(mx)}{n x^n}\right|_0^\infty
 + \frac{1}{n}\int_0^\infty \frac{n \sin^{n-1} x \cos x \sin(mx) + m \sin^n x \cos(mx)}{x^n}\,dx,
\end{aligned}
\]

from which (the boundary term vanishes, and the products $\cos x \sin(mx)$ and $\sin x \cos(mx)$ 
are sums and differences of $\sin(m+1)x$ and $\sin(m-1)x$) we get for $n \ge 1$ 

\[
2n\,I(m, n) = (n + m)\,I(m + 1, n - 1) + (n - m)\,I(m - 1, n - 1). \tag{1}
\]

Note also that $I(0, n) = 0$ for $n \ge 0$ and $I(m, 0) = \pi/2$ for $m \ge 1$ ($I(m, 0)$ is 
essentially the standard Dirichlet integral). 

The result follows by an easy induction using (1). Indeed, for $n = 1$ and $m \ge 1$, 
(1) gives $I(m, 1) = \pi/2$. Suppose that for $n = k$ ($k \ge 1$) and $m \ge k$ we 
have $I(m, k) = \pi/2$. Then it follows from (1) that for $m \ge k + 1$, $I(m, k + 1) = 
\pi/2$, and the induction step is completed. 
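The induction can also be run mechanically. The Python sketch below applies recurrence (1) with the stated boundary values, working with the ratio $I(m, n)/(\pi/2)$ in exact rational arithmetic; the function name `ratio` is a hypothetical choice for this illustration.

```python
from fractions import Fraction
from functools import lru_cache

@lru_cache(maxsize=None)
def ratio(m, n):
    """I(m, n) divided by pi/2, obtained from recurrence (1)."""
    if m == 0:
        return Fraction(0)   # I(0, n) = 0
    if n == 0:
        return Fraction(1)   # I(m, 0) = pi/2 for m >= 1
    # 2n I(m, n) = (n + m) I(m + 1, n - 1) + (n - m) I(m - 1, n - 1)
    return (
        (n + m) * ratio(m + 1, n - 1) + (n - m) * ratio(m - 1, n - 1)
    ) / Fraction(2 * n)

if __name__ == "__main__":
    for n in range(1, 6):
        print(n, [ratio(m, n) for m in range(n, n + 4)])
```

Because the two coefficients on the right of (1) add up to $2n$, every value with $m \ge n$ comes out equal to 1, which is the claim $I(m, n) = \pi/2$. For $m < n$ the same recurrence gives other rationals; for instance `ratio(1, 2)` is 3/4, matching the classical value $\int_0^\infty (\sin x/x)^3\,dx = 3\pi/8$.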


Solution II by Donald A. Darling, Newport Beach, CA. Let $S_n = X_1 + 
X_2 + \cdots + X_n$ be the sum of $n$ independent random variables each uniformly 
distributed over the interval $[-1, 1]$. Each $X_i$ has the characteristic function 
$E(e^{itX_i}) = \sin t/t$ and since $S_n$ has a continuous distribution function the inversion 
formula yields 

\[
\begin{aligned}
P(|S_n| < m) &= \frac{1}{2\pi}\int_{-\infty}^{\infty}\left(\frac{\sin t}{t}\right)^n \int_{-m}^{m} e^{-it\xi}\,d\xi\,dt \\
&= \frac{1}{\pi}\int_{-\infty}^{\infty}\left(\frac{\sin t}{t}\right)^n \left(\frac{\sin mt}{t}\right) dt \\
&= \frac{2}{\pi}\int_{0}^{\infty}\left(\frac{\sin t}{t}\right)^n \left(\frac{\sin mt}{t}\right) dt.
\end{aligned}
\]

If $m \ge n$ this probability is 1, yielding the required expression. It is not necessary 
that $m$ be an integer. 


Editorial comment. Readers employed a variety of other methods including 
complex contours and self-contained Fourier arguments along the lines of Solution 
II. The integral can also be found as item number 3.836.1 in I. S. Gradshteyn and 
I. M. Ryzhik, Tables of Integrals, Series and Products, prepared by A. Jeffrey, 
Academic Press, 1980. Many readers noted that m need not be an integer. 


Solved also by F. Alouges & R. Cerf (France), K. F. Andersen (Canada), J. Andraos (Canada), J. 
Anglesio (France), P. R. Arner & T. R. Falconieri, S.-J. Bang (Korea), R. J. Bass & S. Byrd, K. L. 
Bernstein, D. Borwein & P. Borwein (Canada), P. Bracken (Canada), J. L. Brown Jr., R. J. Chapman 
(U. K.), Y. Diao, R. Drnovšek (student, Slovenia), J. Elstrodt (Germany), M. Golomb, J. Hatcher, M. 
Hoffman, G. L. Isaacs, F.-A. Izadi (Iran), A. S. Izotov (Russia), I. Kastanas, M. S. Klamkin (Canada), 
O. P. Lossers (The Netherlands), T. L. McCoy, K. McInturff, J. Milcetich, A. Pedersen (Denmark), F. 
W. Steutel (The Netherlands), D. Tan, N. S. Thornber, M. Vowe (Switzerland), J. Wimp, and the 
University of Wyoming Problem Circle. 


The Maximal Miquel Ratio 


10244 [1992, 675]. Proposed by Ken Bromberg (student), Brown University and Stan 
Wagon, The Geometry Center, Minneapolis, MN and Macalester College, St. Paul, 
MN. 


A classical construction of Miquel starts with an n-vertex polygon and a point P 
in the plane (not a vertex of the n-gon), and forms another n-gon as follows: 

1. draw the perpendiculars from P to the (extended) sides of the polygon; 

2. connect the feet to obtain another n-gon. 

These steps are then repeated n times (provided that none of the polygons has 
P as a vertex). The resulting polygon, denoted M(P), is similar to the initial n-gon. 

(a) Given a triangle, construct the point P for which M(P) is largest. 

(b)* Given a quadrilateral, is there a Euclidean construction of the point P for 
which M(P) is largest? 

Solution of (a) by the proposers. The solution uses the notation of Figure 10244. 

Start with $\triangle ABC$ and assume $A \ge B \ge C$, where $A$ denotes the triangle’s 
angle at $A$, etc. (Angles are measured so that they lie in the interval $(0, \pi)$.) Then 
the desired point is the excenter of the triangle opposite $A$, that is, the intersection 
of the angle bisector of $A$ with the external angle bisectors at $B$ and $C$. To prove 
this, we first show that the similarity constant between $M(P)$ and the initial 
triangle is $\sin\alpha_1 \sin\alpha_2 \sin\alpha_3$, where $\alpha_i$ (and $\beta_i$) are the angles in Figure 10244. 
Consider the illustrated case, where $P$ is inside $\triangle ABC$. Let $A_1B_1C_1$, $A_2B_2C_2$, 
$A_3B_3C_3 = M(P)$ be the triangles produced by the Miquel construction. Then 
$\sin\alpha_1 = PA_1/PA$. Further, because of the cyclic quadrilateral $PA_1BB_1$, $\angle PA_1B_1 = 
\angle PBB_1 = \alpha_2$ and so $\sin\alpha_2 = PA_2/PA_1$. Similarly $\sin\alpha_3 = PA_3/PA_2$. Therefore 
$\sin\alpha_1 \sin\alpha_2 \sin\alpha_3 = PA_3/PA$, which is the desired similarity constant. A virtually 
identical argument works when $P$ is outside the triangle. (Incidentally, this 
permutation of angles proves that $M(P)$ is similar to $\triangle ABC$, for it yields that the 
angles of $A_3B_3C_3$ are $\alpha_1 + \beta_1$, $\alpha_2 + \beta_2$, $\alpha_3 + \beta_3$.) 

Figure 10244 


Now, because $\sin\alpha_1 \sin\alpha_2 \sin\alpha_3 = \sin\beta_1 \sin\beta_2 \sin\beta_3$ (easily proved; it also 
follows from the preceding argument by symmetry, since $\sin\beta_1 \sin\beta_2 \sin\beta_3$ equals 
the same similarity constant), it suffices to maximize $f(P) = 
\sin\alpha_1 \sin\beta_1 \sin\alpha_2 \sin\beta_2 \sin\alpha_3 \sin\beta_3$. But because one of $\alpha_i \pm \beta_i$ is constant (the 
two cases depend on whether $P$ lies within the interior or exterior angle at the 
vertex), easy calculus shows that the maximum value of $\sin\alpha_i \sin\beta_i$ occurs when 
$\alpha_i = \beta_i$ or $\alpha_i = \pi - \beta_i$, that is, when $P$ is on one of the internal or external 
bisectors of the angle at the vertex. So, at vertex $B$ for example, the maximum 
value of $\sin\alpha_2 \sin\beta_2$ is one of $\sin^2(B/2)$ or $\sin^2((\pi - B)/2)$ (which equals $\cos^2(B/2)$); 
because $B$ is acute, the maximum is the latter. Similarly, the largest that 
$\sin\alpha_3 \sin\beta_3$ can be is $\cos^2(C/2)$. 

Now, the value of $f(P)$ at the excenter opposite $A$ is 

\[
\sin^2(A/2)\cos^2(B/2)\cos^2(C/2).
\]

If $A$ is obtuse we are clearly done, because $\sin^2(A/2)$ is then the maximum of 
$\sin\alpha_1 \sin\beta_1$, so $f(P)$ is certainly the largest it could possibly be. If $A$ is acute, 
however, then $\sin\alpha_1 \sin\beta_1$ can be larger than $\sin^2(A/2)$: if $P$ is external to the 
angle at $A$ then this value could be as great as $\cos^2(A/2)$. But if $P$ is external to $A$ 
then it can be external to at most one of $B$ and $C$. It follows that for such 
$P$, $f(P) \le \cos^2(A/2)\cos^2(B/2)\sin^2(C/2)$. But this last is less than the value of $f$ at 
the excenter opposite $A$, for that equals $\sin^2(A/2)\cos^2(B/2)\cos^2(C/2)$. Thus this 
excenter maximizes $f(P)$, as claimed. This maximum value is not attained at any 
other point, since each of the three factors $\sin\alpha_i \sin\beta_i$ is maximized on a bisector 
of an angle or external angle of the triangle. 


Addendum. The maximum Miquel ratio, $\sin(A/2)\cos(B/2)\cos(C/2)$, is equal to 
s(s — b)(s — c)/(abc), where s is the semiperimeter of the triangle and a, b, c are 
the side-lengths; this is a consequence of some well-known formulas (see Z. A. 
Melzak, Invitation to Geometry, Wiley, 1983). It follows that the maximum Miquel 
ratio is between 3/8 and 1, with the lower bound occurring when the initial 
triangle is equilateral. 
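The identity in the Addendum is easy to spot-check numerically. The Python sketch below computes the half-angle product from the side lengths via the law of cosines and compares it with $s(s - b)(s - c)/(abc)$; the function names and the sample triangles are illustrative assumptions.

```python
import math

def half_angle_product(a, b, c):
    """sin(A/2) cos(B/2) cos(C/2), with A, B, C the angles opposite sides a, b, c."""
    A = math.acos((b * b + c * c - a * a) / (2 * b * c))
    B = math.acos((a * a + c * c - b * b) / (2 * a * c))
    C = math.acos((a * a + b * b - c * c) / (2 * a * b))
    return math.sin(A / 2) * math.cos(B / 2) * math.cos(C / 2)

def side_length_formula(a, b, c):
    """s (s - b)(s - c) / (a b c), where s is the semiperimeter."""
    s = (a + b + c) / 2
    return s * (s - b) * (s - c) / (a * b * c)

if __name__ == "__main__":
    # a is taken as the longest side, so A is the largest angle
    for sides in [(1.0, 1.0, 1.0), (5.0, 4.0, 3.0), (7.0, 6.0, 2.5)]:
        print(sides, half_angle_product(*sides), side_length_formula(*sides))
```

For the equilateral triangle both expressions give 3/8, the lower bound quoted above, and for the 3-4-5 right triangle both give 0.6.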


Editorial comment. As the solution shows, the product $\sin\alpha_1 \sin\alpha_2 \sin\alpha_3$ is 
equal to $(r_1r_2r_3)/(R_1R_2R_3)$, where the $R_i$ are the distances from $P$ to the vertices 
of $\triangle ABC$ and the $r_i$ are the distances to the sides. It is of interest that the 
maximum of $(r_1r_2r_3)/(R_1R_2R_3)$ is known to be $r/(4R)$, where $r$ is the inradius 
and $R$ the circumradius, in case $P$ is constrained to the interior of $\triangle ABC$ (see D. 
S. Mitrinović, J. E. Pečarić and V. Volenec, Recent Advances in Geometric 
Inequalities, Kluwer, 1989). The proposers have shown that the maximum is $r_a/(4R)$ 
when there is no constraint on $P$, where $r_a$ is the exradius opposite the largest 
angle $A$. Erdős has shown that $(3/2)R \le \max\{r_a, r_b, r_c\}$ (see O. Bottema et al., 
Geometric Inequalities, Wolters-Noordhoff, 1989), giving another way to see that the 
maximum Miquel ratio is at least 3/8. 


No other correct solutions were received. 


Surface Area of a Rectangular Parallelepiped 

10256 [1992, 872]. Proposed by Murray S. Klamkin, University of Alberta, Edmonton, 
Alberta, Canada. 

Let $A_i$, $A_i'$ ($i = 1, 2, 3, 4$) be the vertices of a rectangular parallelepiped $\Pi$, with 
$A_i'$ diametrically opposite to $A_i$. Let $P$ be any interior point of $\Pi$. Prove that 

\[
S \le 2(PA_1 \cdot PA_1' + PA_2 \cdot PA_2' + PA_3 \cdot PA_3' + PA_4 \cdot PA_4')
\]

where $S$ denotes the surface area of $\Pi$. 


Solution by Robin J. Chapman, University of Exeter, U. K. Choose Cartesian 
coordinates with $P$ at the origin, and suppose that the faces of $\Pi$ lie in the planes 
$x = -a$, $x = A$, $y = -b$, $y = B$, $z = -c$ and $z = C$. The inequality becomes 

\[
\begin{aligned}
&\sqrt{a^2+b^2+c^2}\,\sqrt{A^2+B^2+C^2} + \sqrt{a^2+b^2+C^2}\,\sqrt{A^2+B^2+c^2} \\
&\quad + \sqrt{a^2+B^2+c^2}\,\sqrt{A^2+b^2+C^2} + \sqrt{A^2+b^2+c^2}\,\sqrt{a^2+B^2+C^2} \\
&\qquad \ge AB + Ab + aB + ab + CA + Ca + cA + ca + BC + Bc + bC + bc.
\end{aligned}
\]

Now by the Cauchy-Schwarz inequality 

\[
\sqrt{a^2+b^2+c^2}\,\sqrt{A^2+B^2+C^2} \ge aB + bC + cA,
\]

and also 

\[
\sqrt{a^2+b^2+c^2}\,\sqrt{A^2+B^2+C^2} \ge Ab + Bc + Ca.
\]

Hence 

\[
\sqrt{a^2+b^2+c^2}\,\sqrt{A^2+B^2+C^2} \ge (aB + bC + cA + Ab + Bc + Ca)/2
\]

and by adding the similar inequalities obtained from the other terms on the left 
hand side of the main inequality, we get the main inequality. 
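A randomized spot-check of the inequality, using the same coordinates (the point $P$ at the origin and faces in the planes $x = -a$, $x = A$, and so on), can be sketched in Python as follows. The function names, the random ranges, and the tolerance are illustrative assumptions; the check is a sanity test and no substitute for the Cauchy-Schwarz argument.

```python
import math
import random

def surface_area(a, b, c, A, B, C):
    """Surface area of the box with faces x=-a, x=A, y=-b, y=B, z=-c, z=C."""
    lx, ly, lz = A + a, B + b, C + c
    return 2 * (lx * ly + ly * lz + lz * lx)

def twice_sum_of_products(a, b, c, A, B, C):
    """2 * sum over the four pairs of opposite vertices of |PV| * |PV'|, P = origin.

    Summing over all eight vertices counts every opposite pair twice,
    which supplies the factor 2.
    """
    total = 0.0
    for x in (-a, A):
        for y in (-b, B):
            for z in (-c, C):
                # the diametrically opposite vertex swaps each coordinate
                ox, oy, oz = A - a - x, B - b - y, C - c - z
                total += math.hypot(x, y, z) * math.hypot(ox, oy, oz)
    return total

if __name__ == "__main__":
    rng = random.Random(0)
    for _ in range(5):
        dims = [rng.uniform(0.1, 5.0) for _ in range(6)]
        print(surface_area(*dims), twice_sum_of_products(*dims))
```

At the center of a cube (all six parameters equal) the two sides coincide, so the inequality cannot be made strict in general.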


Editorial comment. No other solver used coordinates based at P to simplify the 
formulas. Also, note that it is not necessary to require P to be an interior point. 
The interpretation as surface area is possible whenever A + a, B + b, and C + c 
are all positive, and this choice can be made for any P. One reader noted that the 
result is false if one reads the terms $PA_i \cdot PA_i'$ as inner products of vectors. In this 
interpretation, the sum of inner products is negative whenever P is an interior 
point. We apologize for not noticing that this confusion was possible. 


Solved also by D. Carlson, J. Fukuta (Japan), H. Kappus (Switzerland), I. Kastanas, R. A. Simon 
(Chile), and the proposer. One incorrect solution was received. 


Familiar Inequalities in Disguise 
10257 [1992, 872]. Proposed by José A. Cavanati, Centro de Investigación en 
Matemáticas, A. C., Guanajuato, Mexico. 

Let $a, b$ be positive real numbers, $\alpha$ a nonnegative real number, and $m$ a 
positive integer with $\alpha \le m$. Show that 

\[
0 \le \frac{(a+b)^{\alpha}}{(a^m + b^m)^{\alpha/m}} - 1 \le \frac{\alpha}{m}\left[\frac{(a+b)^m}{a^m + b^m} - 1\right].
\]

Solution by Eugene A. Herman, Grinnell College, Grinnell, IA. If we let $\beta = \alpha/m$, 
we can write the desired inequalities in the more revealing form 

\[
0 \le \left(\frac{(a+b)^m}{a^m+b^m}\right)^{\beta} - 1 \le \beta\left(\frac{(a+b)^m}{a^m+b^m} - 1\right).
\]

These are immediate consequences of the two elementary inequalities: 

\[
\begin{aligned}
a^m + b^m &\le (a+b)^m && \text{whenever } m \ge 1,\ a, b > 0; \\
x^{\beta} - 1 &\le \beta(x - 1) && \text{whenever } x \ge 1,\ 0 \le \beta \le 1.
\end{aligned}
\]

Note also that $m$ need not be an integer; we require only that $m \ge 1$. 


Solved by 47 readers and the proposer. 


Collaborating editors: David F. Appleyard, Paul T. Bateman, Duane M. Broline, 
Barry W. Brunson, Frank S. Cater, Gulbank D. Chakerian, Underwood Dudley, 
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. 
Griggs, Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, 
Daniel J. Kleitman, Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer, 
Stephen L. Portnoy, J. O. Shallit, John Henry Steelman, Kenneth B. Stolarsky, 
David E. Tepper, Douglas B. Tyler, Daniel Ullman, and William E. Watkins. 

REVIEWS 


Edited by Darrell Haile 
Indiana University, Bloomington IN 47405 


The Lure of the Integers. By Joe Roberts, Math. Assn. of America, 
Washington D.C., 1992, v + 310, $25. 


Reviewed by Paul T. Bateman and Harold G. Diamond 


Mention an automobile model to a car enthusiast and you touch a memory—an 
uncle had that kind; another was an engineering marvel; a third was a commercial 
disaster. Discussion of food or wine will trigger a multitude of recollections in the 
gourmet. Say a number to the arithmetically inclined and you summon up thoughts 
on how it occurs in mathematics. For example, 1729 is the smallest number 
expressible as a sum of two cubes in two different ways, as was noted by Frenicle 
de Bessy and later by Ramanujan. Fascination with the properties of individual 
numbers may be as old as mathematics itself. Nicomachus gave the first four 
perfect numbers 6, 28, 496, 8128 around 100 A.D.; probably this list was known at 
least as far back as the time of Euclid [1]. 

More recently, this interest has led to the publication of several books on 
properties of individual numbers. One of the first, best known, and most accessible 
is Constance Reid’s From Zero to Infinity [3], which focuses mainly on properties 
of the integers 0 through 9. The Penguin Dictionary of Curious and Interesting 
Numbers by David Wells [5] considers a much larger collection of numbers than 
does Reid, but rather more briefly. Closest in scope and spirit to Roberts’ work is 
Les Nombres Remarquables by Francois le Lionnais [2]. Unlike Roberts, both Wells 
and le Lionnais discuss real numbers in general as well as integers. 

The material in The Lure of the Integers ranges from the whimsical, e.g. Richard 
Guy’s schoolgirl story (86)*, to the profound, e.g. Matijasevič’s prime-producing 
polynomial (5). Most of the topics in Lure come from the fields of recreational 
mathematics, number theory, and combinatorics. As a reflection of these interests, 
the names cited most frequently in the index are W. Sierpiński, P. Erdős, and 
D. H. Lehmer. Readers will of course find some sections more appealing than 
others; since there is essentially no connection between different topics, those 
which do not delight simply can be omitted. 

What makes a number interesting? This is a question that we cannot answer, for 
interest is largely a matter of personal taste. It is unlikely that two people’s lists of 
interesting numbers would be identical or that each number which appeared on 
two lists would be chosen for the same reason. The number 39 appears in 
le Lionnais’ collection as the first positive integer not possessing an interesting 
property (aside from this one). An inductive argument can be based on this remark 
to show that all positive integers are interesting. 

*A number in parentheses refers to a section of the book. 

Here are a few of the many items in Lure that we found attractive: (8) 
Schoenberg’s theorem [4] that a regular simplex exists in $Z^n$ if and only if $n + 1$ is 
an odd square, a sum of two odd squares, or a multiple of 4; (163) the amazing 
irrational approximations to integers based on the fact that $Q(\sqrt{-163})$ has unique 
factorization; and (561) Carmichael numbers, a sparse infinite set of composite 
numbers $n$ having the property that $a^n - a$ is divisible by $n$ for every integer $a$. 

Exercising our right to disagree on what is interesting, we mention some topics 
which did not appeal to us: We were not captivated by digital properties of 
integers, such as the fact that 27 is the largest number equal to the sum of the 
digits of its cube (27); since the sum of the digits of $n^k$ is less than $9(k \log_{10} n + 1)$, 
an infinite string of such assertions must hold. And our interest was not piqued by 
lists of exceptions to unsharp inequalities, such as the fact that 12 is the largest 
value for which the inequality $\tau(n) < n^{2/3}$ fails to hold (12), where $\tau(n)$ represents 
the number of positive integers dividing $n$. Indeed, since $\tau(n) = O(n^{\delta})$, for any 
fixed $\delta > 0$, an infinite number of results of this type can be given. 

It is incumbent on reviewers to point out at least a few items that ought to be 
changed. Here is our list: The reference to the first reviewer in Section 18.1 
omitted his coauthor, P. Erdős. In the formula for B(x) in Section 12.2, $\sqrt{\log x}$ 
should appear in place of the first log x. Following this correction, the sentence 
“He showed that NH(x) has the same form except that the first log x is changed 
to $\sqrt{\log x}$” should be amended to end with the word “form.” In Section 6.2 
reference is made to a solution of Monthly Advanced Problem 5735. We note that 
this solution was subsequently found to be incorrect, and an explanation and 
correction were given in this Monthly, Vol. 97, page 937, 1990. 

Roberts has a concise writing style. He provides the reader with statements, 
comments, and references but very few proofs, hints, or even definitions. Readers 
who are unacquainted with a topic will generally have to look elsewhere for an 
introduction. It is worth noting, as the author himself does, that the level of 
difficulty varies significantly among the topics and it may not be easy to decide 
whether an assertion is closer in difficulty to the triangle inequality or the 
Riemann hypothesis. 

The strong point of Lure is the wealth of material it contains. The interested 
reader can test this assertion by the following experiment: write down several 
propositions each featuring a specific integer and see how many of them occur in 
the book. We found that this volume covers a significant proportion of the results 
we were familiar with. Also, the bibliographic documentation is very extensive, 
giving the book considerable value as a reference resource. The Lure of the Integers 
is a welcome contribution to the field of number-related literature. 

Excursions in Calculus: An Interplay of the Continuous and the Discrete. By 


Robert M. Young, Mathematical Association of America, 1992, w+ 417, 
$36.00. 


Reviewed by Anita E. Solow 


Calculus, Calculus, Calculus! Sometimes it seems that all I hear about these days is 
calculus. Over the past six years, there have been grants to reform calculus, books 
published to help us teach the course, and conferences to spread the word to both 
the initiated and the uninitiated. I have spent a good part of the past six years of 
my life concerned with the issues of calculus reform. So, when I was asked to 
review Excursions in Calculus by Robert Young, I was all set to write the definitive 
essay on calculus and the way it should be taught. After all, the title seemed to 
imply that this book has something to do with calculus. The preface strengthened 
that belief: “Excursions in Calculus is one possible supplement to a more tradi- 
tional calculus course.” However, like any conscientious reviewer, my first step was 
to actually read the book. And when I did that, I realized that this book is not a 
calculus supplement at all. 

Excursions in Calculus is a beautifully written, fascinating book. The book 
consists of six essays on the topics of Mathematical Induction, the Binomial 
Theorem, Fibonacci numbers, Averages, Approximations, and Infinite Series. 
Interwoven through these chapters are elementary number theory, combinatorics, 
probability theory, calculus, and geometry. Each chapter includes the history and 
development of these ideas, along with the connections among them. And each 
chapter is a gem. The writing is clear and compelling. The exercises at the end of 
each section are excellent. These include difficult, and even unsolved, problems 
that are fun to tackle and that develop many of the most interesting and 
historically famous results. For example, you will find developed in the problems 
the “3x + 1 Problem,” Derangements, Public Key Cryptography, Stirling Num- 
bers, Bell Numbers, Partitions, Buffon’s Needle Problem, Catalan Numbers, Stir- 
ling’s Formula, Bézier splines, Egyptian fractions, the Cantor Function, and the 
Sierpinski Triangle. 

However, as wonderful as this book is, there are several reasons why it is not a 
supplement for calculus. First, I know of precious few first-year math students who 
could read this book. It is not that the book requires many prerequisites. After all, 
it assumes no mathematics above elementary calculus. However, the level of the 
exposition is too sophisticated for beginning mathematics students. The second 
problem is that the topics of this book are, for the most part, not really suitable for 
inclusion in a calculus course, even if it were used only by the instructor to enrich 
the lectures. In the Preface, Paul Halmos is quoted as saying “there is no such 
subject as calculus; it is not a subject because it is many subjects.” I agree that 
calculus contains many ideas that students have seen previously, and many that 
they will see again at a higher level. However, I do believe that there is a core of 
material that makes up the subject of calculus, and I do not think it makes sense to 
change our calculus courses by throwing in lots of other mathematical topics. 

The more I thought about it, the more I decided that this book is mistitled. One 
of my colleagues even suggested that a more appropriate title might be Excursions 
out of Calculus, since this book ties the continuous ideas of elementary calculus 
with the discrete ideas of number theory and combinatorics. Another, more 
appropriate, title might be Excursions into Mathematics, because of the wide 
variety of mathematical ideas that are explored. However, the latter is the title of a 
not dissimilar 1969 book by Anatole Beck, Michael Bleicher, and Donald Crowe 
[1]. It may be instructive to briefly compare these two books. On the surface there 
are many similarities. Both books consist of six essays on mathematical topics, and 
there is a nontrivial intersection of topics, with Beck containing more emphasis on 
geometry. Since Young’s book presumes a knowledge of calculus and the other 
does not, Young is able to delve into more advanced topics. The biggest difference 
in my mind is the structure of the chapters. Excursions into Mathematics contains 
six independent chapters. One of the strengths of Young’s Excursions in Calculus 
is the interdependence of the mathematics in the chapters. For example, ideas 
about prime numbers occur throughout the book, and are not neatly segregated 
into one area. 

After I finished reading this book, I began to wonder how it could be used by 
students. It was clear to me that it should be read by them. But, when I looked at 
this book and looked at our mathematics curriculum, I could not get the two to 
mesh. Excursions in Calculus is full of beautiful, elegant mathematical ideas that 
most of us would agree should be part of the common culture of math students. I 
think that we would agree that all of our students should know some number 
theory and should have heard of the famous unsolved problems of the field, such 
as the Goldbach Conjecture and Fermat’s Last Theorem. Our students should 
understand Euler’s original argument that $\sum 1/n^2 = \pi^2/6$ and what the
shortcomings of that argument were. They should understand the idea of approximation,
whether it be approximating the value of $\pi$, or the idea of approximating a
function by a polynomial, or the idea that $n \log n$ approximates the $n$th prime
number. They should be familiar with the search for prime numbers and should
see the Bernoulli numbers in the context of summing the $n$th powers of the
integers. 
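
Two of the approximations just mentioned are easy to watch numerically. The sketch below is only an illustration and is not taken from the book under review; the function names (primes_up_to, basel_partial_sum, nth_prime_ratio) are hypothetical. It compares partial sums of the reciprocals of the squares with pi squared over 6, and the n-th prime with the estimate n log n (natural logarithm).

```python
import math


def primes_up_to(limit):
    """Return all primes <= limit using the sieve of Eratosthenes."""
    is_prime = [True] * (limit + 1)
    is_prime[0:2] = [False, False]
    for p in range(2, math.isqrt(limit) + 1):
        if is_prime[p]:
            for multiple in range(p * p, limit + 1, p):
                is_prime[multiple] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def basel_partial_sum(terms):
    """Partial sum 1/1^2 + 1/2^2 + ... + 1/terms^2."""
    return sum(1.0 / (k * k) for k in range(1, terms + 1))


def nth_prime_ratio(n, primes):
    """Ratio of the n-th prime to the estimate n * log(n)."""
    return primes[n - 1] / (n * math.log(n))


if __name__ == "__main__":
    target = math.pi ** 2 / 6
    for terms in (10, 1000, 100000):
        # Partial sums creep up toward pi^2 / 6 from below.
        print(terms, basel_partial_sum(terms), target)

    primes = primes_up_to(200000)
    for n in (10, 100, 1000, 10000):
        # The ratio drifts slowly toward 1 as n grows.
        print(n, primes[n - 1], round(nth_prime_ratio(n, primes), 3))
```

The slow convergence in both experiments is itself instructive: the partial sums fall short of the limit by roughly 1/N after N terms, and the ratio of the n-th prime to n log n is still noticeably above 1 at n = 10000.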

Unfortunately, many of these ideas do not appear in the formal mathematics 
courses that we teach. Our students will see some of these ideas more by chance 
than by design. I believe that this is largely due to the separation of our discipline 
into distinct subdisciplines. So, when we teach combinatorics, we do not discuss 
number theory, and when we look at the idea of approximation in calculus, we do 
not look at approximations in number theory or numerical analysis. We all 
understand why our curriculum is structured the way it is, but it is unfortunate that 
the effect of this separation is to minimize the probability that the student will see 
the connections among these ideas. 

Excursions in Calculus could be used to counter this deficiency in our curricu- 
lum. I could envision this book as the basis for a mathematical seminar for all math 
majors. Depending on the sophistication and talent of the students, the book could
easily be supplemented by more advanced writings on particular topics. At the
joint AMS-MAA Mathematics Meetings in San Antonio (1993), there were well 
attended sessions on “capstone” courses for senior math majors. I would like to 
suggest this book as the foundation for such a course. Because of the variety of 
topics in the book, and because the purpose of the book is to put ideas from 
various mathematics courses together, it would serve the purpose of providing the 
structure upon which to hang a fascinating senior mathematical experience. 

There are, of course, other ways to use this book; I encourage you to find one. 
All of our mathematics majors, not to mention all of us, would benefit from 
reading this book.


Like Poetry, Mathematics is Beautiful


Timidly I ask
each one I meet if they
find mathematics beautiful
or useful, and each one dares to say,
“Useful, of course. I use it every day.”
And if I seem to want a proof,
they all go on to tell
that daily they subtract and add
to keep a checkbook; sometimes also
they multiply to find how many squares
they need to tile the kitchen floor.

Mathematics is not only plus
and minus, not just counting one,
two, three. There are rules to bend
defiantly, so parallels
will meet before infinity. Look
at the magic of unending terms
that converge to a finite sum:
start with one-half plus half of one-half
plus half of the last again and again.
Though we go on forever, we never

Pass one. Do you find me difficult? Oh, dear!

Suppose, instead, I ask
if poetry is beautiful
or useful. Will each person say,
“Useful, of course. I use it every day.”
And if I seem to want a proof,
they will go on to say that they
use rhymes to call to mind the days
of a month—like “Thirty hath
September”—and to remember
how to spell words with “i” and “e”.

I have a faint, enduring hope
that someday folks will see
mathematics to be
as lovely
as poetry.


JoAnne Growney 
Department of Mathematics and Computer Science 
Bloomsburg University / Bloomsburg, PA 17815


General, S*(14-18), L**. Essays in Humanistic
Mathematics. Ed: Alvin M. White.
MAA Notes No. 32. MAA, 1993, xii + 
212 pp, $24 (P). [ISBN 0-88385-089-3] 22 
brief, readable, sometimes provocative, some- 
times elegant essays on broadly cultural as- 
pects of mathematics—doing, learning, teach- 
ing. Mathematics is more than formal systems, 
say these authors: like the other humanities, 
mathematics reflects social, cultural, intellec- 
tual, and psychological contexts. A valuable, 
unusual resource. PZ 


General, S(13—14), L. Mathsemantics: Mak- 
ing Numbers Talk Sense. Edward MacNeal. 
Penguin USA, 1994, 294 pp, $22.95 (P). [ISBN 
0-670-85390-9] Numeracy, says the author, 
requires attaching context-appropriate mean- 
ings (“mathsemantics”) to otherwise formal 
numbers and operations. The case is made, 
and remedies suggested, in an anecdotal, often 
amusing, sometimes rambling style, built on an 
extended analysis of 200 job applicants’ perfor- 
mance on a quantitative reasoning test. PZ 


Reference, L**. Table of Integrals, Series, and 
Products, Fifth Edition. I.S. Gradshteyn, I.M.
Ryzhik. Ed: Alan Jeffrey. Transl: Scripta Tech- 
nica, Inc. Academic Pr, 1994, xlvii + 1204 pp, 
$54.95. [ISBN 0-12-294755-X] Corrected 
and expanded. Many new entries and sections. 
(1980 edition, TR, December 1980.) AO 


Mathematics Appreciation, T(13: 1). Math- 
ematical Thinking in a Quantitative World. 
Linda R. Sons, Peter J. Nicholls. Kendall/Hunt, 
1992, 289 pp, $34.95 (P). [ISBN 0-8403-7924-2]
Text for quantitative reasoning course: covers
statistics, logical statements and arguments,
graphical solution of systems of equations and 
inequalities, extrema of linear and quadratic 
functions, average rate of change, error anal- 
ysis, business applications. Extensive exercise 
sets; good, nonroutine problems. Assumes ele- 
mentary algebra, geometry. KES 


Mathematics Appreciation, S(13-14), L. Invitation
to Mathematics. Konrad Jacobs.
Princeton Univ Pr, 1992, xi + 247 pp, 
$29.95 (P); $60. [ISBN 0-691-02528-2; 0- 
691-08567-6] Wide-ranging introduction to 
mathematical thinking. Topics include pos- 
sible and impossible constructions, symme- 
try groups, Marriage Theorem, network flows, 
games, knots, fixed point theorems, dynamical 
systems. No exercises. Translation of 1987 
German version. KES 


Recreational Mathematics, S(13), L. A Math- 
ematical Pandora’s Box. Brian Bolt. Cam- 
bridge Univ Pr, 1993, 126 pp, $15.95 (P). 
[ISBN 0-521-44619-8] 142 puzzles, games, 
tricks, and recreations, with solutions, at same 
level as in author’s three previous puzzle books 
(Amazing Mathematical Amusement Arcade; 
The Mathematical Funfair, TR, March 1991; 
and Mathematical Cavalcade). LCL 


Recreational Mathematics, P. Mathemati- 
cal Magic. William Simon. Dover, 1993, 
187 pp, $5.95 (P). [ISBN 0-486-27593-0] 
Clever tricks that require little mathematics be- 
yond arithmetic. An entertaining collection, 
well worth the price. Topics: magic with num- 
bers, magic of shape, calendar magic, magic 
squares, and magic with playing cards. MK


Elementary, S(13). Mathématiques générales:
Problèmes résolus. Jacques Bair, Geneviève
Hamende. De Boeck Université (avenue Louise 
203, B-1050, Bruxelles), 1992, iii + 236 pp, 145 
FF (P). [ISBN 2-8041-1603-4] Problems and 
solutions in elementary mathematics: set the- 
ory, combinatorics, maxima and minima, ex- 
ponential and logarithmic functions, etc. Ac- 
companies authors’ Mathématiques générales 
(2ème édition): 320 exercices, 380 problèmes
avec leurs solutions, 160 tableaux. LC 


Precalculus, T(13: 1, 2). Mathematics for Cal- 
culus, Second Edition. James Stewart, Lothar 
Redlin, Saleem Watson. Brooks/Cole, 1993, 
xviii + 773 pp, $51.50. [ISBN 0-534-20250-0] 
Many new examples, exercises that use technol- 
ogy. Chapters end with problem-solving high- 
lights. Margins include biographies, applica- 
tions. (First Edition, TR, October 1989.) TH 


Precalculus, T(13: 1, 2). College Algebra and 
Trigonometry, Third Edition. Bernard Kolman, 
Michael L. Levitan, Arnold Shapiro. Saunders 
College, 1993, xx + 768 pp, $50.75. [ISBN 
0-03-046933-3] New to this edition: scien- 
tific notation, critical value method for solv- 
ing inequalities, partial fractions, determinants, 
polynomial functions, new material on ana- 
lytic geometry. Over 350 graphing calculator 
problems. Features progress checks, carefully 
worked examples, warnings on pitfalls. (Sec- 
ond Edition, TR, November 1987.) TH 


Education, P. Street Mathematics and School 
Mathematics. Terezinha Nunes, Analucia Dias 
Schliemann, David William Carraher. Cam- 
bridge Univ Pr, 1993, viii + 170 pp, $49.95; 
$16.95 (P). [ISBN 0-521-38116-9; 0-521- 
38813-9] Comparison of mathematics use in 
and out of school, across wide range of ages, 
cultures, and occupations. Concludes that 
“street mathematics is not the learning of par- 
ticular procedures repeated in an automatic un- 
thinking way, but involves the development of 
mathematical concepts and processes.”  Pri- 
mary lesson for “realistic mathematics” teach- 
ing: respect the social context of problems. MW 


Education, T(16—18: 1), S. Problem Posing: 
Reflections and Applications. Eds: Stephen
I. Brown, Marion I. Walter. Lawrence Erl- 
baum Assoc, 1993, xvii + 336 pp, $24.95 (P). 
[ISBN 0-8058-1065-X] 30 essays reflect on 
problem-posing rationale and strategies, pri- 
marily in pre-college mathematics. Excellent 
text for pre-service or in-service teachers. MW 


Education, P. Japanese Grades 7-9 Mathe- 
matics. Ed: Kunihiko Kodaira. Transl: Hiromi 
Nagata, George Fowler. UCSMP Textbook 
Transl. Univ of Chicago School Math Project,
1992. Grade 7, xii + 185 pp [ISBN 0-936745-53-3];
Grade 8, xii + 205 pp [ISBN 0-936745-54-1];
Grade 9, xii + 199 pp. [ISBN 0-936745-55-X]
Example-driven presentations of topics
that U.S. students see in grades 7-11. Virtually 
all Japanese students complete grade 9; all are 
responsible for material in these texts. The les- 
son here is the depth to which all ninth graders 
are expected to learn mathematics. Don’t look 
for innovative content or teaching ideas (man- 
ual calculation of square roots is a ninth grade 
Advanced Topic for Individual Study). MW 


Education, T(13-14: 2). Mathematics for Elementary
Teachers: An Interactive Approach.
Thomas Sonnabend. Saunders College, 1993, 
xviii + 917 pp, $46.75. [ISBN 0-03-020709- 
6] Emphasis on discovery, discussion, appli- 
cations, and BASIC programming. Each chap- 
ter ends with selected NCTM Curriculum Stan- 
dards for students to connect to chapter material 
and to current elementary textbooks. Tradi- 
tional table of contents. MW 


Education, P. How to Use Children’s Lit- 
erature to Teach Mathematics. Rosamond 
Welchman-Tischler. NCTM, 1992, iv + 75 pp, 
$8.50 (P). [ISBN 0-87353-349-6] Excellent 
resource for elementary teachers. Classroom 
activities, designs for worksheets and activity 
cards, and follow-up activities for twenty books. 
Sketches ideas for using additional books. KES 


Education, P. Rational Numbers: An Inte- 
gration of Research. Eds: Thomas P. Carpen- 
ter, Elizabeth Fennema, Thomas A. Romberg. 
Stud. in Math. Thinking & Learning. Lawrence 
Erlbaum Assoc, 1993, xi + 372 pp, $79.95. 
[ISBN 0-8058-1135-4] 13 chapters report on 
research programs that address teaching, learn- 
ing, curriculum, and assessment from an in- 
tegrated perspective. Aims to “understand 
the effects of instruction” in rational numbers, 
not to “develop prescriptions for more effec- 
tive instruction.” Chapters on benefits of re- 
search linking teaching and learning are rele- 
vant throughout mathematics. MW 


Education, P. The Wonderful World of Mathe- 
matics: A Critically Annotated List of Children’s 
Books in Mathematics. Eds: Diane Thiessen, 
Margaret Matthias. NCTM, 1992, xi + 241 pp, 
$17 (P). [ISBN 0-87353-353-4] Describes 
and rates usefulness of almost 500 trade books 
that emphasize mathematics concepts. Orga- 
nized by concept. KES 


History, S(13-17). In the Wake of Chaos: 
Unpredictable Order in Dynamical Systems. 
Stephen H. Kellert. Univ of Chicago Pr, 1993, 
xiv + 176 pp, $19.95. [ISBN 0-226-42974-1] 
By a philosopher; starts with readable accounts
of both mathematics and history of dynamical
systems. Later chapters on philosophical is- 
sues: “determinism” vs. “predictability;” social 
and cultural biases in research, etc. KS 


Logic, S(15-16), L. Popular Lectures on Mathematical
Logic. Hao Wang. Dover, 1993, vi
+ 281 pp, $8.95 (P). [ISBN 0-486-67632-3] 
Reprint of 1981 edition (with new postscript). 
A wide-ranging treatment of logic and its con- 
nections to other areas of mathematics, com- 
puter science, and applications. Part technical, 
part philosophical. RM 


Combinatorics, T(16—18: 2), S, P, L. A Course 
in Combinatorics. J.H. van Lint, R.M. Wilson.
Cambridge Univ Pr, 1992, xii + 530 pp,
$80; $29.95 (P). [ISBN 0-521-41057-6; 0-521- 
42260-4] Sophisticated introduction; abstract 
algebra recommended. Topics include Ramsey 
theory, extremal graphs, generating functions, 
codes and designs, association schemes, alge- 
braic graph theory. AD 


Combinatorics, P. Oriented Matroids. Anders 
Björner, et al. Ency. of Math. & Its Applic.,
V. 46. Cambridge Univ Pr, 1993, xii + 516 pp, 
$89.95. [ISBN 0-521-41836-4] For experts 
and novices. Chapters treat axiomatics, face 
lattices, topological models, realizability, con- 
vex polytopes, linear programming. Complete 
in motivating idea of oriented matroids, in pre- 
senting topics, in exercises, and in bibliography. 
A thoroughly thorough book. SG 


Number Theory, P. Elementary Theory of 
L-functions and Eisenstein Series. Haruzo 
Hida. London Math. Soc. Student Texts, 
V. 26. Cambridge Univ Pr, 1993, xi + 386 pp, 
$69.95; $22.95 (P). [ISBN 0-521-43411-4; 
0-521-43569-2] A thorough introduction to 
p-adic modular forms and L-functions includ- 
ing Eichler-Shimura isomorphisms and func- 
tional equations for Hecke L-functions. SG 


Number Theory, P. Computational Algebraic 
Number Theory. Michael E. Pohst. DMV Seminar,
B. 21. Birkhäuser, 1993, 88 pp, $26.50 (P).
[ISBN 0-8176-2913-0] Describes algorithms 
developed recently to calculate a basis for the 
ring of integers in an algebraic number field, its 
unit group, and its ideal class group. SG 


Linear Algebra, T(14: 1). Linear Algebra 
for Mathematics, Science, and Engineering. 
Edward M. Landesman, Magnus R. Hestenes. 
Prentice Hall, 1992, xiii+ 551 pp. [ISBN 0-13- 
529561-0] Standard introductory text. Cov- 
ers eigenvalues and eigenvectors before ab- 
stract vector spaces, linear transformations. 
Treats Gerschgorin intervals and disks, least 
squares, pseudoinverses, Rayleigh quotients, 
Gauss-Seidel methods. LC


Group Theory, P. Geometric Group Theory,
Volume I. Eds: Graham A. Niblo, Martin A. 
Roller. London Math. Soc. Lect. Note Ser., 
V. 181. Cambridge Univ Pr, 1993, 212 pp, 
$34.95 (P). [ISBN 0-521-43529-3] Proceed- 
ings of 1991 Sussex University symposium. 


Algebra, T(16-17: 2), S, L. Abstract Algebra
with Applications, in Two Volumes. Karlheinz
Spindler. Marcel Dekker, 1994, $75 each,
$125 set. Volume I: Vector Spaces and Groups, 
xvii + 756 pp [ISBN 0-8247-9144-4]; Volume 
II: Rings and Fields, xv + 531 pp. [ISBN 0- 
8247-9159-2] A self-contained graduate text 
stressing clarity, not brevity. With historical 
motivation, concrete examples, informal com- 
ments. “Applications” of algebraic concepts 
are to geometry, topology, Markov chains, dif- 
ferential equations, combinatorics, number the- 
ory, and algebraic geometry. 58 chapters, each 
with many exercises. LCL 


Algebra, T(14-16: 1, 2), L. Abstract Algebra: 
A First Undergraduate Course, Fifth Edition. 
Abraham P. Hillman, Gerald L. Alexanderson. 
PWS, 1994, xv + 480 pp. [ISBN 0-534-19128- 
2] Relatively minor changes in what has been 
(and will be) a widely used and highly regarded 
text. (Fourth Edition, TR, January 1989.) JS 


Algebra, T(17-18: 2), L. Algebra. Mark 
Steinberger. PWS, 1994, xvi + 558 pp. [ISBN 
0-534-93678-4] Elementary group theory; 
Sylow theory; categories; rings; field exten- 
sions; tensor products; linear algebra; Ga- 
lois theory; semi-simple and hereditary rings; 
Dedekind domains. Assumes relatively little 
background. JS 


Algebra, T(17), P*. Noncommutative Algebra. 
Benson Farb, R. Keith Dennis. Grad. Texts in 
Math., V. 144. Springer-Verlag, 1993, xiv + 
223 pp, $34. [ISBN 0-387-94057-X] A nice 
introduction to noncommutative algebra from 
a homological point of view. Core consists of 
Wedderburn structure theorems, Jacobson radi- 
cal, double-centralizer theorems, and the Brauer 
group. Further topics include representation 
theory, primitive rings, global dimension. With 
a wealth of exercises. TH 


Calculus, T(13: 1, 2). Calculus, Part I & 
II, Revised Edition. George W. Best, David 
A. Penner (Phillips Academy, Andover, MA 
01810), 1992, $45 set (P). Part I, 293 pp; Part 
II, 854 pp. Concepts precede techniques: introduces
the definite integral and the derivative
function before limits, differentiation techniques,
or the fundamental theorem. Covers traditional
single-variable calculus topics as well
as parametric equations and vectors, differential
equations, and sequences and series. AO


Calculus, S(13). Exploring Precalculus and
Calculus with the TI-81 Graphics Calculator. 
George W. Best, David A. Penner. Venture Pub, 
1992, 188 pp, $25 (P). Exercises and projects 
on functions, derivatives, and integrals. Em- 
phasizes graphical and numerical approaches. 
First chapter introduces calculator capabilities, 
including programming. AO 

Calculus, T(13: 2). Technical Calculus with 
Analytic Geometry, Third Edition. Peter Kuh- 
fittig. Brooks/Cole, 1994, xii + 525 pp, 
$65. [ISBN 0-534-21852-0] Nonrigorous 
treatment stressing applications. Three chap- 
ters on differential equations, including Laplace 
transforms. Exploits graphing calculators. HD 


Real Analysis, P. Lectures on Ergodic Theory 
and Pesin Theory on Compact Manifolds. Mark 
Pollicott. London Math. Soc. Lect. Note Ser., 
V. 180. Cambridge Univ Pr, 1993, ix + 162 pp, 
$37.95 (P). [ISBN 0-521-43593-5] First half 
covers basic ergodic theory (recurrence, ergodic 
theorems, entropy); remainder treats Pesin the- 
ory, the study of non-uniformly hyperbolic dif- 
feomorphisms. SB 


Complex Analysis, T(18), S, P, L. Invari- 
ant Distances and Metrics in Complex Analysis. 
Marek Jarnicki, Peter Pflug. Expos. in Math., 
V.9. Walter de Gruyter, 1993, xi+ 408 pp, DM 
178. [ISBN 3-11-013251-6] Theory of holo- 
morphically constructible objects on domains 
in n-dimensional complex space: Carathéodory 
and Kobayashi pseudodistances and pseudo- 
metrics, complex Green’s function, Bergman 
distance and metric, etc. Many examples, exercises,
open problems, new results. KS


Complex Analysis, T(18), L? Rational Itera- 
tion: Complex Analytical Dynamical Systems. 
Stud. in Math., V. 16. Walter de Gruyter, 1993, 
ix +189 pp, DM 108. [ISBN 3-11-013765- 
8] A comprehensive account of substantive 
work of Fatou and Julia—so often alluded to 
in superficial treatments rushing to the now fa- 
mous pictures. Quasiconformal mappings are 
de-emphasized, except as needed to sketch the 
idea of Sullivan’s theorem on no wandering do- 
mains. Assumes basic complex analysis. AWR 


Differential Equations, S(18), P. Treatise on 
Analysis, Volume VIII. J. Dieudonné. Transl:
Laura Fainsilber. Pure & Appl. Math., V. 10-VIII.
Academic Pr, 1993, xi + 356 pp, $110.
[ISBN 0-12-215508-4] Chapter 23, Part II of
Dieudonné’s magnum opus, concentrating on
boundary value problems. The style is well 
known, which is to say dense—not a gentle 
introduction for the uninitiated. AWR 

Differential Equations, T*(18: 2), P. Applications
of Lie Groups to Differential Equations,
Second Edition. Peter J. Olver. Grad. Texts in
Math., V. 107. Springer-Verlag, 1993, xxviii 
+ 513 pp, $59. [ISBN 0-387-94007-3] The 
book to read to learn about symmetry groups 
of differential equations and their many appli- 
cations (solving PDE’s and ODE’s explicitly, 
finding conservation laws, etc.). New edition 
adds brief discussion of pseudo-differential op- 
erators, and maintains a wealth of examples, 
frank and interesting historical notes, and a most 
readable expository style. JO 


Differential Equations, T(14-15: 1). Or- 
dinary Differential Equations with Applica- 
tions, Third Edition. Bernard J. Rice, Jerry 
D. Strange. Brooks/Cole, 1994, ix + 533 pp, 
$63. [ISBN 0-534-21318-9] Well-chosen ex- 
amples, clear writing style, many applications. 
Stresses linear equations. Technology is used 
(and recommended) throughout. DS 


Differential Equations, T(18: 1), P. De- 
lay Differential Equations With Applications in 
Population Dynamics. Yang Kuang. Math. 
in Sci. & Eng., V. 191. Academic Pr, 1993, 
xii + 398 pp, $59.95. [ISBN 0-12-427610-5] 
Research results in population dynamics using 
delay differential equations. Reviews basic the- 
ory, motivated by applications. DS 


Differential Equations, S(14—15). Differential 
Equations with Mathematica. Martha L. Abell, 
James P. Braselton. Academic Pr, 1993, viii 
+ 631 pp, $44.95 (P). [ISBN 0-12-041538-0] 
Supplements a standard ODE course; demon- 
strates uses of Mathematica. Not a full-fledged 
text; no exercises, insufficient exposition. JO 


Partial Differential Equations, P. De- 
generate Parabolic Equations. Emmanuele 
DiBenedetto. Universitext. Springer-Verlag, 
1993, xv + 387 pp, $39 (P). [ISBN 0-387- 
94020-0] Overview of recent results. JO 


Partial Differential Equations, T(15: 1). Par- 
tial Differential Equations for Scientists and En- 
gineers. Stanley J. Farlow. Dover, 1993, ix + 
414 pp, $12.95 (P). [ISBN 0-486-67620-X] A 
standard introduction, 47 “semi-independent”
lessons. Unusual topics include control theory,
calculus of variations, Monte Carlo methods,
and potential theory. Relatively few exercises.
(1982 Wiley edition, TR, January 1983.) JO


Partial Differential Equations, T(18), P. 
Partial Differential Equations IV: Microlocal 
Analysis and Hyperbolic Equations. Eds: 
Yu. V. Egorov, M.A. Shubin. Ency. of Math. 
Sci., V. 33. Springer-Verlag, 1993, 241 pp, 
$79. [ISBN 0-387-53363-X] Two indepen- 
dent themes from current research in PDE’s: 
(1) survey of results in local analysis of cotangent
bundle space; (2) linear hyperbolic equations
and systems. DS


Dynamical Systems, P. Hyperbolicity and 
Sensitive Chaotic Dynamics at Homoclinic Bi- 
furcations: Fractal Dimensions and Infinitely 
Many Attractors. Jacob Palis, Floris Takens. 
Stud. in Adv. Math., V. 35. Cambridge Univ 
Pr, 1993, x + 234 pp, $54.95. [ISBN 0-521- 
39064-8] A self-contained introduction to ho- 
moclinic bifurcation theory. Begins with de- 
tailed treatments of several classical examples. 
Stresses interplay between homoclinic tangen- 
cies and non-trivial basic sets. Includes a geo- 
metric proof of Newhouse’s Theorem. MPR 


Numerical Analysis, P. Mathematical and 
Computational Techniques for Multilevel Adaptive
Methods. Ulrich Rüde. Frontiers in Appl.
Math., V. 13. SIAM, 1993, xii+ 140 pp, $23 (P). 
[ISBN 0-89871-320-X] Basic theory of mul- 
tilevel adaptive methods for solving PDE’s nu- 
merically. Discusses data structures needed for 
such methods, how object-oriented program- 
ming implements these data structures. JO 


Operator Theory, T(17), L. Hilbert Space: 
Compact Operators and the Trace Theorem. 
J.R. Retherford. London Math. Soc. Stud. 
Texts, V. 27. Cambridge Univ Pr, 1993, xii 
+ 131 pp, $44.95; $19.95 (P). [ISBN 0-521- 
41884-4; 0-521-42933-1] A readable intro- 
duction to Hilbert space and Banach space tech- 
niques, moving toward the Lidskij trace theo- 
rem as culmination. Aims to be comprehen- 
sible to beginning graduate students, useful to 
advanced students; assumes advanced calculus 
and linear algebra. AWR 

Functional Analysis, T(16—17: 1, 2), S, L. Ap- 
plied Algebra and Functional Analysis. An- 
thony N. Michel, Charles J. Herget. Dover, 
1993, x + 484 pp, $10.95 (P). [ISBN 0-486- 
67598-X] Good text for engineering and sci- 
ence students. Covers algebra (including Jor- 
dan canonical form), metric spaces, normed 
spaces, inner-product spaces, and operator the- 
ory through the spectral theorem for compact 
normal operators. Applications to differential 
and integral equations, random variables, opti- 
mal control, minimization. HD 


Functional Analysis, P. Function Spaces, 
Differential Operators and Nonlinear Analysis. 
Eds: Hans-Jürgen Schmeisser, Hans Triebel.
Teubner-Texte zur Mathematik, Band 133. BG 
Teubner Leipzig, 1993, 308 pp, DM 54,80 (P). 
[ISBN 3-8154-2045-8] Proceedings of a 1992 
conference in Friedrichroda, Germany. 


Analysis, T*, P*, L. Harmonic Function 
Theory. Sheldon Axler, Paul Bourdon, Wade 
Ramey. Grad. Texts in Math., V. 137. Springer-Verlag,
1992, xii + 231 pp, $39.50. [ISBN
0-387-97875-5] Harmonic function theory in
$\mathbb{R}^n$, starting with the basics. Topics include
Bôcher’s theorem, spherical harmonics, harmonic
Hardy and Bergman spaces, Dirichlet
problem. Assumes “solid foundation in real and 
complex analysis, and some basic results from 
functional analysis.” Engaging style; plenty 
of motivation. Many good problems end each 
chapter. Well worth a look. BH 


Analysis, P, L. Inverse Problems in the Mathe- 
matical Sciences. Charles W. Groetsch. Friedr 
Vieweg & Sohn, 1993, v + 152 pp, $30. 
[ISBN 3-528-06545-1] “Broad-based intro- 
duction” to “some of the main lines of research 
in inverse and ill-posed problems.” Numerous 
physical models, sketch of some operator theory 
background, survey of some numerical meth- 
ods. Extensive annotated bibliography. BH 


Analysis, T(17), S, L. Fundamentals of Con- 
vex Analysis: Duality, Separation, Representa- 
tion, and Resolution. Michael J. Panik. The- 
ory & Decision Lib., Ser. B, V. 24. Kluwer 
Academic, 1993, xxii + 294 pp, $108. [ISBN 
0-7923-2279-7] Considerable overlap in cov- 
erage and viewpoint with Rockafellar’s Convex 
Analysis (TR, April 1970), but written for stu- 
dents of economics, management science, and 
engineering rather than mathematicians. Nicely 
provides the theory necessary for linear pro- 
gramming, duality, separation theorems, etc. 
Some proofs are omitted. AWR 


Analysis, T(17-18: 1, 2), S, L. Problems in 
Real and Complex Analysis. Bernard R. Gel- 
baum. Prob. Books in Math. Springer-Verlag, 
1992, ix + 488 pp, $59. [ISBN 0-387-97766-X] 
A large, useful collection of problems, with full 
solutions, in real and complex analysis. Diffi- 
culty ranges from routine to quite challenging. 
Excellent for self-study, preparation for doc- 
toral prelims, professional brushing-up. PZ 


Analysis, P. Lie Groups and Lie Algebras I: 
Foundations of Lie Theory, Lie Transformation 
Groups. Ed: A.L. Onishchik. Ency. of Math. 
Sci., V. 20. Springer-Verlag, 1993, 235 pp, $89. 
[ISBN 0-387-18697-2] Two major chapters: 
Foundations of Lie Theory (covers basics of Lie 
groups and Lie algebras and their relationship, 
the universal enveloping algebra, and general- 
izations of Lie groups) and Lie Transformation 
Groups (considers Lie group actions on mani- 
folds, various homogeneous spaces, and actions 
of compact Lie groups). SG 

Optimization, T(16-17: 1, 2), L*. Network
Flows: Theory, Algorithms, and Applications.
Ravindra K. Ahuja, Thomas L. Magnanti,
James B. Orlin. Prentice Hall, 1993, xv
+ 846 pp. [ISBN 0-13-617549-X] A comprehensive,
up-to-date introduction to shortest
path, maximum flow, and minimum cost flow 
problems. Relatively few computational exer- 
cises; most exercises address theory or applica- 
tions. Extensive reference notes. AO 


Mathematical Modeling, T(16—17: 1). Math- 
ematical Modelling of Complex Mechanical 
Systems, Volume 1: Discrete Models. K. Ar- 
czewski, J. Pietrucha, C.M. Leech. Math. & 
Its Applic. Ellis Horwood, 1993, 293 pp, $42. 
[ISBN 0-13-563750-3] Unified treatment of 
models from mechanical engineering, thermo- 
dynamics, fluid mechanics, etc. DS 


Control Theory, P. Nonlinear Feedback Con- 
trol Systems: An Operator Theory Approach. 
Rui J.P. de Figueiredo, Guanrong Chen. Aca- 
demic Pr, 1993, ix + 220 pp, $59.95. [ISBN 
0-12-208630-9] Presents one approach to unifying
mathematical results of nonlinear control
theory. Aims for a mathematical theory that
will facilitate the formulation of “... nonlinear
problems, the understanding of the underlying 
system properties, and the construction of the 
pertinent algorithms.” JO 


Probability, T(13: 1). Introduction to Probability.
John E. Freund. Dover, 1993, viii
+ 247 pp, $6.95 (P). [ISBN 0-486-67549-1] 
Elementary, easy-reading text covers combi- 
natorics, conditional probability, expectation, 
probability distributions, the law of large num- 
bers. Republication of a 1973 Dickenson text 
(TR, August-September 1973). RWJ 


Probability, T(15-16: 1). Introduction to 
Probability Models, Fifth Edition. Sheldon 
M. Ross. Academic Pr, 1993, xi + 556 pp, 
$59.95. [ISBN 0-12-598455-3] This edition 
includes examples for greedy algorithms, min- 
imizing highway encounters, tracking AIDS, 
compound Poisson process applications, and 
theory of options pricing including arbitrage 
theorem and Black-Scholes option pricing for- 
mula. 120 new exercises. (Fourth Edition, TR, 
February 1990.) MK 


Probability, T(16—17: 1, 2), L. Probabil- 
ity. Alan F. Karr. Texts in Stat. Springer-Verlag, 
1993, xxi+ 282 pp, $39. [ISBN 0-387-94071-5] 
Probability, random variables, independence, 
expectation, convergence of sequences of ran- 
dom variables, characteristic functions, limit 
theorems, prediction and conditional expecta- 
tion, and martingales. Preliminary section on 
random walks serves as introduction. Assumes 
undergraduate real analysis. RWJ 


Stochastic Processes, S(18), P. Continuous-Time
Markov Chains: An Applications-Oriented
Approach. William J. Anderson.
Ser. in Stat. Springer-Verlag, 1991, xii +
355 pp, $69. [ISBN 0-387-97369-9] Theory 
of continuous-time Markov chains via transi- 
tion functions, approached through backward 
and forward equations and integral recursions. 
Assumes background in analysis, probability, 
Markov chains. No exercises. MK 


Stochastic Processes, T(13: 1). An Introduc- 
tion to Probability and Stochastic Processes. 
Marc A. Berger. Texts in Stat. Springer-Verlag, 
1993, xii + 205 pp, $39. [ISBN 0-387-97784- 
8] Elegant presentation from a non-measure- 
theoretic viewpoint. Terse exposition, discus- 
sion, proofs; excellent problem sets. Topics in- 
clude Markov chains, passage phenomena, sta- 
tionary distributions and steady states, Markov 
jump processes, ergodic theory with applica- 
tions to fractals. MK 


Stochastic Processes, T(18: 2). Brownian Mo- 
tion and Stochastic Calculus, Second Edition. 
Ioannis Karatzas, Steve E. Shreve. Grad. Texts 
in Math., V. 113. Springer-Verlag, 1991, xxiii 
+ 470 pp, $39.50 (P). [ISBN 0-387-97655-8] 
Advanced treatment of the continuous-time 
Markov (and Martingale) property. Brownian 
motion is studied thoroughly at outset, leading 
to stochastic integration, Brownian motion and 
PDE’s, stochastic DE’s, and Lévy’s theory of
Brownian local time. MK 


Stochastic Processes, T(16-17: 2). An Introduction
to Stochastic Modeling, Revised Edition.
Howard M. Taylor, Samuel Karlin. Academic
Pr, 1994, xi + 566 pp, $59.95. [ISBN 0-
12-684885-8] Revisions include many more 
exercises (ordered by difficulty), Wald’s equa- 
tion now appears in renewal theory chapter, ex- 
amples of sum quota sampling. (First Edition, 
TR, May 1985.) KB 


Stochastic Processes, P. Poisson Processes.
J.F.C. Kingman. Oxford Stud. in Prob., V. 3.
Clarendon Pr, 1993, viii + 104 pp, $39.95.
[ISBN 0-19-853693-3] Surveys Poisson processes
in one or more dimensions. Several applications
to ecology. Avoids measure theory,
but contains ample mathematical detail. Top- 
ics include marked Poisson processes, Cox pro- 
cesses, stochastic geometry, Poisson-Dirichlet 
distribution. No exercises. MK 


Elementary Statistics, T(13: 1). A First 
Course in Statistics, Fourth Edition. James T. 
McClave, Frank H. Dietrich II. Dellen, 1992, 
xv + 583 pp. [ISBN 0-02-378561-6] Deletes 
“optional” topics from previous edition (TR, 
October 1983) to shorten text. Comparisons 
of means and proportions are treated in sepa- 
rate chapters. Numerous examples from current 
events and scientific research areas. Includes a
substantial demographic data set, many exercises.
Emphasizes mechanics. MK


Elementary Statistics, T(13: 1). Statisti- 
cal Methods. Rudolf J. Freund, William 
J. Wilson. Academic Pr, 1993, xix + 
644 pp, $52.50. [ISBN 0-12-267470-7] Non- 
calculus-based introduction to statistics. Em- 
phasizes drawing conclusions from analyses, 
not mechanics. Many exercises. Covers topics 
through multiple linear regression, factorial experiments,
weighted least-squares, logistic regression,
and log-linear models. MK


Elementary Statistics, T(13: 1, 2), S, C. Hyperstat:
Macintosh Hypermedia for Analyzing
Data and Learning Statistics. David M. Lane. 
Academic Pr, 1993, x + 128 pp, $59.95 (P), 
with disks. [ISBN 0-12-436130-7] Combines 
a statistics text, a data analysis program, and
interactive exercises. Hypertext links related
concepts. Main text is on disk; accompanying
manual explains general use, specific statistical
procedures, exploration options. Assumes
Macintosh basics, but no prior knowledge of 
statistics. KB 


Elementary Statistics, T(14: 1), L?? Statis- 
tics in Theory and Practice. Robert Lupton. 
Princeton Univ Pr, 1993, x + 188 pp, $24.95. 
[ISBN 0-691-07429-1] A text for math and 
science majors. Assumes knowledge of calcu- 
lus ideas and techniques. “There is very little 
real data in this book, and very few exercises 
of the sort that ask you to apply the techniques 
presented to real problems.” MLR 


Mathematical Statistics, T(14-15). Founda- 
tions of Probability and Statistics. William C. 
Rinaman. Saunders College, 1993, xii + 773 pp, 
$58.75. [ISBN 0-03-071806-6] Stresses the- 
ory; few “real” data sets or case studies. Tradi- 
tional contents. Examples, exercises are many 
and useful. MK 


Mathematical Statistics, S(15-16). Solutions
in Statistics and Probability (Second Edition). 
Edward J. Dudewicz. Amer. Ser. in Math. & 
Management Sci., V. 3. American Sciences 
Pr, 1993, iv + 318 pp, $98.75 (P). [ISBN 0- 
935950-35-4] Detailed solutions to problems
in Introduction to Statistics and Probability by
Dudewicz, and Modern Mathematical Statistics
by Dudewicz and Mishra. (First Edition, TR,
April 1982.) RWJ


Mathematical Statistics, T(14). Probability 
and Statistics for Engineers and Scientists, Fifth 
Edition. Ronald E. Walpole, Raymond H. My- 
ers. Macmillan, 1993, xiv + 766 pp. [ISBN 
0-02-424201-2] New edition emphasizes data
analysis more than before (Fourth Edition, TR, 
December 1985). Includes new exercises emphasizing
“real-life” science and engineering
applications, and complete case studies with 
computer printout (MINITAB), graphics, and 
diagnostic measures. Section on Taguchi’s ro- 
bust parameter design. MK 


Statistical Methods, T(17). Time Series: 
Forecasting, Simulation, Applications. Gareth 
Janacek, Louise Swift. Math. & Its Applic. 
Ellis Horwood (US Distr: Simon & Schuster), 
1993, 331 pp, $74.95. [ISBN 0-13-918459- 
7] Introduction to time series, theory and ap- 
plications. State space models, Kalman filter 
are introduced early, with emphasis on exact 
maximum-likelihood estimation. These ideas 
are then recast for ARMA model development 
and for structural models; also treats frequency 
domain ideas. Special topics: multiple series, 
missing data, long-memory processes, etc. MK 


Statistical Methods, T(16-17: 1), P, L. Statistical
Design and Analysis for Intercropping
Experiments, Volume I: Two Crops. Walter T. 
Federer. Ser. in Stat. Springer-Verlag, 1993, xx 
+ 298 pp, $59. [ISBN 0-387-97923-9] Sur- 
veys statistical design and analysis of intercrop- 
ping experiments. Applies statistical models to 
relative indices, yield-density relations, biological
competition, and spatial-intimacy arrangements.
Includes several nice data sets. RWJ


Statistical Methods, P. Bayesian Statistics
4. Eds: J.M. Bernardo, et al. Clarendon Pr, 
1992, xiii + 859 pp, $95. [ISBN 0-19-852266- 
5] Proceedings of the Fourth Valencia Inter- 
national Meeting on Bayesian Statistics, April 
1991. MK 


Statistics, P. Nonlinear Statistical Models.
Andrej Pázman. Math. & Its Applic.,
V. 254. Kluwer Academic, 1993, ix + 259 pp,
$119. [ISBN 0-7923-2247-9] Treats L2 estimators
in (multivariate) nonlinear regression
models, their properties (using differential- 
geometrical methods), their computation, and 
local and global approximations of their proba- 
bility densities. Introductory chapter on linear 
regression, concluding chapter on nonlinear ex- 
ponential families. RWJ 


Algorithms, P. Efficient Algorithms for List- 
ing Combinatorial Structures. Leslie Ann 
Goldberg. Cambridge Univ Pr, 1993, xv 
+ 160 pp, $44.95. [ISBN 0-521-45021-7] 
Award-winning dissertation. LC 


Computer Organization, P. High Perfor- 
mance Computing. Kevin Dowd. O’Reilly & 
Assoc, 1993, xxv + 371 pp, $25.95 (P). [ISBN 
1-56592-032-5] For “people who are inter- 
ested in computer performance or who need to 
understand it for their job.” Treats RISC architectures,
modern memory systems, optimizing
compilers, parallel processing, benchmarking,
porting code. A brisk, practical overview. JO 


Computer Systems, S(16-17), P, C. Introduction
to Computer Performance Analysis with
Mathematica. Arnold O. Allen. Comp. Sci. & 
Scient. Computing. Academic Pr, 1994, xx + 
356 pp, $49.95 with disk. [ISBN 0-12-051070-7]
Tools for modeling, simulation, and performance
analysis of computer systems; all implemented
in Mathematica, with code on accompanying
disk. Suitable for self-study. RM


Computer Systems, P. ORACLE Performance 
Tuning. Peter Corrigan, Mark Gurry. O’Reilly 
& Assoc, 1993, xxxv + 603 pp, $34.95 (P). 
[ISBN 1-56592-048-1] 


Computer Graphics, P. Quick Reference to 
Computer Graphics Terms. Roger T. Stevens. 
Academic Pr, 1993, vi + 237 pp, $29.95 (P). 
[ISBN 0-12-668310-7] A dictionary of graphics
terms aimed at readers who, like the author,
read graphics literature and find new terms and 
acronyms around every corner. A good idea. JO 


Artificial Intelligence, T*(15-18: 1, 2), L. Artificial
Intelligence: Structures and Strategies
for Complex Problem Solving, Second Edition. 
George F. Luger, William S. Stubblefield. Ben- 
jamin/Cummings, 1993, xxiv + 740 pp. [ISBN 
0-8053-4780-1] Attempts to unify artificial 
intelligence theory and problem solving tech- 
niques. Stresses representational formalisms, 
search techniques; artificial intelligence within 
scientific tradition. New to this edition: ma- 
chine learning, object-oriented design with the 
Common Lisp Object System. RJA 


Computer Science, T(16-18: 1, 2), P, L. Action
Semantics. Peter D. Mosses. Tracts in
Theor. Comp. Sci., V. 26. Cambridge Univ 
Pr, 1992, xx + 372 pp, $49.95. [ISBN 0-521- 
40347-2] Action semantics concepts and for- 
malism, action notation, and examples of ac- 
tion semantic descriptions. Relates action se- 
mantics to other frameworks, briefly sketches 
its development. Appendices cover formal de- 
tails, serve as complete, self-contained refer- 
ence manual. RWJ 


Computer Science, T(17-18: 1). Algorith- 
mic Algebra. Bhubaneswar Mishra. Texts & 
Mono. in Comp. Sci. Springer-Verlag, 1993, 
xii + 416 pp, $39.95. [ISBN 0-387-94090-1] 
Effective algorithms for computing various algebraic
structures: Gröbner bases, characteristic
sets, resultants, and semi-algebraic sets.
For anyone who wants to understand the al- 
gorithmic underpinnings of computer algebra 
systems. DS 


Applications (Engineering), C. HP 48SX Engineering
Mathematics Library. John F. Holland.
Academic Pr, 1992, xxiv + 632 pp,
$139.95. [ISBN 0-12-352380-X] Book docu- 
ments an immense library (over 700 programs, 
supplied on an HP memory card, included) of 
HP 48SX-based mathematical functions, pro- 
grams, and applications, especially useful in 
engineering mathematics. Includes utilities for 
complex analysis, special functions, linear al- 
gebra, statistics, signal processing, differential 
equations, engineering applications, plotting of 
all sorts, etc. A reference manual both to 
software and (via appendices) to considerable 
amount of underlying mathematics. PZ


Applications (Fluid Dynamics), T(15-16: 1).
A Mathematical Introduction to Fluid Mechan- 
ics, Third Edition. Alexandre J. Chorin, Jer- 
rold E. Marsden. Texts in Appl. Math., V. 4. 
Springer-Verlag, 1993, xi + 169 pp, $34. [ISBN
0-387-97918-2] Informal mathematical intro- 
duction to theory of fluid mechanics. Concepts 
are carefully motivated and developed, sometimes
informally rigorous. Many examples, figures.
(Second Edition, TR, February 1991.) DS


Applications (Physical Science), T(17: 2), P.
Spectral Analysis for Physical Applications: 
Multitaper and Conventional Univariate Tech- 
niques. Donald B. Percival, Andrew T. Walden. 
Cambridge Univ Pr, 1993, xxvii + 583 pp, 
$89.95; $39.95 (P). [ISBN 0-521-35532-X; 0- 
521-43541-2] Introduction to univariate spec- 
tral analysis. Covers traditional topics and re- 
cent advances. Numerous, useful exercises. 
Clear exposition; many examples motivate and 
implement the theory. MK 


Applications (Physical Science), P. Statis- 
tics and Physical Oceanography. National Re- 
search Council. National Academy Pr, 1993, x 
+ 62 pp, (P). Monograph on research problems 
in statistics and applied probability that arise in
oceanography. Topics include statistical issues 
in multiple-scale variability of oceanographic 
fields; Lagrangian and Euler data and mod- 
els; interpolation, non-linear smoothing, filter- 
ing, and prediction; model comparison; non- 
Gaussian random fields. MK 


Reviewers 


RJA: Richard J. Allen, St. Olaf; KB: Karla Ballman, 
Macalester; LC: Laura Chihara, St. Olaf; AD: Amy Davi- 
dow, Macalester; HD: Hung Dinh, Macalester; SG: Steven 
Galovich, Carleton; TH: Tom Halverson, Macalester; BH: 
Bruce Hanson, St. Olaf; RWJ: Roger W. Johnson, Car- 
leton; MK: Michael Kahn, St. Olaf; LCL: Loren C. Lar- 
son, St. Olaf; RM: Richard Molnar, Macalester; JO: Jeff 
Ondich, Carleton; AO: Arnold Ostebee, St. Olaf; MLR: 
Margaret L. Reese, St. Olaf; MPR: Matthew P. Richey, 
St. Olaf; AWR: A. Wayne Roberts, Macalester; KS: Karen 
Saxe, Macalester; JS: John Schue, Macalester; DS: Dan 
Schwalbe, Macalester; KES: Kay E. Smith, St. Olaf; MW: 
Martha Wallace, St. Olaf; PZ: Paul Zorn, St. Olaf.


No matter how you
express it, it still means
DERIVE is half price.


The DERIVE A Mathematical
Assistant program lets
you express yourself symbolically,
numerically and
graphically, from algebra
through calculus, with vectors
and matrices too, all
displayed with accepted
math notation, or 2D and 3D
plotting. DERIVE is also easy
to use and easy to read,
thanks to a friendly, menu-driven
interface and split or
overlay windows that can
display both algebra and
plotting simultaneously.
Better still, DERIVE has been
praised for the accuracy and
exactness of its solutions.
But, best of all, the suggested
retail price is now only $125.
Which means DERIVE is now
half price, no matter how you
express it.


DERIVE is a
registered trademark of
Soft Warehouse, Inc.




System requirements 


DERIVE: MS-DOS 2.1 or
later, 512K RAM, and one 3½"
disk drive. Suggested retail
price now $125 (Half off!).


DERIVE ROM card: 

Hewlett Packard 95LX & 
100LX Palmtop, or other PC 
compatible ROM card 
computer. Suggested retail 
price now $125! 


DERIVE XM 

(eXtended Memory): 

386 or 486 PC compatible
with at least 2MB of extended
memory. Suggested list
price now $250!


Soft Warehouse, Inc. • 3660 Waialae Ave.
Ste. 304 • Honolulu, HI, USA 96816-3236
Ph: (808) 734-5801 • Fax: (808) 735-1105


NEW IN THE SPECTRUM SERIES 
The Words of Mathematics 


An Etymological Dictionary of Mathematical Terms Used in English 


Steven Schwartzman


The Words of Mathematics explains the origins
of over 1500 mathematical terms used in English. 
While other dictionaries of mathematics define 
technical terms, this book concentrates on where 
those terms came from and what their literal 
meanings are. The words included here range 
from simple to advanced. Elementary school 
teachers may be surprised to learn that inch and 
ounce are really the same word, that eleven means 
literally “one left over,” that a thousand is a 
“swollen hundred,” and that the original meaning 
of times was “divide.” High school teachers will 
find out that asymptote means “not falling to- 
gether,” that an area used to be a threshing floor, 
and that focus is a Latin word meaning “fire- 
place.” College teachers who want to explain 
heteroscedastic, amphicheiral, and eigenvalue to 
their students will find the origins of those words 
in this book. 

This dictionary is easy to use. Although some of 
the entries are highly technical, the book explains 
them in plain English. The introduction gives an 
overview of how the ancient language known as 
Indo-European developed into Latin, Greek, 
French, and English, the languages from which 
most of our mathematical vocabulary has been 
derived. Another section discusses the many ways
in which mathematicians have borrowed and created
their specialized vocabulary over the centuries.
A glossary explains historical and linguistic
terms used throughout the book. 

As in any dictionary, the entries themselves are 
arranged alphabetically. The words are drawn 
from arithmetic, algebra, geometry, trigonom- 
etry, calculus, number theory, topology, statis- 
tics, graph theory, logic, recreational mathemat- 
ics, and other areas. Over 200 illustrations accom- 
pany the dictionary entries, especially some of the 
less familiar ones. Connections to related 
nonmathematical English words are often pointed 
out. Key numbers attached to many entries lead 
interested readers to an appendix which groups 
mathematical terms that come from a common 
source. 

This dictionary is an indispensable reference for 
every library that serves teachers and students of 
mathematics. It is a natural source of information
for courses in the history of mathematics and for 
mathematics courses intended for liberal arts students.


A Century of Mathematics
Through the Eyes of the Monthly 


John Ewing, Editor 


The Monthly is 100 years old! 


We are celebrating this special birthday with the publication of 
A Century of Mathematics: Through the Eyes of the Monthly. 


This is the story of American mathematics during 
the past century. It contains articles and excerpts 
from a century of the Monthly, giving the reader
an opportunity to skim all one hundred volumes 
without actually opening them. It samples math- 
ematics year by year and decade by decade. 
Along the way, readers can glimpse the math- 
ematical community at the turn of the century, 
and the divisions between the mathematical com- 
munities of teachers and researchers. They read 
about the struggle to prevent colleges from elimi- 
nating mathematics requirements in the 1920s, 
the controversy about Einstein and relativity, the 
debates about formalism in logic, the immigra- 
tion of mathematicians from Europe, and the 
frantic effort to organize as the war began. At the 
end of the war, they hear about new divisions 
between pure and applied mathematics, heroic 
efforts to deal with large numbers of new students 
in the universities, and the rise of federal funding 
for mathematics. In more recent times, they see 
the advent of computers and computer science, 
the problems faced by women and minorities, and 
some of the triumphs of modern research.


This is a book about mathematics: about teaching
and research, applied and pure, elite universities
and community colleges. Browsing through
its pages, readers see what has changed (the kinds 
of mathematics in fashion, for example) and what 
has stayed the same (our concern about teaching 
and our complaints about the deplorable state of 
our students). 


This is a book about history—a sampling of 
history, meant to be savored rather than studied. 
For one hundred years, the Monthly has con- 
tained articles by some of the greatest mathema- 
ticians in the world, as well as articles by students 
and faculty from small midwestern colleges where 
those great names were barely known. This book 
gives a glimpse of both worlds. It tells a story 
rather than the details of history. 


This is the story of a century of mathematics in 
America.


Knot Theory


Charles Livingston 


I learned more about knots after an hour with the book 
than I thought I could, and I am glad that it is here on
my desk so that I may spend more time with it and, I 
hope, learn more. —Paul Halmos 


Knot Theory, a lively exposition of the math- 
ematics of knotting, will appeal to a diverse 
audience from the undergraduate seeking ex- 
perience outside the traditional range of studies 
to mathematicians wanting a leisurely introduc- 
tion to the subject. Graduate students begin- 
ning a program of advanced study will find a 
worthwhile overview, and the reader will need 
no training beyond linear algebra to under- 
stand the mathematics presented. 

Over the last century, knot theory has pro- 
gressed from a study based largely on intuition 
and conjecture into one of the most active areas 
of mathematical investigation. Knot Theory 
illustrates the foundations of knotting as well as 
the remarkable breadth of techniques it em- 
ploys—combinatorial, algebraic, and geomet- 
ric. 

The interplay between topology and algebra, 
known as algebraic topology, arises early in the 
book, when tools from linear algebra and from 
basic group theory are introduced to study the 
properties of knots, including the unknotting 
number, the braid index, and the bridge number.
Livingston guides you through a general
survey of the topic showing how to use the 
techniques of linear algebra to address some 
sophisticated problems, including one of math- 
ematics’ most beautiful topics, symmetry. The 
book closes with a discussion of high-dimensional
knot theory and a presentation of some
of the recent advances in the subject: the
Conway, Jones and Kauffman polynomials. A
supplementary section presents the fundamen- 
tal group, which is a centerpiece of algebraic 
topology. 

An extensive collection of exercises is included. 
Some problems focus on details of the subject 
matter; others introduce new examples and 
topics illustrating both the wide range of knot 
theory and the present borders of our under- 
standing of knotting. All are designed to offer 
the reader the experience and pleasure of working
in this fascinating area.


The Search for E.T. Bell


also known as John Taine 


Constance Reid 


No one today writes about mathematics and mathemati- 
cians with more grace, knowledge, skill, and clarity, and 
no one is going to produce a more delightful, informative, 
accurate account of Eric Temple Bell and his work, and
that of his alter-ego, the prolific pioneer of science fiction,
John Taine. This is a fine book. —Martin Gardner


Eric Temple Bell has been one of my heroes for 60 years... I
congratulate Constance Reid on a remarkable achievement.
I hope it is greeted with the success it deserves, and
revives interest in an extraordinary and multi-talented 
man. —A. C. Clarke 


Eric Temple Bell (1883-1960) was a distinguished 
mathematician and a best selling popularizer of 
mathematics. His Men of Mathematics, still in print 
after almost sixty years, inspired scores of young 
readers to become mathematicians. Under the 
name “John Taine,” he also published science 
fiction novels (among them The Time Stream, Before 
the Dawn, and The Crystal Horde) that served to 
broaden the subject matter of that genre during its 
early years. 

In The Search for E.T. Bell, Constance Reid has
given us a compelling account of this complicated, 
difficult man who never divulged to anyone, not 
even to his wife and son, the story of his early life 
and family background. Her book is thus more of 
a mystery than a traditional biography. It begins 
with the discovery of an unexpected inscription in 
an English churchyard and a series of cryptic 
notations in a boy’s schoolbook. Then comes an 
inadvertent revelation, by Bell himself, in a re- 
spected mathematical journal... You will have to 
read the book to learn the rest.

Originally agreeing to write only a profile of
Bell, Mrs. Reid soon found herself involved in a 
full-length biography. The discoveries she made in 
the course of her five years of research will neces- 
sitate a fresh evaluation of his extensive math- 
ematical work and his science fiction novels as well 
as the revision of almost every statement currently 
in print about his family background and early life. 
Mrs. Reid is already well known as the author of 
acclaimed biographies of David Hilbert, Richard 
Courant, and Jerzy Neyman. 


Includes a collection of over 75 photographs.


Game Theory and Strategy


Philip D. Straffin, Jr. 


This valuable addition to the New Mathematical 
Library series pays careful attention to applica- 
tions of game theory in a wide variety of disci- 
plines. The applications are treated in consider- 
able depth. The book assumes only high school 
algebra, yet gently builds to mathematical think- 
ing of some sophistication. Game Theory and 
Strategy might serve as an introduction to both 
axiomatic mathematical thinking and the funda- 
mental process of mathematical modelling. It 
gives insight into both the nature of pure math- 
ematics, and the way in which mathematics can be 
applied to real problems. 

Since its creation by John von Neumann and 
Oskar Morgenstern in 1944, game theory has 
contributed new insights to business, politics, eco- 
nomics, social psychology, philosophy, and evolu- 
tionary biology. In this book, the fundamental 
ideas of game theory share the stage with applica- 
tions of the theory. How might strategic business 
decisions depend on information about a rival 
company, and how much would such information 
be worth? When is it advantageous to vote for a 
candidate who is not your favorite? What are the 
optimal strategies for teams in the football draft, 
and what paradoxes can result from following
those strategies? What is a fair way to share the
costs of a development project? What can we learn 
about the problem of “free will” by imagining 
playing a game with an omnipotent Being? How 
might natural selection lead to altruistic behavior 
in animal species? Game theory gives insight into 
all of these questions. 

The book includes many exercises, with answers, 
which allow the reader to try out calculations, and 
explore alternative formulations of game-theoretic
ideas.


Essays in Humanistic
Mathematics 


Alvin White, Editor 


A dazzling array of essayists reveal humanistic 
mathematics in this volume, and in so doing go 
beyond the facts, formulas, and algorithms that most 
students associate with mathematics to a presenta- 
tion of mathematics as an intellectual discipline with 
a human perspective and a significant history. Hu- 
manistic mathematics challenges dogmatic teach- 
ing styles that expect students to parrot the lecturer. 
It demands creativity from both the teacher and 
student. 

Teaching mathematics humanistically seeks to 
place the student more centrally in the position of 
inquirer than is generally the case, while at the same 
time acknowledging the emotional climate of the 
activity of learning mathematics. This type of 
teaching encourages students to learn from each 
other and to better understand mathematics as socially
constructed knowledge, rather than as an
arbitrary discipline. 

Teaching humanistic mathematics brings the fo- 
cus less upon the nature of the teaching and learning 
environment and more upon the need to reconstruct 
the curriculum and the discipline of mathematics 
itself. This reconstruction relates mathematical 
discoveries to personal courage, discovery to verification,
mathematics to science, truth to utility, and
mathematics to the culture in which it is embedded.

The humanistic mathematics movement, which 
began as the personal vision of a few, has now 
become a major part of mathematical culture. What 
was viewed with skepticism is now accepted and 
expected. Humanistic mathematics is not a new 
discovery. It is a recent rediscovery of ideas that go
back to Plato. It has provided a vocabulary for 
previously unarticulated concepts and approaches. 

The essays in this volume illustrate and help to 
define humanistic mathematics. The variety and 
scope indicate the richness and fruitfulness of the 
concept. Although each essay is independent, a 
sense of unity emerges. A glimpse at the table of 
contents will give you an idea of the excitement and 
range of the ideas presented.


Memorabilia Mathematica


The Philomath’s 
Quotation Book 


Robert Edouard Moritz 


When Robert Edouard Moritz compiled his book 
of quotations, Memorabilia Mathematica, which 
appeared in 1914, he stated that his primary 
objective was to seek out the exact statement of and
exact references for famous passages about math- 
ematics. He searched the writing not only of 
mathematicians, but poets, philosophers, histori- 
ans, statesmen, and scientists as well. His sources 
ranged from the works of Plato to the writings of 
Hilbert and Whitehead. His second objective was 
to produce a volume that would be a source of 
pleasure, encouragement, and inspiration to both 
mathematicians and non-mathematicians alike. 

This work was a ten-year labor of love, and it is 
a tribute to his discerning eye that this selection of 
passages should remain one of the most stimulat- 
ing works about mathematics ever published. It 
was the first collection of its kind in English and it 
conveys a sense of the full range of mathematics, its 
enormous accomplishments, and the living per- 
sonalities of great mathematicians. 

The more than eleven-hundred fully anno- 
tated selections in this book, gathered from the 
works of three hundred authors, cover a vast range 
of subjects pertaining to mathematics. Grouped 
in twenty-one chapters, they deal with such topics 
as the definitions and objects of mathematics; the 
teaching of mathematics; mathematics as a language
or as a fine art; the relationship of mathematics
to philosophy, to logic, or to science; the
nature of mathematics, and the value of mathematics.
Other sections contain passages referring to
specific subjects in the field such as arithmetic, 
algebra, geometry, calculus, and modern math- 
ematics. Of special interest is the extensive amount 
of material on great mathematicians which pro- 
vides irreplaceable glimpses into the lives and 
personalities of mathematical giants. 

To mathematicians the book will be a great 
source of pleasure, inspiration, and encourage- 
ment. To teachers of mathematics and writers 
about mathematics, it will remain of inestimable 
value as a source of quotations and ideas. To the 
layperson, it will be a revelation. It should dispel 
forever the narrow notion that mathematics is a 
cut-and-dried affair, isolated from other compartments
of life and thought.


Out of the Mouths
of Mathematicians 


A Quotation Book for Philomaths 


Rosemary Schmalz 


Published as a companion volume to Robert 
Edouard Moritz’s Memorabilia Mathematica, Rose- 
mary Schmalz’s Out of the Mouths of Mathemati- 
cians picks up where Moritz left off. Her work will 
give you a sense of the “story” of twentieth century 
mathematics. You will encounter the mathemati- 
cians, their collaborations and disputes, the move- 
ment from abstraction to application, the emer- 
gence of new areas of research, the impact of 
computers on mathematics, the challenges in math- 
ematics education, and more. 

Out of the Mouths of Mathematicians: A Quo- 
tation Book for Philomaths is a compilation of 727 
quotations from 292 contributors, almost all of 
whom are twentieth century mathematicians. Tak- 
ing the advice of Abel to learn mathematics by 
reading the masters, the author offers the reader a 
unique perspective on this century’s mathematics 
through the words of the mathematicians who are 
its creators. Stories about these mathematicians, 
their exhortations to their students, their descrip- 
tions of their efforts, successes, and failures, all 
make this century's mathematics come alive. The 
book also offers readers the opportunity to broaden 
their ideas about what mathematics is by offering 
many definitions of mathematics, making com- 
parisons of mathematics to computing and to the 
fine arts, and showing similarities between many 
aspects of mathematics and religion. The com- 
plete reference for each quotation allows the reader 
to continue exploration into a favorite area. A 
large topic index makes the book quite user-friendly.
Some of the subject categories include:
The Development of Mathematics, Exhortations
to Aspiring Mathematicians, Pure and Applied 
Mathematics, About Mathematicians (by name), 
Anecdotes and Miscellaneous Humor, Particular 
Disciplines in Mathematics, Moments of Mathematical
Insight, Mathematics and the Arts, ... and
much more.

This book will give pleasure to any philomath. 
It can be used to facilitate a literature search or to 
give quick access to an appropriate quote for writ- 
ers and speakers. It will be particularly useful to 
teachers of mathematics at all levels, to encourage, 
motivate, and amuse their students. Along with R. 
E. Moritz’s earlier book of this type, Memorabilia 
Mathematica: The Philomath’s Quotation Book, it 
offers the story of mathematics from its primary
source, the mathematicians themselves.

NEW IN THE SPECTRUM SERIES


Complex Numbers and Geometry 


Liang-shin Hahn 


The purpose of this book is to demonstrate that 
complex numbers and geometry can be blended 
together beautifully, resulting in easy proofs and 
natural generalizations of many theorems in plane 
geometry—such as the Napoleon theorem, the 
Ptolemy—Euler theorem, the Simson theorem, 
and the Morley theorem. 


Beginning with a construction of complex num- 
bers, readers are taken on a 140-page guided tour 
that includes something for everyone, even those 
with advanced degrees in mathematics. Yet, the 
entire book is accessible to a talented high-school 
student.


The book is self-contained—no background in 
complex numbers is assumed—and can be cov- 
ered at a leisurely pace in a one-semester course. 
Many of the chapters can be read independently. 
Over 100 exercises are included. The book would 
be suitable as a text for a geometry course, or for 
a problem solving seminar, or as enrichment for 
the student who wants to know more.


Table of Contents
1. Complex Numbers 


Introduction to Imaginary Numbers; Definition 
of Complex Numbers; Quadratic Equations; Sig- 
nificance of the Complex Numbers; Order Rela- 
tion in the Complex Field; The Triangle Inequal- 
ity; The Complex Plane; Polar Representation of 
Complex Numbers; The nth Roots of 1; The 
Exponential Functions; Exercises 


2. Applications to Geometry 


Triangles; The Ptolemy—Euler Theorem; The 
Clifford Theorems; The Nine-Point Circle; The 
Simson Line; Generalizations of the Simson 
Theorem; The Cantor Theorems; The Feuerbach 
Theorem; The Morley Theorem; Exercises 


3. Möbius Transformations


Stereographic Projection; Möbius Transformations;
Cross Ratios; The Symmetry Principle; A
Pair of Circles; Pencils of Circles; Fixed Points
and the Classification of Möbius Transformations;
Inversions; The Poincaré Model of a Non-Euclidean
Geometry; Exercises

NEW IN THE SPECTRUM SERIES


Cryptology 


Albrecht Beutelspacher 


GR BRX XQGHUVWDQG WKLV? If you
can’t decipher this coded message, you must 
read this book! 


How can messages be transmitted secretly? 
How can one guarantee that the message ar- 
rives safely in the right hands exactly as it was 
transmitted? Cryptology—the art and science 
of “secret writing” —provides ideal methods to 
solve these problems of data security. 


Technology advances have stimulated interest 
in the study of cryptology. Of course, comput- 
ers can break cryptosystems much more effi- 
ciently than humans can. Computers allow 
complex and sophisticated mathematical tech- 
niques which achieve a degree of security un- 
dreamt of by previous generations. Today the 
applications of cryptology range from the en- 
cryption of television programs sent via satel- 
lite, to user authentication of computers, to new 
forms of electronic payment systems using 
smart cards. 


The first half of the book studies and analyzes 
classical cryptosystems. Here we find Caesar’s 
cipher, the Spartan scytale, the Vigenère cipher,
and more. The theory of cipher systems
is presented, including a description of the best
possible cipher, the one-time pad. An introduction
to linear shift registers, which serve as
building blocks for most presently used ciphers,
is also given.


The second half of the book looks at the excit- 
ing new directions of public-key cryptology, 
which since its invention in 1976, has revolu- 
tionized data security. The author also looks at 
the famous RSA-algorithm, algorithms based 
on “discrete logarithms,” the so-called zero- 
knowledge algorithms, and the smart cards that 
bring cryptographic services to the man-on- 
the-street. 


Although the mathematics covered is nontrivial, 
the book is fun to read, and the author presents 
the material clearly and simply. Many exer- 
cises and references accompany each chapter. 
The book will appeal to a wide audience includ- 
ing teachers, students, and the interested lay- 
man. 


Cryptology was originally published in German
by Vieweg. This edition has been extensively
revised.

Proofs Without Words
Exercises in Visual Thinking 


Roger B. Nelsen 


Just what are “proofs without words?” First of all,
most mathematicians would agree that they certainly
are not “proofs” in the formal sense. Indeed,
the question does not have a simple answer. Proofs
without words are generally pictures or diagrams
that help the reader see why a particular mathematical
statement may be true, and how one could
begin to go about proving it. While in some proofs
without words an equation or two may appear to
help guide that process, the emphasis is clearly on
providing visual clues to stimulate mathematical
thought. Proofs without words bear witness to the
observation that often in the English language to
see means to understand, as in “to see the point of an
argument.”


Proofs without words have a long history. In this 
collection you will find modern renditions of proofs 
from ancient China, classical Greece, twelfth-cen- 
tury India—even one based on a published proof 
by a former President of the United States! How- 
ever, most of the proofs are more recent creations, 
and many are taken from the pages of MAA jour- 
nals. 


The proofs in this collection are arranged by topic
into six chapters: Geometry and Algebra; Trigonometry,
Calculus and Analytic Geometry; Inequalities;
Integer Sums; Sequences and Series; and
Miscellaneous. Teachers will find that many of the
proofs in this collection are well suited for classroom
discussion and for helping students to think
visually in mathematics.


The readers of this collection will find enjoyment 
in discovering or rediscovering some elegant visual 
demonstrations of certain mathematical ideas that 
teachers will want to share with their students. 
Readers may even be encouraged to create new 
“proofs without words.”

EXCURSIONS IN CALCULUS:
An Interplay of the Continuous and the Discrete


Robert M. Young 


An excellent source of projects for well motivated
students. This list of 463 references is a valuable
aid for those who wish to dig deeper. —CHOICE


The presentation is clear and the topics very
interesting ... fully accessible to students for whom
the book is intended. The book will be influential in
awakening students' awareness for good classical
mathematics. —Paulo Ribenboim


Printed with eight full-color plates. 


The purpose of this book is to explore, within the 
context of elementary calculus, the rich and el- 
egant interplay that exists between the two main 
currents of mathematics, the continuous and the 
discrete. Such fundamental notions in discrete 
mathematics as induction, recursion, combinato- 
rics, number theory, discrete probability, and the 
algorithmic point of view as a unifying principle are 
continually explored as they interact with tradi- 
tional calculus. The interaction enriches both. 


The book is addressed primarily to well-trained 
calculus students and their teachers, but it can 
serve as a supplement in a traditional calculus 
course for anyone who wants to see more.

Dolciani Mathematical Expositions, Number 13
The Mathematical Association of America


CONTENTS: 


• Infinite Ascent, Infinite Descent: The Principle
of Mathematical Induction

• Patterns, Polynomials, and Primes: Three
Applications of the Binomial Theorem

• Fibonacci Numbers: Function and Form

• On the Average

• Approximation: from Pi to the Prime Number
Theorem

• Infinite Sums: A Potpourri


The problems, taken for the most part from 
probability, analysis and number theory, are an 
integral part of the text. Many point the reader 
toward further excursions. There are over 400 
problems presented in this book.

Comments


High school mathematics is in a very unhealthy condition at the present time. When a 
doctor seeks to cure a patient, it is important for him to determine the cause of the malady. 

Fifty years ago, mathematics was king of the required subjects; mathematics was 
supposed to develop, strengthen, and discipline the mind. However, early in the present 
century, educators and administrators began to question seriously the value of the 
mathematics then being taught. Teachers with little knowledge of mathematics or rigor were 
constantly insisting on mathematical rigor. Mortality in mathematics was very high; and as 
the great influx of pupils of lesser ability began, this became a serious problem. Finally the 
entire doctrine of formal discipline was swept away ... 

Unfortunately, leaders in mathematics did not first put their house in order before 
essaying its defense. As a result we have had a forty-year war between the educator and the 
mathematician, with mathematics constantly on the losing end. Like most wars, this has been 
unnecessary. More than fifty years ago the great leaders, Klein, Tannery, Perry, Borel, Eliot, 
Judd, Young, and Moore, pointed the way to reform that would have given us mathematics 
so easy to defend that no defense would have been necessary. 

We can easily double the value of high school mathematics, make it far more 
interesting to our pupils and much easier to teach by heeding the advice given by E. H. 
Moore in 1903 and by dozens of other great leaders since his time. . . . 

To be brief at the expense of a certain amount of oversimplification, the reforms 
needed are as follows: 

1. We must stop trying to teach mathematics by the stupid "water-tight 
compartment" method . . . W. D. Reeve says, "Our traditional 'water-tight compartment'
method of teaching algebra, then geometry, then intermediate algebra, leads to a great deal 
of unnecessary repetition of subject matter that results in the loss of time and energy." 

2. We must remove the large amount of relatively useless material that is found in 
our best modern texts, and introduce more interesting and more practical mathematics to take 
the place of the dead wood eliminated. More than twenty-five years ago a committee of the 
Association reported: ". . . The present standards of drill work, largely on non-essentials, 
were set up about fifty years ago. A considerable number of teachers . . . believe that the 
amount of time spent by pupils on abstract work in difficult problems in division, factoring, 
fractions, simultaneous equations, radicals, et cetera, is excessive; that such work leads to 
nothing important in the science." 

3. We must have ability grouping in mathematics. There must be at least two and 
at best three types of levels of mathematics in high schools. ... We have always had ability 
grouping in athletics. If the results in the teaching of mathematics were as immediately 
determined as in athletics, we should have ability grouping at once. Ability grouping is the 
essence of true democracy, that is, each pupil entitled to be developed to his limit without 
being held back by less capable pupils. . . . 

If the problem can be solved, it will be worth many millions of dollars yearly to the 
country and will be one of the most important projects in the history of American education. 
To secure the needed and long overdue reforms in mathematics we must have a committee 
of experts with adequate financing, so that they may devote their full time to this most 
important project. The final report should be printed and given wide distribution. Part of this 
report could be a set of specimen or model textbooks... 

— C. N. Shuster, MONTHLY 55(1948), 472-475

Juggling Drops and Descents


Joe Buhler, David Eisenbud, Ron Graham, and Colin Wright 


As circus and vaudeville performers have known for a long time, juggling is fun. In 
the last twenty years or so this has led to a surge in the number of amateur 
jugglers. It has been observed that scientists, and especially mathematicians and 
computer scientists, are disproportionately represented in the juggling community. 
It is difficult to explain this connection in any straightforward way, but music has 
long been known to be popular among scientists; juggling, like music, combines 
abstract patterns and mind-body coordination in a pleasing way. In any event, the 
association between mathematics and juggling may not be as recent as it appears, 
since it is believed that the tenth century mathematician Abu Sahl started out 
juggling glass bottles in the Bagdad marketplace ([3], p. 79).

In the last fifteen years there has been a corresponding increase in the 
application of mathematical and scientific ideas to juggling ([1], [2], [7], [11], [13],
[18]), including, for instance, the construction of a juggling robot ([8]). In this
article we discuss some of the mathematics that arises out of a recent juggling idea, 
sometimes called “site swaps.” It is curious that these idealized juggling patterns 
lead to interesting mathematical questions, but are also of considerable interest to 
“practical” jugglers. The basic idea seems to have been discovered independently 
by a number of people; we know of three groups or individuals that developed the 
idea around 1985: Bengt Magnusson and Bruce Tiemann ([12], [11]), Paul Klimek
in Santa Cruz, and one of us (C. W.) in conjunction with other members of the 
Cambridge University Juggling Association. A precursor of the idea can be found 
in [14]. 

Although our interests here are almost entirely mathematical, the reader 
interested in actual juggling or its history might start by looking at [21] and [19]; a 
leisurely discussion of site swaps, aimed at jugglers, can be found in [12]. 

In the first section we describe the basic ideas, and in the second section we 
prove the basic combinatorial result that counts the number of site swaps with a 
given period and a given number of balls. This theorem has a non-obvious 
generalization to arbitrary posets ([6]). Special cases of that result can be inter- 
preted in terms of an interesting generalization of site swaps; we find it delightful 
that a question arising from juggling leads to new mathematics which in turn may 
say something about patterns that jugglers might want to consider. 


1. JUGGLING. As mathematicians are in the habit of doing, we start by throwing
away irrelevant detail. In a juggling pattern we will ignore how many people or
hands are involved, ignore which objects are being used, and ignore the specific
paths of the thrown objects. We will assume that there is a fixed number of
objects (occasionally referred to as “balls” for convenience) and will pay attention
only to the times at which they are thrown, and will assume that the throw times
are periodic. Although much of the interest of actual juggling comes from peculiar
throws (behind the back, off the head, etc.), peculiar objects (clubs, calculus texts,
chain saws, etc.), and peculiar rhythms, we will find that the above idealization is
sufficiently interesting.

Suppose that you are juggling b balls in a constant rhythm. Since the throws 
occur at discrete equally-spaced moments of time, and since in our idealized world 
you have been juggling forever and will continue to do so, we identify the times $t$
of throws with integers $t \in \mathbb{Z} := \{\ldots, -2, -1, 0, 1, 2, \ldots\}$.

Since it would be silly to hold onto a ball forever, we assume that each ball is 
thrown repeatedly. We also assume that only one ball is thrown at any given time. 
With these conventions, a juggling pattern with b balls is described, for our 
purposes, by b doubly-infinite disjoint sequences of integers. 

The three ball cascade is perhaps the most basic juggling trick. Balls are thrown 
alternately from each hand and travel in a figure eight pattern. The balls are 
thrown at times 


ball 1: ..., -6, -3, 0, 3, 6, ...
ball 2: ..., -5, -2, 1, 4, 7, ...
ball 3: ..., -4, -1, 2, 5, 8, ...


This pattern has a natural generalization for any odd number of balls; if you tried 
to do this pattern with an even number of balls (in a symmetrical way) then two 
balls would collide at the middle of the figure eight. 

Another basic pattern, sometimes called the fountain or waterfall, is most 
commonly done with an even number of balls and consists of two disjoint circles of 
balls. 


Figure 1. A cascade.

Figure 2. A fountain.


The four ball waterfall gives rise to the four sequences $\{4n + a : n \in \mathbb{Z}\}$ of
throw times, for $a = 0, 1, 2, 3$.

The last truly basic juggling pattern is called the shower. In a shower the balls 
travel in a circular pattern, with one hand throwing a high throw and the other 
throwing a low horizontal throw. The shower can be done with any number of 
balls; most people find that the three ball shower is significantly harder than the 
three ball cascade. The three ball shower corresponds to the sequences 


ball 1: ..., -6, -5, 0, 1, 6, 7, ...
ball 2: ..., -4, -3, 2, 3, 8, 9, ...
ball 3: ..., -2, -1, 4, 5, 10, 11, ...


We should mention that although non-jugglers are often sure that they have 
seen virtuoso performers juggle 17 or 20 balls, the historical record for a sustained 
ball cascade seems to be nine. Enrico Rastelli, sometimes considered the greatest 
juggler of all time, was able to make twenty catches in a 10-ball waterfall pattern. 
Rings are somewhat easier to juggle in large numbers, and various people have 
been able to juggle 11 and 12 rings. 

Now we return to our idealized form of juggling. Given lists of throw times of b
balls, define a function $f: \mathbb{Z} \to \mathbb{Z}$ by

$$
f(x) = \begin{cases}
y & \text{if the ball thrown at time } x \text{ is next thrown at time } y, \\
x & \text{if there is no throw at time } x.
\end{cases}
$$

This function is a permutation of the integers. Moreover, it satisfies $f(t) \ge t$ for
all $t \in \mathbb{Z}$. This permutation partitions the integers into orbits which (ignoring the
orbits of size one) are just the lists of throw times.
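
To make the definition concrete, here is a minimal Python sketch (not from the article) that builds f from finite excerpts of the throw-time lists given earlier for the cascade and the shower. The helper names `next_throw_map` and `heights` are hypothetical, and truncating the doubly-infinite lists to a short window is an assumption made only so the example can run.

```python
# Finite excerpts of the throw-time lists for the 3-ball cascade and shower.
cascade = [
    [-6, -3, 0, 3, 6],
    [-5, -2, 1, 4, 7],
    [-4, -1, 2, 5, 8],
]
shower = [
    [-6, -5, 0, 1, 6, 7],
    [-4, -3, 2, 3, 8, 9],
    [-2, -1, 4, 5, 10, 11],
]


def next_throw_map(balls):
    """Return f on the listed times: f[x] is the next throw time of the ball thrown at x.

    The last listed time of each ball has no successor in the excerpt,
    so it is deliberately left out of the map.
    """
    f = {}
    for times in balls:
        for x, y in zip(times, times[1:]):
            f[x] = y
    return f


def heights(f):
    """Return the differences f(x) - x, keyed by throw time in increasing order."""
    return {x: y - x for x, y in sorted(f.items())}


cascade_heights = heights(next_throw_map(cascade))
shower_heights = heights(next_throw_map(shower))
print(cascade_heights)
print(shower_heights)
```

On these excerpts every cascade throw has difference 3, while the shower alternates between 1 (at even times) and 5 (at odd times).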

The function f(t) = t + 3 corresponds to the 3-ball cascade, which could be 
graphically represented as in FIGURE 4.

Similarly, the function f(t) = t + 4 represents the ordinary 4-ball waterfall. The
three ball shower corresponds to a function that has a slightly more complicated
description. The juggler is usually most interested in the duration $f(t) - t$
between throws which corresponds, roughly, to the height to which balls must be
thrown.

Figure 3. A shower.

Figure 4. $t \mapsto t + 3$.


Definition. A juggling pattern is a permutation $f: \mathbb{Z} \to \mathbb{Z}$ such that $f(t) \ge t$ for all
$t \in \mathbb{Z}$. The height function of a juggling pattern is $df(t) := f(t) - t$.


The three ball cascade has a height function df(t) = 3 that is constant. The 
three ball shower has a periodic height function whose values are ...5,1,5,1,.... 
The juggling pattern in FIGURE 5 corresponds to the function

$$
f(x) = \begin{cases}
x + 4 & \text{if } x \equiv 0, 1 \bmod 3, \\
x + 1 & \text{if } x \equiv 2 \bmod 3,
\end{cases}
$$

which is easily verified to be a permutation. The height function takes on the
values 4, 4, 1 cyclically. This trick is therefore called the “441” among those who
use the standard site swap notation. It is not terribly difficult to learn but is not a
familiar pattern to most jugglers.

Figure 5. 441
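
The claim that this f is a permutation can be spot-checked numerically. The following Python sketch (an illustration, with the hypothetical name `f441` and an arbitrarily chosen finite window) tests injectivity and surjectivity on a range of integers and lists the first few heights. It is a sanity check, not a proof.

```python
def f441(x):
    """The 441 pattern: x -> x + 4 if x = 0 or 1 (mod 3), x -> x + 1 if x = 2 (mod 3)."""
    return x + 4 if x % 3 in (0, 1) else x + 1


window = range(-30, 30)
images = [f441(x) for x in window]
hit = set(images)

# No two throw times in the window are sent to the same next throw time.
injective = len(hit) == len(images)

# Heights are at most 4, so any y has its preimage among y-4, ..., y.
# Targets at least 4 above the start of the window must therefore be hit.
surjective = all(y in hit for y in range(-26, 30))

first_heights = [f441(x) - x for x in range(9)]
print(injective, surjective, first_heights)
```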


Remarks: 


1. We refer to df(t) as the height function even though it more properly is a 
rough measure of the elapsed time of the throw. From basic physics the 
height is proportional to the square of the elapsed time. The elapsed time is 
actually less than df(t) since the ball must be held before being thrown. For 
a more physical discussion of actual elapsed times and throw heights 
see [11]. 

2. Although there is nothing in our idealized setup that requires two hands, or 
even “hands” at all, we note that in the usual two-handed juggling patterns,
a throw with odd throw height df(t) goes from one hand to the other, and a 
throw with even throw height goes from one hand to itself. 

3. If f(t) = t, so that df(t) = 0, then no throw takes place at time t. In actual
practice this corresponds to an empty hand. 

4. Nothing in our model really requires that the rhythm of the juggling pattern 
be constant. We only need a periodic pattern of throw times. We retain the 
constant rhythm terminology in order to be consistent with jugglers’ stan- 
dard model of site swaps. 

5. The catch times are irrelevant in our model. Thus a throw at time t of
height df(t) is next thrown at time t + df(t) = f(t), but in practice it is
caught well before that time in order to allow time to prepare for the next
throw. A common time to catch such a throw is approximately at time
f(t) - 1.5 but great variation is possible. A theorem due to Claude Shannon
([13], [7]) gives a relationship between flight times, hold times, and empty
times in a symmetrical pattern. 


Now let f be a juggling pattern. This permutation of Z partitions the integers 
into orbits; since $f(t) \ge t$, the orbits are either infinite or else singletons.


Definition. The number of balls of a juggling pattern f, denoted B(f), is the 
number of infinite orbits determined by the permutation f. 


Our first result says that if the throw height is bounded, which is surely true for 
even the most energetic of jugglers, then the number of balls is finite and can be 
calculated as the average value of the throw heights over large intervals. 


Theorem 1. If f is a bijection and $df(t) = f(t) - t$ is non-negative and bounded then
the limit

$$
\lim_{|I| \to \infty} \frac{\sum_{x \in I} df(x)}{|I|}
$$

exists and is equal to $B(f)$, where the limit is over all integer intervals
$I = \{a, a+1, \ldots, b\} \subset \mathbb{Z}$.

Proof: Suppose that $df(t) \le B$ for all $t$. If $I$ is an interval such that $|I| > B$ then
any infinite orbit intersects $I$. The sum of $df(t)$ over the points in $I$ lying in a given
infinite orbit is bounded above by $|I|$ and below by $|I| - 2B$. If $I$ is large enough
then the sum of $df(t)$ for $t \in I$, divided by $|I|$, can be made arbitrarily close to the
number of infinite orbits of f; the singleton orbits don’t contribute since $df(t) = 0$ for those
orbits. Thus in the limit the average of df over an interval $\{a, a+1, \ldots, b\}$ of
consecutive integers must become arbitrarily close to the number of infinite orbits
of the permutation. ∎

Figure 6. One orbit.

Remarks:
Remarks: 


1. The limit is clearly a uniform limit in the sense that for all positive $\varepsilon$ there is
an $m$ such that if $I$ is an interval of integers with more than $m$ elements
then the average of df over $I$ is within $\varepsilon$ of $B(f)$.

2. As an example illustrating the theorem we note if f is the 441 pattern 
described earlier, then the height function df(t) is periodic of period 3. The 
long term average of df(t) over any interval approaches the average over 
the period, i.e., (4 + 4 + 1)/3 = 3, which confirms what we already knew:
the 441 pattern is a 3-ball trick. 

3. The hypothesis of bounded throw heights is necessary. Indeed, if $T(0) = 0$
and, for nonzero $t$, $T(t)$ is the highest power of 2 that divides $t$ then the
pattern $f(t) = t + 2 \cdot T(t)$ has unbounded throw height and infinite $B(f)$,
as in FIGURE 7. More vividly: you can juggle infinitely many balls if you can
throw arbitrarily high.
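
For periodic patterns, Theorem 1 and the second remark can be compared against a direct orbit count. The sketch below is an illustration in Python under stated assumptions: the pattern is given by one period of heights, the input is assumed to be a valid juggling pattern, and the function names `balls_by_average` and `balls_by_orbits` are hypothetical.

```python
def balls_by_average(heights):
    """Average throw height over one period (Theorem 1 for a periodic pattern)."""
    total = sum(heights)
    if total % len(heights):
        raise ValueError("average height is not an integer")
    return total // len(heights)


def balls_by_orbits(heights):
    """Count infinite orbits by following each ball through one window.

    A throw of height at most B cannot jump over B consecutive integers,
    so every infinite orbit meets the window {0, ..., B - 1}.
    """
    n, bound = len(heights), max(heights)
    window = range(0, max(bound, 1))
    seen, count = set(), 0
    for t in window:
        if heights[t % n] == 0 or t in seen:
            continue  # empty hand, or a ball that was already counted
        count += 1
        u = t
        while u in window:
            seen.add(u)
            if heights[u % n] == 0:
                break  # cannot happen for a valid pattern; guards bad input
            u += heights[u % n]
    return count


for pattern in ([3], [4, 4, 1], [1, 5], [5, 0, 4], [5, 1, 4, 1, 4]):
    print(pattern, balls_by_average(pattern), balls_by_orbits(pattern))
```

Each of these sample patterns is a 3-ball pattern by either count.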


Figure 7. Infinitely many balls. 


2. PERIODIC JUGGLING. From now on we want to juggle periodically. A 
juggling pattern is perceived to be periodic by an audience when its height function 
is periodic in the mathematical sense. 


Definition. A period-n juggling pattern is a bijection $f: \mathbb{Z} \to \mathbb{Z}$ such that
$df(t + n) = df(t)$ for all $t \in \mathbb{Z}$.

If df is of period n then it might also have a period m for some divisor m of n.
If n is the smallest period of df then any other period is a multiple of n; in this
case we will say that f is a pattern of exact period n.

A period-n juggling pattern can be described by giving the finite sequence of 
non-negative integers df(t) for t = 0,1,...,n — 1. Thus the pattern 51414 de- 
notes a period-5 pattern; by Theorem 1 it is a 3-ball pattern since the “period 
average” of the height function df(t) is 3. 

Which finite sequences correspond to juggling patterns? Certainly a necessary 
condition is that the average must be an integer. However this isn’t sufficient. The 
sequence 354 has average 3 but does not correspond to a juggling pattern—if you 
try to draw an arrow diagram for a map f as above you'll find that no such map 
exists. This is also easy to see directly, for if df(1) = 5 and df(2) = 4 then 


$$f(1) = 1 + df(1) = 6 = 2 + df(2) = f(2)$$


and such a map isn’t a bijection. 


Remarks for Jugglers Only 


1. The above description is geared towards the standard model: two hands 
throwing alternately, in constant rhythm. In fact there could be any number 
of hands and it is not necessary to assume that the rhythm is constant. 

2. The practical meaning of the throw heights 0,1, and 2 in the standard 
model requires a little thought. A throw height of 0 corresponds to an 
empty hand. A throw height of 1 corresponds to a rapid shower pass from 
one hand to another that is thrown again immediately. A throw height of 2 
would ordinarily indicate a very low throw from a hand to itself that is 
thrown again by that hand immediately. This is actually rather unnatural in 
practice; the conventional interpretation ([11], [12]) is that a throw height of
2 is a held ball. 

3. The paradigm for categorizing juggling patterns here is very interesting in 
practice, although many of the patterns require considerable proficiency. 
Several jugglers who have spent time in working on site swaps describe the 
same gain in flexibility and conceptual power that mathematicians seem to 
report from the use of well-chosen abstractions. The simplest non-obvious 
site-swap seems to be 441; it is similar to, but not the same as, the common 
3-ball pattern of throwing balls up on the side while passing a ball back and 
forth underneath in a shower pass from hand to hand. The 3-ball 45141 
pattern is also amusing, and the 4-ball 5551 pattern looks very much like the 
5-ball cascade. The range of feasible and interesting tricks seems to be 
unlimited; we mention the following sample: 234, 504, 345, 5551, 40141, 561, 
633, 55514, 7562, 7531, 566151, 561, 663, 771, 744, 753, 426, 459, 9559, 831. 

4. A number of programs are available that simulate site swaps on a computer
screen, sometimes with quite impressive graphics. These programs take a
finite sequence of non-negative integers as input and dynamically represent 
the pattern. The Internet news group rec.juggling is a source of information 
on site swaps and various juggling animation software. 


In order to find out which finite sequences represent juggling patterns we start 
by noting that a period-n pattern induces a permutation on the first n integers.


Lemma. If f is a period-n juggling pattern then

$$s \equiv t \bmod n \implies f(s) \equiv f(t) \bmod n.$$

Proof: If $df(t)$ is periodic of period n then the function $f(t) = t + df(t)$ is of
period n modulo n. ∎


The Lemma implies that a juggling pattern f induces a well-defined injective,
and hence bijective, mapping on the integers modulo n. Let $[n]$ denote the set
$\{0, 1, \ldots, n - 1\}$ and let $S_n$ denote the symmetric group of all permutations
(bijections) of the set $[n]$. Then for every period-n juggling pattern f there is a
well-defined permutation $\pi_f \in S_n$ that is defined by the condition

$$f(t) \equiv \pi_f(t) \bmod n, \qquad 0 \le t < n.$$


Theorem 2. A sequence $a_0 a_1 \cdots a_{n-1}$ of non-negative integers satisfies $df(t) = a_t$
for some period-n juggling pattern f if and only if $a_t + t \bmod n$ is a permutation
of $[n]$.


Proof: Suppose that f is a juggling pattern and $a_t = df(t)$. Then $f(t) \equiv \pi_f(t)$
mod n so there is an integer-valued function $g(t)$ such that $f(t) = \pi_f(t) + n \cdot g(t)$ and

$$df(t) = f(t) - t = \pi_f(t) - t + n \cdot g(t)$$

and

$$a_t + t = df(t) + t \equiv \pi_f(t) \bmod n$$

and $a_t + t$ is a permutation as claimed.

Conversely, suppose that

$$a_0 a_1 \cdots a_{n-1}$$

is such that $a_t + t$ is a permutation of $[n]$. If we define $a_t$ for all integers t by
extending the sequence periodically and then define $f(t) = a_t + t$ then f is the
desired juggling pattern. To see that f is injective note that if $f(t) = f(u)$ then
$t \equiv u \bmod n$ since $f(t)$ is injective modulo n. Then $a_t = a_u$. From $f(t) = a_t + t =
f(u) = a_u + u$ it follows that $t = u$ and f is injective as claimed. To show that f is
surjective, suppose that $u \in \mathbb{Z}$. Since $t + a_t \bmod n$ is a permutation of $[n]$ we can
find a t such that $f(t) = t + a_t \equiv u \bmod n$. By adding a suitable multiple of n we
can find a $t'$ such that $f(t') = u$. This finishes the proof of the fact that any
sequence satisfying the stated condition comes from a juggling pattern. ∎


To see if 345 corresponds to a juggling pattern we add t to the t-th term and
reduce modulo 3. The result is 021, which is a permutation, so 345 is indeed a 
juggling pattern (in fact a somewhat difficult one that is quite amusing). On the 
other hand, the sequence 354 leads, by the same process, to 000 which certainly 
isn’t a permutation of [3]. 
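
The test of Theorem 2 is mechanical, so it is easy to sketch in Python. The function names `landing_slots` and `is_juggling_sequence` below are hypothetical and chosen for illustration only.

```python
def landing_slots(seq):
    """Return [(a_t + t) mod n for t = 0, ..., n-1], the sequence tested in Theorem 2."""
    n = len(seq)
    return [(a + t) % n for t, a in enumerate(seq)]


def is_juggling_sequence(seq):
    """True if seq is non-negative and its landing slots form a permutation of [n]."""
    n = len(seq)
    return all(a >= 0 for a in seq) and sorted(landing_slots(seq)) == list(range(n))


print(landing_slots([3, 4, 5]), is_juggling_sequence([3, 4, 5]))
print(landing_slots([3, 5, 4]), is_juggling_sequence([3, 5, 4]))
```

The first line reproduces the residues 0, 2, 1 for 345, and the second the residues 0, 0, 0 for 354, where all three throws would land at the same time modulo 3.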

Let N(b, n) denote the number of period-n juggling patterns f with B(f) = b. 
Our goal is to calculate this number. From the juggler’s point of view it might be
more useful to count the number of patterns of exact period n and to count cyclic 
shifts of a pattern as being essentially the same as the original pattern. Later we 
will see that this more natural question can be answered easily once we know 
N(b, n). 

The basic idea in the determination of $N(b, n)$ is to fix a permutation $\pi \in S_n$
and count the number of patterns f such that $\pi_f = \pi$. From the proof of the
previous theorem we have the formula

$$f(t) = \pi_f(t) + n \cdot g(t) = \pi(t) + n \cdot g(t), \qquad 0 \le t < n.$$

Thus we must count the number of functions $g: [n] \to \mathbb{Z}$ such that if f is defined
by the above formula then $df(t) \ge 0$ and $B(f) = b$.

The number of balls of such a pattern f is equal to the average of $df(t)$ over
$[n]$. Thus

$$B(f) = \frac{1}{n} \sum_{t=0}^{n-1} df(t) = \frac{1}{n} \sum_{t=0}^{n-1} \bigl(\pi(t) - t + n \cdot g(t)\bigr).$$

Since $\pi(t)$ is a permutation of $[n]$ we see that this reduces to

$$B(f) = \sum_{t=0}^{n-1} g(t).$$

Thus a function g determines a pattern with $B(f) = b$ if the sum of its values is
equal to b.
The condition that $df(t) \ge 0$ is a little bit more intricate. Since

$$df(t) = \pi(t) - t + n \cdot g(t)$$

we see that $g(t)$ must be non-negative and also must be strictly positive whenever
$\pi(t) < t$.


Definition. An integer $t \in [n]$ is a drop for the permutation $\pi \in S_n$ if $\pi(t) < t$;
moreover, we define

$$d_\pi(t) = \begin{cases} 1 & \text{if $t$ is a drop for $\pi$} \\ 0 & \text{if $t$ is not a drop for $\pi$.} \end{cases}$$

Write $G(t) = g(t) - d_\pi(t)$ so that

$$f(t) = \pi(t) + n \cdot d_\pi(t) + n \cdot G(t).$$


Let $k$ be the number of drops of $\pi$. Then $B(f) = b$ if and only if the sum of the
values of $G$ is equal to $b - k$.

We can summarize this discussion so far as follows. The number $N(b, n)$ of
period-$n$ juggling patterns with $b$ balls is equal to the sum over all permutations
$\pi \in S_n$ of the number of non-negative functions $G(t)$ on $[n]$ whose value-sum is
$b - k$, where $k$ is the number of drops of $\pi$.

A standard combinatorial idea can be used to count the number of sequences of
non-negative integers with a given sum.


Lemma. The number of non-negative $n$-tuples with sum $x$ is

$$\binom{x + n - 1}{n - 1}.$$


Proof: A standard “stars and bars” argument (in Feller’s terminology, e.g., p. 38 of 
[9]) gives the answer. The number of such sequences is equal to the number of 
ways of arranging n — 1 bars and x stars in a row if we interpret the size of each 
contiguous sequence of stars as a component of the n-tuple and the bars as 
separating components. The number of such sequences of bars and stars is the 
same as the number of ways to choose $n - 1$ locations for the bars out of a total of
$x + n - 1$ locations, which is just the stated binomial coefficient. ∎
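
The lemma can be checked by brute force for small cases. The sketch below is only an illustration under stated assumptions: the helper name `count_tuples` is hypothetical, and the enumeration is deliberately naive (it is exponential in $n$).

```python
from itertools import product
from math import comb


def count_tuples(n, x):
    """Count n-tuples of non-negative integers with sum x by enumeration."""
    return sum(1 for t in product(range(x + 1), repeat=n) if sum(t) == x)


# Compare the enumeration with the binomial coefficient of the lemma.
for n in range(1, 5):
    for x in range(6):
        assert count_tuples(n, x) == comb(x + n - 1, n - 1)

print(count_tuples(3, 4), comb(4 + 3 - 1, 3 - 1))  # 15 15
```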

Let $\delta_n(k)$ be the number of permutations in $S_n$ that have $k$ drops. By
combining the earlier remark with the lemma we arrive at

$$N(b, n) = \sum_{k=0}^{n-1} \delta_n(k) \binom{n + b - k - 1}{n - 1}.$$


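Later it will be convenient to consider the number of period-$n$ juggling patterns
with fewer than $b$ balls. If this number is denoted $N_<(b, n)$ then, using a familiar
binomial coefficient identity, we find that

$$N_<(b, n) = \sum_{a=0}^{b-1} N(a, n) = \sum_{a=0}^{b-1} \sum_{k=0}^{n-1} \delta_n(k) \binom{n + a - k - 1}{n - 1}
= \sum_{k=0}^{n-1} \delta_n(k) \sum_{a=0}^{b-1} \binom{n + a - k - 1}{n - 1} = \sum_{k=0}^{n-1} \delta_n(k) \binom{n + b - k - 1}{n}.$$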


In order to simplify this further we recall the idea of a descent of a permutation 
and show that even though drops and descents aren’t the same thing, the number 
of permutations with k drops is the same as the number with k descents. 


Definition. If $\pi \in S_n$ then $i \in [n]$ is a descent of $\pi$ if $\pi(i) > \pi(i + 1)$, where
$0 \le i < n - 1$. The number of elements of $S_n$ with $k$ descents is denoted

$$\left\langle {n \atop k} \right\rangle$$

and is called an Eulerian number.
We will write permutations as a list of $n$ integers in which the $i$-th element is
$\pi(i)$, e.g.,

$$\pi(0)\pi(1)\ldots\pi(n - 1).$$

A descent in $\pi$ is just a point in this finite sequence in which the next term is
lower than the current term.


Example. The permutation 10432 in $S_5$ has three descents and two drops.


If $\pi$ is a permutation then it can also be written in cycle form in the usual way.
In order to specify this form uniquely we write each cycle with its largest element
first and arrange the cycles so that the leading elements of the cycles are in
increasing order, where we include the singleton cycles.


Definition. If $\pi \in S_n$ let $\hat\pi$ be the permutation that results from writing $\pi$ in cycle
form, as above, and then erasing parentheses.


Example. The permutation $\pi \in S_8$ corresponding to the sequence 16037425 has a
cycle decomposition (0162)(3)(475) that has the canonical form (3)(6201)(754).
Therefore $\hat\pi$ is 36201754.


Note that the map taking $\pi$ to $\hat\pi$ is bijective since $\pi$ can be uniquely
reconstructed from $\hat\pi$ by inserting left parentheses before every left-to-right
maximum and then inserting matching right parentheses. This permutation of $S_n$ is
certainly bizarre at first glance, but it plays a surprisingly crucial role in various
situations (see [5] or [15]).

Lemma. The number of permutations of $[n]$ with $k$ descents is equal to the number
with $k$ drops, i.e.,

$$\delta_n(k) = \left\langle {n \atop k} \right\rangle.$$

Proof: A descent of $\hat\pi$ must lie inside a cycle of $\pi$ since our conventions guarantee
that the last element in a cycle is followed by a larger integer. By the meaning of
the cycle decomposition of $\pi$ (namely, that elements within cycles are mapped to the
next element in the cycle) we see that a descent of $\hat\pi$ corresponds to a drop of $\pi$.
Conversely, a drop in $\pi$ must occur within a cycle (i.e., not in passing from the last
element of a cycle to the first) and corresponds to a descent in $\hat\pi$. Thus the number
of permutations with $k$ descents is equal to the number $\delta_n(k)$ with $k$ drops. ∎


Example, again. The permutation $\pi$ = 16037425 has drops at $t = 2, 5, 6, 7$, and
the permutation $\hat\pi$ = 36201754 has descents at $i = 1, 2, 5, 6$.
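
The correspondence in the lemma can be traced mechanically. The following sketch is an illustration only: the function names `drops`, `descents` and `hat` are hypothetical, and a permutation is assumed to be given as the list of its values $\pi(0), \ldots, \pi(n-1)$.

```python
from itertools import permutations


def drops(perm):
    """Positions t with perm[t] < t."""
    return [t for t, image in enumerate(perm) if image < t]


def descents(perm):
    """Positions i with perm[i] > perm[i + 1]."""
    return [i for i in range(len(perm) - 1) if perm[i] > perm[i + 1]]


def hat(perm):
    """Canonical cycle form with the parentheses erased."""
    seen = [False] * len(perm)
    cycles = []
    for start in range(len(perm)):
        if seen[start]:
            continue
        cycle, t = [], start
        while not seen[t]:
            seen[t] = True
            cycle.append(t)
            t = perm[t]
        top = cycle.index(max(cycle))            # rotate largest element to the front
        cycles.append(cycle[top:] + cycle[:top])
    cycles.sort(key=lambda c: c[0])              # leading elements in increasing order
    return [x for c in cycles for x in c]


pi = [1, 6, 0, 3, 7, 4, 2, 5]
print(hat(pi))            # [3, 6, 2, 0, 1, 7, 5, 4]
print(drops(pi))          # [2, 5, 6, 7]
print(descents(hat(pi)))  # [1, 2, 5, 6]

# The number of drops of pi equals the number of descents of hat(pi).
assert all(len(drops(p)) == len(descents(hat(p))) for p in permutations(range(5)))
```

The printed values reproduce the running example; the final assertion checks the drop and descent counts over all of $S_5$.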

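The Eulerian numbers $\delta_n(k) = \left\langle {n \atop k} \right\rangle$ play a role in a variety of combinatorial
questions beyond drops and descents ([10], [15], [16]), although no notation seems
to be standard yet. We recall some of their basic properties. If a permutation
$\pi = \pi(0)\pi(1)\ldots\pi(n - 1)$ has $k$ descents then its reversal
$\pi' = \pi(n - 1)\pi(n - 2)\ldots\pi(0)$ has $n - k - 1$ descents. Thus

$$\left\langle {n \atop k} \right\rangle = \left\langle {n \atop n - k - 1} \right\rangle. \tag{1}$$

By relating permutations of $[n]$ to permutations of $[n - 1]$ in the usual way, a more
involved combinatorial argument shows that

$$\left\langle {n \atop k} \right\rangle = (k + 1)\left\langle {n - 1 \atop k} \right\rangle + (n - k)\left\langle {n - 1 \atop k - 1} \right\rangle. \tag{2}$$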


Using this recursion, it is easy to tabulate Eulerian numbers; for $n \le 5$ they are
given in Table 1.


TABLE 1. Eulerian Numbers

  n \ k    0    1    2    3    4
  1        1
  2        1    1
  3        1    4    1
  4        1   11   11    1
  5        1   26   66   26    1


Finally, the Eulerian numbers arise as coefficients of the linear relations
connecting the polynomials $x^n$ with the polynomials $\binom{x + k}{n}$.


Worpitzky’s Identity.

$$x^n = \sum_{k=0}^{n-1} \left\langle {n \atop k} \right\rangle \binom{x + k}{n}.$$

This identity can be readily proved by induction using equation (2). It apparently
first appeared in [20] (see also [10] and [16]); in [15] it appears as a special
case of a much more general statement.
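
Both the recursion (2) and Worpitzky’s identity can be tested numerically. The sketch below is an illustration, not part of the paper: the function name `eulerian` is hypothetical, and the base case (one permutation of a single element, with no descents) is an assumption consistent with the definition above.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def eulerian(n, k):
    """Number of permutations of n elements with k descents, via recursion (2)."""
    if k < 0 or k >= max(n, 1):
        return 0
    if n <= 1:
        return 1
    return (k + 1) * eulerian(n - 1, k) + (n - k) * eulerian(n - 1, k - 1)


print([eulerian(5, k) for k in range(5)])  # [1, 26, 66, 26, 1]

# Symmetry (1) and Worpitzky's identity, checked for small n and x.
for n in range(1, 7):
    assert all(eulerian(n, k) == eulerian(n, n - k - 1) for k in range(n))
    for x in range(8):
        assert x ** n == sum(eulerian(n, k) * comb(x + k, n) for k in range(n))
```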

Theorem 3. The number of period-$n$ juggling patterns with fewer than $b$ balls is $b^n$,
i.e.,

$$N_<(b, n) = b^n.$$

Proof: Our previous formula for $N_<(b, n)$ was

$$N_<(b, n) = \sum_{k=0}^{n-1} \delta_n(k) \binom{n + b - k - 1}{n} = \sum_{k=0}^{n-1} \left\langle {n \atop k} \right\rangle \binom{n + b - k - 1}{n}.$$

Replace $k$ by $n - k - 1$ and use the symmetry (1) to get

$$N_<(b, n) = \sum_{k=0}^{n-1} \left\langle {n \atop k} \right\rangle \binom{b + k}{n}.$$

The claim is then an immediate consequence of Worpitzky’s identity. ∎
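
Theorem 3 can be confirmed by exhaustive search for small $b$ and $n$. This is a brute-force sketch under stated assumptions, not the method of the proof: the function names are hypothetical, a pattern is represented by its list of $n$ non-negative throws, and the number of balls is taken to be the average throw.

```python
from itertools import product


def is_juggling_sequence(throws):
    """True when t -> (throws[t] + t) mod n is a permutation of 0, ..., n-1."""
    n = len(throws)
    return len({(a + t) % n for t, a in enumerate(throws)}) == n


def count_fewer_than(b, n):
    """Count period-n patterns with fewer than b balls (b >= 1) by enumeration."""
    max_throw = n * (b - 1)  # no throw can exceed the largest admissible sum
    return sum(
        1
        for throws in product(range(max_throw + 1), repeat=n)
        if sum(throws) < n * b and is_juggling_sequence(throws)
    )


for b in range(1, 4):
    for n in range(1, 4):
        assert count_fewer_than(b, n) == b ** n

print(count_fewer_than(3, 3))  # 27
```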


The simplicity of the final result is surprising. The astute reader will note that 
we could have avoided introducing the concept of descents by proving equations 
(1) and (2) directly for the counting function $\delta_n(k)$ for drops. It is a pleasant
exercise to provide a direct combinatorial argument. We took the slightly longer 
route above because it is amusing and useful in proving the more general result in 
[6]. 

By the theorem there are $(b + 1)^n - b^n$ patterns of period $n$ with exactly $b$
balls if cyclic shifts are counted as distinct. Let $M(n, b)$ be the number of patterns
of exact period $n$ with exactly $b$ balls, where cyclic shifts are not counted as
distinct. Thus $M(n, b)$ is probably the number that is of most interest to a juggler.

If $d$ is a divisor of $n$ then each pattern of exact period $d$ will occur $d$ times as
a pattern of length $n$. Thus

$$(b + 1)^n - b^n = \sum_{d \mid n} d\,M(d, b).$$

By Möbius inversion we obtain the following corollary to the previous theorem.


Corollary.

$$M(n, b) = \frac{1}{n} \sum_{d \mid n} \mu(n/d)\bigl((b + 1)^d - b^d\bigr).$$


For instance, there are 12 genuinely distinct patterns with period three with 
three balls. The reader may find it instructive to list all of them explicitly. 
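
The corollary is simple to evaluate directly. The following is a small sketch, with hypothetical function names `mobius` and `distinct_patterns`; the Möbius function is computed by trial division, which is adequate only for small $n$.

```python
def mobius(m):
    """Moebius function of a positive integer, by trial division."""
    result, p = 1, 2
    while p * p <= m:
        if m % p == 0:
            m //= p
            if m % p == 0:
                return 0  # a squared prime factor
            result = -result
        p += 1
    return -result if m > 1 else result


def distinct_patterns(n, b):
    """M(n, b): patterns of exact period n with b balls, up to cyclic shift."""
    total = sum(
        mobius(n // d) * ((b + 1) ** d - b ** d)
        for d in range(1, n + 1)
        if n % d == 0
    )
    return total // n


print(distinct_patterns(3, 3))  # 12
```

As a consistency check, summing $d\,M(d, b)$ over the divisors $d$ of $n$ should return $(b + 1)^n - b^n$.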

Several people have reproved Theorem 3 from other points of view. Richard 
Stanley sent us a proof using results in [15]. Jeremy Kahn sent us a bijective proof 
using a different labeling function for juggling patterns. Walter Stromquist sent us 
an interesting bijective proof that uses a very curious relabeling of site swap 
patterns. Adam Chalcraft ([4]) sent us a proof using ideas similar to those of
Stromquist. It is striking that the result seems to be of considerable interest to a 
number of people. 

Several of these proofs are shorter than ours, and some are much closer to
being more transparent “bijective” proofs. However, the proof given here, in
addition to using some interesting combinatorics, is the special case of the proof of
the more general result in [6]. The basic motivation of that result is to replace the
set $[n]$ with an arbitrary poset. For some posets we can give a natural interpretation
of that more general result in terms of juggling patterns in which more than
one ball can be thrown at once, but we still haven’t been able to give a juggling
interpretation for arbitrary posets. After hearing of our results from Richard
Stanley, E. Steingrimsson reproved ([17]) the general results about posets using
results from his thesis. Among many other things, he generalizes the notions of
descents and drops (actually, in his terminology, a mirror notion he calls
“exceedances”) to certain wreath products of symmetric groups.


NOTE ADDED IN PROOF: In their recent preprint, “Juggling and applications to q-analogues,”
Richard Ehrenborg and Margaret Readdy give a q-analogue of our main result. In addition they
generalize the ideas to multiplex patterns (in which a hand can catch and throw more than one ball at
once) and give applications to q-Stirling numbers and the Poincaré series of an affine Weyl group.


REFERENCES 


1. H. Austin, A Computational View of the Skill of Juggling, M.I.T. Artificial Intelligence Laboratory,
1974.
2. P. J. Beek, Juggling Dynamics, Free University Press, Amsterdam, 1989.
3. J. L. Berggren, Episodes in the Mathematics of Medieval Islam, Springer-Verlag, 1986.
4. A. Chalcraft, manuscript in preparation.
5. D. Bayer and P. Diaconis, Trailing the Dovetail Shuffle to its Lair, Technical Report, Department
of Statistics, Stanford, 1989.
6. J. Buhler and R. Graham, A note on the drop polynomial of a poset, in preparation.
7. J. Buhler and R. Graham, Fountains, showers, and cascades, The Sciences, Jan.-Feb. 1984, 44-51.
8. M. Donner, A real-time juggling robot, IBM research preprint.
9. W. Feller, Introduction to Probability Theory and its Applications, 3rd edition, John Wiley & Sons,
1968.
10. R. Graham, D. Knuth, and O. Patashnik, Concrete Mathematics, Addison Wesley Co., 1989.
11. B. Magnusson and B. Tiemann, The physics of juggling, Physics Teacher, 27 (1989) 584-589.
12. B. Magnusson and B. Tiemann, A notation for juggling tricks, Juggler’s World, summer 1991,
31-33.
13. C. Shannon, Scientific Aspects of Juggling, unpublished manuscript.
14. C. Simpson, Juggling on paper, Juggler’s World, winter 1986, 31.
15. R. Stanley, Enumerative Combinatorics, Wadsworth & Brooks/Cole, 1986.
16. D. Stanton, Constructive Combinatorics, Springer-Verlag, 1986.
17. E. Steingrimsson, Permutation statistics of indexed and poset permutations, Ph.D. dissertation,
MIT, 1991.
18. B. Summers, Juggling as performing mathematics, Co-Evolution Quarterly, summer 1980.
19. M. Truzzi, On keeping things up in the air, Natural History, 1979, 44-55.
20. J. Worpitzky, Studien über die Bernoullischen und Eulerschen Zahlen, Journal für die reine und
angewandte Mathematik, 94 (1881) 103-232.
21. K.-H. Ziethen and A. Allen, Juggling, The Art and its Artists, Werner Rausch & Werner Lüft Inc.,
1985.

Buhler: Reed College, Portland, OR 97202, jpb@reed.edu
Eisenbud: Brandeis University, Waltham, MA 02254, eisenbud@math.brandeis.edu
Graham: AT&T Bell Laboratories, Murray Hill, NJ 07974, rlg@research.att.com
Wright: The University of Liverpool, Liverpool, L69 3BX, England


1994] JUGGLING DROPS AND DESCENTS 519 


Teaching Integration by Substitution 


David Gale 


The current boom in calculus reform programs has been going on now for more 
than six years at a cumulative cost of well over five million dollars. A major theme 
of the program has been the need to get away from so-called cook book calculus, 
to teach concepts rather than techniques, understanding rather than rote memo- 
rization. This is of course a worthy goal but just how one goes about achieving it is 
not at all obvious. What I want to do in the paragraphs which follow is to look at 
this question in the context of a special case by treating a particular calculus 
question which has been bothering me on and off for more than 50 years. 

To begin at the beginning, I took freshman calculus in 1940 and in the 
intervening years I have taught virtually that same course to others dozens of 
times. I found the course back then mildly disappointing in that it seemed to 
consist for the most part of working hundreds of drill problems, but I managed to 
master enough of the tricks to struggle through with a grade of B. There was one 
thing in the course, though, that really bothered me, and that was the matter of
integration by substitution. I didn’t have any trouble with integrands like $x\sqrt{1 - x^2}$
and $\ln(x)/x$. I understood that in general one hopes that the integrand will have
the form $f(g(x))g'(x)$, in which case if one happens to know an antiderivative of
$f$, call it $F$, then the antiderivative one is looking for is $F(g(x))$. It was also clear,
as the book explained, that in fact this was nothing but the chain rule in reverse.
But then one day we had to integrate $\sqrt{1 - x^2}$ without the extra $x$ on the
outside, so the book, “Calculus” by Arnold Dresden, said, well, make the substitution
$x = \sin(t)$. Then $dx = \cos(t)\,dt$ etc. etc. Again, I could go through the
mechanics without difficulty but this time it seemed to me the operations were not 
justified by anything we had done up to that time. In the earlier cases we made a 
substitution of the form u = g(x) but now we were supposed instead to write 
x = g(t), which didn’t seem to me to be the same thing. 

Because of the current interest in calculus instruction I decided that now, after more
than half a century, it would be interesting to see how textbooks these days are
handling the substitution problem that had thrown me off as a student. To this end 
I looked in 10 fairly traditional texts, some of which are among the current best 
sellers. The authors are Anton (to be abbreviated A), Edwards and Penny (EP), 
Ellis and Gulick (EG), Lang (L), Larson and Hostetler (LH), Marsden and 
Weinstein (MW), Stein (SN), Stewart (ST), Swokowski (SW), Thomas and Finney 
(TF). Also I looked at a draft of an as yet unpublished text by the Harvard 
Calculus Consortium (H). Here are some of my findings. 

All of these books use and prove the “direct” substitution theorem devoting an 
entire section to the subject. None of them proves what I will call the inverse 
substitution theorem although all of them except L and H use it fairly intensively, 
devoting a whole section to Trigonometric Substitution. Only ST explicitly recog- 
nizes the fact that inverse substitution is not the same as direct substitution, and I 
will return to this in a moment. I then looked at how the direct substitution
theorem was treated, and all of the books did indeed show how it followed from
the chain rule. In six of the books, however, A, EP, L, MW, ST and TF, I found the
following formula. If $u = g(x)$ then

$$\int f(g(x))\,g'(x)\,dx = \int f(u)\,du. \tag{1}$$


Of course the equation is false. The expression $\int f(x)\,dx$ stands for antiderivative,
as in a table of integrals, and the variable, be it x, t, u or anything else is a 
dummy. Clearly the antiderivatives on the left and right above are not equal. What 
the books mean, no doubt, is that if you substitute g(x) for u after taking the 
antiderivative on the right you get the antiderivative on the left. I expect some 
readers will say I am being pedantic or that there is no need to be so rigorous at 
the freshman level, but I think this kind of lapse is symptomatic of a rather strange 
set of standards and perhaps it sheds light on why none of the books proves the 
inverse substitution theorem. It is because none of them formulates it. Once one 
does, the proof becomes more or less mechanical and one sees at once that it is 
not a mere application of the chain rule but involves other things, as we will see in 
a moment. The book ST is interesting. It uses equation (1) above to describe 
substitution while inverse substitution [the book’s italics] is described by x = g(t) 
and 


$$\int f(x)\,dx = \int f(g(t))\,g'(t)\,dt \tag{2}$$


where g is required to have an inverse. Notice that (1) and (2) are the same (false) 
equation. Only the letters for the (dummy) variables are different (of course these 
equations become correct when one considers definite integrals and puts in the 
appropriate limits). 

Let us now turn to the mathematics which to my surprise turned out to be 
rather interesting. There are (at least) two different proofs of the inverse substitu- 
tion theorem. The first is direct (brute force), and slightly messy. The second, 
suggested to me by Ole Hald, is short and elegant and makes use of some 
“theory”. For the sake of cleanness I will use the circle notation for composition of 
functions. People who prefer the more traditional f(g(x)) notation should have no 
trouble translating the argument below, at the cost of having to carry around 
masses of parentheses and a lot of, in my view, superfluous x’s. (The subject of 
notation, which is rather interesting, will be considered in an appendix.) Differentiation
will be denoted by a prime, $'$, and for typographical clarity I will use a dot, $\cdot$,
for multiplication of functions. Finally I will use a block, ∎, for “proof” as well as
for “Q. E. D.”, a notational reform I have been trying to persuade the mathematical
community to adopt for the past 25 years with no success whatsoever. First,
then, we have the chain rule,

$$(f \circ g)' = (f' \circ g) \cdot g'. \tag{Ch}$$

Now, both the substitution rules described in the preceding paragraphs deal
with the situation where we have three functions $h$, $f$ and $g$ and

$$h = (f \circ g) \cdot g'. \tag{$*$}$$

In the direct substitution case we know an antiderivative for $f$ and want to find one
for $h$. The answer is given by,

Direct Substitution Theorem. If $F' = f$ then

$$(F \circ g)' = h. \tag{3}$$

∎ From the chain rule, $(F \circ g)' = (F' \circ g) \cdot g' = (f \circ g) \cdot g' = h$.


In the inverse substitution case we know an antiderivative of $h$ and want to find
one for $f$.


Inverse Substitution Theorem. If $H' = h$ and $g$ has an inverse then

$$(H \circ g^{-1})' = f. \tag{4}$$


Let me suggest at this point that the reader take two minutes to work out the 
direct proof of (4) in order to see what is involved. 

As one might expect, one needs to use not only the chain rule but also the
formula for the derivative of the inverse of a function which in our notation is

$$(g^{-1})' = 1/(g' \circ g^{-1}). \tag{Inv}$$

∎ From (Ch) and ($*$) we have

$$(H \circ g^{-1})' = (H' \circ g^{-1}) \cdot (g^{-1})' = \bigl(((f \circ g) \cdot g') \circ g^{-1}\bigr) \cdot (g^{-1})'.$$

We must now simplify the term on the right hand side and we need several facts.

The first is the general but not so familiar identity that for any functions $a$, $b$
and $c$

$$(a \cdot b) \circ c = (a \circ c) \cdot (b \circ c). \tag{5}$$

The right hand side then becomes

$$((f \circ g) \circ g^{-1}) \cdot (g' \circ g^{-1}) \cdot (g^{-1})',$$

which by the associative law for composition and the fact that $g \circ g^{-1}$ is the
identity function, simplifies to

$$f \cdot (g' \circ g^{-1}) \cdot (g^{-1})',$$

but the product on the right is 1 by application of (Inv). ∎


There is an interesting observation to be made about this proof. It makes no
direct assumptions about the function $f$, not even continuity, but ends up proving
that $f$ is integrable (meaning antidifferentiable). The second proof, which will now
be given, requires the antidifferentiability of $f$ as a hypothesis. Thus, it assumes
that $h$ has an antiderivative $H$, and $f$ has an antiderivative $F$, but it does not need
to make use of (Inv).

∎ Since $F' = f$ we have by (Ch), $(F \circ g)' = (f \circ g) \cdot g' = h$, so $H$ and $F \circ g$
have the same derivative and hence (theory) they differ by a constant, thus,

$$F \circ g = H + c. \tag{6}$$

Now composing both sides of (6) on the right with $g^{-1}$ gives

$$F \circ g \circ g^{-1} = F = H \circ g^{-1} + c \circ g^{-1} = H \circ g^{-1} + c,$$

so $(H \circ g^{-1})' = F' = f$. ∎

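
As an illustration of (4), here is the substitution $x = \sin(t)$ from the opening paragraphs worked in the notation of the theorem. The restriction to $-1 < x < 1$ and $-\pi/2 < t < \pi/2$ is an assumption added here so that $g$ has an inverse and $\cos t > 0$; it is a sketch of one instance, not a general argument.

```latex
% Assumed setting: f(x) = \sqrt{1 - x^2} on (-1, 1), g(t) = \sin t on (-\pi/2, \pi/2).
\begin{align*}
  h(t) &= \bigl((f \circ g) \cdot g'\bigr)(t) = \sqrt{1 - \sin^2 t}\,\cos t = \cos^2 t, \\
  H(t) &= \tfrac{1}{2}\,t + \tfrac{1}{2}\sin t \cos t, \qquad H' = h, \\
  (H \circ g^{-1})(x) &= \tfrac{1}{2}\arcsin x + \tfrac{1}{2}\,x\sqrt{1 - x^2}, \\
  (H \circ g^{-1})'(x) &= \frac{1}{2\sqrt{1 - x^2}} + \frac{1 - 2x^2}{2\sqrt{1 - x^2}}
                        = \sqrt{1 - x^2} = f(x).
\end{align*}
```
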
In addition to the standard textbooks above, I looked at some nonstandard 
ones, those of De Leeuw, Spivak and Moise (there is also the book by Menger but 
it requires essentially learning a new language so I did not consider it). Believe it
or not, the book of Moise actually states and proves the inverse substitution
theorem very much as in the first proof above. I could find no other book that does
this. De Leeuw has an entire short section entitled Integration by Inverse Substitution
but he leaves the theorem itself as an exercise, which I think is a rather good
idea and I will elaborate on this at the end. Spivak’s book is supposed to be a 
completely rigorous calculus. He presents one example in which he lets x = sin u 
and says “this really means that we are using the substitution u = arcsin x”, but
he does not give the details. (Spivak and De Leeuw both point out, by the way, that
equation (1) is not to be taken literally but is merely intended to be “suggestive”.)

I will return to the inverse substitution theorem in a moment, but first I want to 
record some further findings of my quickie textbook survey. The overwhelming 
impression one gets is that of uniformity, both “vertical” and “horizontal”. Over 
the 50 plus years I could detect no general trend toward any kind of change among 
the standard texts. As for the current crop, if one looks at their tables of contents 
they seem almost identical up to permutation of chapters and sections. Thus 
“Trigonometric Substitution” is either Section 5 or 6 of the chapter “Techniques 
of Integration” and is invariably preceded by a section on Integrals Involving 
Trigonometric Functions. Integration by Substitution is around Section 5 of the 
chapter which introduces the integral. Indeed, the whole calculus catechism seems 
to have become quite rigidly codified. I wonder if this is a strictly national 
phenomenon. Are French, Russian, Chinese calculus texts very much like ours or 
are we perhaps teaching ‘“‘American Calculus”? Maybe one of the 35 NSF grantees 
has looked into this question. 

I want now finally to relate the foregoing discussion to the question of teaching 
concepts instead of techniques. First, I should say that I believe the inverse 
substitution theorem is important and should be part of any calculus course, not 
just because it enables one to find antiderivatives of square roots of quadratic 
polynomials but more importantly because it is an example of the change of 
variable technique which is fundamental in much of mathematics. My proposal is 
to try to induce understanding of this material by giving the students a fairly intensive
workout on composition of functions, a subject which they often seem to find 
difficult. The following sequence of problems followed by annotations in brackets 
is intended to illustrate how this might be done. 


Exercises 
1. Let $f(x) = 2x + 1$, $g(x) = x - 2$.
(a) Calculate $f \circ g$ and $g \circ f$.
[Routine but illustrates noncommutativity of composition.]
(b) Find functions $h$ and $k$ such that $f \circ h = g$ and $k \circ f = g$.
[Not so routine but students will learn something by figuring these out.]

2. (a) Show that the linear function $f(x) = ax + b$ has a linear inverse,
provided $a \ne 0$, that is, there is a linear function which we denote by $f^{-1}$
such that $(f^{-1} \circ f)(x) = x$. Write out the function $f^{-1}$.
[This is intended to reinforce the idea that arithmetic calculations are often
made with letters rather than numbers.]
(b) From (a) calculate $(f \circ f^{-1})(x)$.
[In contrast to 1 (b), the inverse of a function does commute with the function.]
(c) If $g(x) = cx + d$ use (a) and (b) to find $h$ and $k$ such that $f \circ h = g$
and $k \circ f = g$. (Hint. Try $h = f^{-1} \circ g$.)
[Again to illustrate the difference between right and left composition.]

3. Let $f$ be any function. Use the chain rule to find the equation obtained by
differentiating both sides of the equations
$$(f \circ f^{-1})(x) = x, \qquad (f^{-1} \circ f)(x) = x.$$
[This will be needed in the last problem.]

4. Let $f(x) = 2x + 1$ and $g(x) = x^2 - 1$.
(a) Calculate $f \circ g$ and $g \circ f$.
[Routine but a little more complicated.]
(b) Find $h$ and $k$ such that $f \circ h = g$ and $k \circ f = g$.
[Corresponding to 1(b) at the next level.]
(c) Is there any polynomial $h$ such that $g \circ h = f$? Explain.
[One should demand a clearly expressed argument here.]

5. Let $f(x) = 2x^2 + 1$, $g(x) = x^2 - 2$.
(a) Find $k$ such that $k \circ f = g$.
[Somewhat more interesting computation.]
(b) Show that there is no polynomial $h$ such that $f \circ h = g$.
[Again the students should be required to write out the argument in a precise,
grammatical manner. If there are TA sections it would be a good idea to ask
students to write down their answers on the board.]

5. Let $f(x) = 2x + 1$, $g(x) = x - 2$, $h(x) = 5$.
Calculate $f \circ (g \cdot h)$, $(f \circ g) \cdot h$, $f \cdot (g \circ h)$, $(f \cdot g) \circ h$.
[The answers are $10x - 19$, $10x - 15$, $6x + 3$, $33$. This is to illustrate the
importance of where you put the parentheses.]

6. Let $f$, $g$ and $h$ be any functions. Answer True or False and give reasons.
(a) $(f \cdot g) \circ h = (f \circ h) \cdot (g \circ h)$.
[It is perhaps too much to ask for a precise written argument here but the
instructor could give the details in class.]
(b) $(f \circ g) \cdot h = (f \cdot h) \circ (g \cdot h)$.
[A counter example is needed. Some of the better students will perhaps find
one. Again a good problem for classroom discussion.]

7. Prove the Inverse Substitution Theorem: If $h = (f \circ g) \cdot g'$ and $g$ has an
inverse and $H$ is an antiderivative of $h$ then $H \circ g^{-1}$ is an antiderivative of
$f$. You will need to use Problem 3 and Problem 6 (a).
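
For readers who like to experiment, the distinction between composition and multiplication of functions in the exercises can be tried out numerically. The sketch below is only an illustration under stated assumptions: the helper names `compose` and `times` are hypothetical, and the functions are those of the exercise with $h(x) = 5$.

```python
def compose(f, g):
    """The function f o g."""
    return lambda x: f(g(x))


def times(f, g):
    """The pointwise product f . g."""
    return lambda x: f(x) * g(x)


def f(x):
    return 2 * x + 1


def g(x):
    return x - 2


def h(x):
    return 5


x = 1
print(compose(f, times(g, h))(x))  # 10x - 19 at x = 1: -9
print(times(compose(f, g), h)(x))  # 10x - 15 at x = 1: -5
print(times(f, compose(g, h))(x))  # 6x + 3 at x = 1: 9
print(compose(times(f, g), h)(x))  # constant 33

# Identity 6(a) holds at every sample point; 6(b) already fails at x = 0.
assert all(compose(times(f, g), h)(t) == times(compose(f, h), compose(g, h))(t) for t in range(-5, 6))
assert times(compose(f, g), h)(0) != compose(times(f, h), times(g, h))(0)
```

A numerical check of this kind is of course no substitute for the written arguments the exercises ask for.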


Let me end by summarizing. I have tried (A), to make a case for including a 
proof of the inverse substitution theorem in calculus courses and, (B), to suggest 
how this might be done in a way that will lead students to a better understanding 
of what the result means. My approach is based on the proposition that what 
students learn in a course is not what they read in a text or hear in a lecture but 
what they do on their own. Routine questions and drill problems definitely have 
their place, but if we want the students to understand why these routines work 
then we must also design problems whose solution will require this understanding. 
The exposition presented here was an attempt in this direction. 

Appendix. Some Thoughts on Notation. The reader may have noticed that in
Exercises 2 and 3 above I shamelessly abandoned my principles and wrote
$(f \circ f^{-1})(x)$ despite the earlier somewhat derogatory remarks about using unnecessary
parentheses and x’s. Also, in the first proof of the Inverse Substitution
Theorem I said that something followed because “$g \circ g^{-1}$ is the identity function”.
How many calculus texts have you seen that talk about the identity function? 
These difficulties go right to the root of our pedagogical problem, I believe. 
Namely, I trust everyone agrees that the most important concept in calculus, and 
probably in all of mathematics, is that of function. Further, it is surely a statistical 
fact that the most used letter in mathematical discourse is x, yet this letter is 
consistently used for two entirely different objects, first, variables as in, for 
example “‘Let x be the distance....” and second the identity function, as in “the 
derivative of x is 1”. Then there’s c, the “constant” which is both a number, as in 
the “constant of integration’, and a function as in “the derivative of a constant is 
0”. This last fact which in some of the books is listed as the first rule of 
differentiation is often written 


dc /dx = 0 


which is especially troubling. What has x got to do with c? 

I realize, of course, that these observations have been made many times, and I 
expect some will say that, yes, there is some ambiguity here but we all know from 
the context what is intended, and fussing about such matters is pedantry. I agree,
we know what is intended from a lifetime of experience but, remember, our 
students are seeing these things for the first time. If our mission is, as we claim, to 
induce understanding then it’s hard to see how a deliberately ambiguous notation 
will help. Furthermore, the remedy is simple. We should give functions their own 
type face, just as we already do for vectors. Thus, we have the number $c$ and the
constant function $\mathbf{c}$ or the number 5 and the function $\mathbf{5}$. The first rule of
differentiation is then either

$$\mathbf{c}' = 0 \quad \text{or} \quad D(\mathbf{c}) = 0$$

depending on one’s choice of notation, but please, no x’s.

I suppose one could denote the identity function by $\mathbf{x}$ but I personally have a
strong preference for $\mathbf{i}$. Then in Exercise 3 above the expression $f \circ f^{-1} = \mathbf{i}$ looks
just fine, and the “second rule of differentiation” is

$$D(\mathbf{i}) = 1$$

and then it’s on to

$$D(\mathbf{i}^n) = n\,\mathbf{i}^{n-1},$$

and so on.

But now there are some debatable points. Consistency requires that we write 
polynomials as, e.g. $3\mathbf{i}^2 - 2\mathbf{i} + 5$, which does seem a bit radical. I do feel, though,
that choice of notation may have a definite influence on how we teach and hence 
what the students learn. 

Perhaps the real reason we can’t switch to a sensible notation is that all the 
current books do things in the traditional way, that we were brought up that way 
ourselves by our predecessors and therefore this is how we are condemned to pass 
things on to our successors. But are we really locked in forever to this way of 
operating? Now that mathematicians have gotten these things sorted out explicitly
(a comparatively recent achievement), one would hope that we could at some point
pass along this enlightenment to our students.

There are, I admit, some complications of a practical nature in all of this. In the
age of word processing there is no problem in getting all the type faces we need to 
distinguish numbers, functions, vectors or anything else. The trouble is, though, 
that as long as lectures are still given the old fashioned way, with chalk at a 
blackboard, one does not have so many choices. This may explain traditional usage 
where we assign different parts of the Latin alphabet to different concepts. Thus, 
$a, b, c$ are constants whereas $x, y, z$ are variables. There are probably psychological
reasons for this. The more rarely used, “distant” letters suggest uncertainty or
“variability”. Next, $f, g, h$ are functions, which makes sense since f is for function,
but why are $i$ and $j$ almost invariably the letters of choice for subscripts? Is it
perhaps because i is for index? Of course, $m$ and $n$ are also used for subscripts,
but note that they are also the runaway favorites for exponents, whereas $i$ and $j$
are rarely used for this purpose, and if someone were to write about Euclidean 
j-space we probably wouldn’t know what was intended. 

Returning finally to my mini textbook survey, I noted a curious phenomenon. 
Almost all of the books at some point would use two different notations for the 
same object in a single formula. Here for example is the way the chain rule 
appears in H. 


$$\frac{d}{dx}\bigl(f(g(x))\bigr) = f'(g(x)) \cdot g'(x)$$


with d/dx on the left and primes on the right. In A one has a slightly different 
mixture. 


$$\frac{d}{dx}[f(u)] = f'(u)\,\frac{du}{dx},$$

and in EP

$$D_x[f(g(x))] = f'(g(x)) \cdot g'(x).$$

In general the practice seems to be to use $d/dx$ or $D$ or $D_x$ on the left and primes
on the right. An even more peculiar practice was to use two different notations for
composition. In EG, L, MW and TF one reads

$$(f \circ g)'(x) = f'(g(x))\,g'(x)$$

thus, circle on the left, parentheses on the right. No book that I have seen writes 
the chain rule consistently in circle notation as I have done above. 

Continuing to backtrack, I found that the practice of using primes on the right 
and other notations on the left occurs not only for the chain rule but for the other 
differentiation formulas as well, in A, EP, LH and H. For instance, in H the sum 
rule is given by 


$$\frac{d}{dx}\bigl(f(x) + g(x)\bigr) = f'(x) + g'(x)$$


but the product rule is simply 


$$(fg)' = fg' + f'g.$$

Why does this happen? One certainly would not do such things in writing a 
research paper. Is it because of some sort of pedagogical theory or is it just 
carelessness? Maybe this could be a “subject for future research”.
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Workable Gears, Archimedian Solids 
and Planar Bipartite Graphs 


Gary Gordon 


1. INTRODUCTION. My young daughters have a popular children’s game which 
involves setting up a system of interlocking gears. Each of the plastic gears can be 
fastened to a plastic board in a variety of places. The gears come in three sizes and 
can be adorned with colorful plastic animals and other decorations. This toy can 
hold their attention for a long time (where ‘long time’ is defined as longer than five 
minutes). We will call an arrangement of the gears (which consists of a finite 
number of gears, some pairs of which interlock and some which don’t) workable if 
whenever any one gear is turned, all the gears turn freely. Children usually 
discover that it is quite easy to construct an unworkable arrangement of the gears. 


Figure 1. Gear arrangement and corresponding graph.


These gears also have an easy and direct connection to graph theory. (We will 
assume familiarity with most of the basics of graph theory. See [1], [3], [5] or [7] for 
very readable introductions which emphasize different aspects of the subject.) 
Given an arrangement S of the gears, define a graph G(S) whose vertices 
correspond to the gears themselves. Define the edges of G(S) so that uv is an
edge joining vertices u and v if and only if the gears corresponding to u and v 
actually interlock. (See Figure 1, where the gears are drawn as circles with 
interlocked gears overlapping.) Recall that the chromatic number of a graph is the 
smallest number of colors needed to color the vertices of G so that adjacent 
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vertices receive different colors. I occasionally give the following question as a 
homework problem in graph theory courses. 


Problem A. Let G (= G(S)) be a graph obtained from a workable arrangement of 
gears. What can you say about G? In particular, can you determine the chromatic 
number of G? 


Students realize quickly that one thing they can say is that G must be planar, 
that is, G can be drawn in the plane so that edges don’t cross. It’s not much harder 
to see that G will correspond to a workable arrangement of gears if and only if G 
has no odd cycles. (If the arrangement is workable, then it is clear that G has no 
odd cycles; the other direction can be established by induction on the number of 
gears, for example.) Since the students have already learned (or are about to learn) 
that a graph G has no odd cycles if and only if G is bipartite (i.e., G has chromatic
number 2), they can prove the following proposition (and solve Problem A). 


Proposition 1. If G corresponds to a workable arrangement of gears, then G is a
planar bipartite graph. 
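
The odd-cycle criterion can be tried out on small examples. The following is a minimal sketch, not part of the original argument: it assumes gears are numbered 0 to n−1 and that an arrangement is given as a list of interlocking pairs, and the function name `gear_directions` is hypothetical. It assigns turning directions by breadth-first search, which is the usual two-coloring test for bipartiteness. It does not test planarity.

```python
from collections import deque

def gear_directions(n, edges):
    """Try to give each gear a direction (+1 or -1) so that every pair of
    interlocking gears turns in opposite directions.

    Returns the list of directions if this is possible (the graph is
    bipartite), or None if some odd cycle jams the arrangement.
    """
    adj = [[] for _ in range(n)]
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    direction = [0] * n              # 0 means "not visited yet"
    for start in range(n):
        if direction[start]:
            continue
        direction[start] = 1
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if direction[v] == 0:
                    direction[v] = -direction[u]
                    queue.append(v)
                elif direction[v] == direction[u]:
                    return None      # two meshing gears would turn the same way
    return direction

square = [(0, 1), (1, 2), (2, 3), (3, 0)]   # four gears in a ring
triangle = [(0, 1), (1, 2), (2, 0)]         # three mutually meshing gears
print(gear_directions(4, square))    # [1, -1, 1, -1]
print(gear_directions(3, triangle))  # None
```

The ring of four gears turns freely, while the three mutually meshing gears lock, in line with the statement that workable arrangements are exactly those without odd cycles.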


The rest of this paper is devoted to constructing various planar bipartite graphs 
which are ‘nice’ in some way. All of the graphs we will consider are assumed to be 
connected. In Section 2, we discuss regular planar bipartite graphs, i.e., planar 
bipartite graphs in which every vertex has the same degree. Section 3 is concerned 
with semi-regular planar bipartite graphs, i.e., graphs in which all the red vertices
have degree a and all the blue vertices have degree b for given positive integers a 
and b. These graphs will be closely related to the Archimedian solids. (In a 
bipartite graph, we will refer to the vertex partition induced by the 2-coloring by 
simply saying ‘the red vertices’ or ‘the blue vertices.’) 


2. REGULAR PLANAR BIPARTITE GRAPHS. If G is a (connected) planar 
bipartite graph which is regular, then it is easy to show that the number of red 
vertices and the number of blue vertices must be equal. 


Problem B. Determine all possible positive integers r and n such that there is a planar
bipartite graph G which is regular of degree r and which has n red vertices (and n 
blue vertices). 


The main tool used in solving all of the problems considered here is Euler’s 
famous polyhedral formula, which he discovered around 1750. (See Theorem 8.1.1 
of [3], for example.) 


Theorem 2 (Euler’s Polyhedral Formula). If a plane drawing of a connected graph
with v vertices and e edges has r regions, then v — e + r = 2. (This formula includes 
the unbounded region in the count for r.) 


The next corollary (which appears as Theorem 8.1.5 in [3]) is also a standard
result. It follows from Euler’s formula and the fact that each cycle in a bipartite
graph contains at least four edges.


Corollary 3. If G is a planar bipartite graph with v ≥ 3 vertices and e edges, then
e ≤ 2v − 4.
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To solve Problem B, we begin by applying Corollary 3 to G. Let G be a planar 
bipartite graph which is regular of degree r on 2n vertices (n of which are in each 
of the two color classes). Then the number of edges is given by e = rn, so we 
immediately get rn ≤ 2(2n) − 4 = 4n − 4, so r < 4.

It remains to investigate the cases r= 1, r= 2 and r=3 separately. The 
determination of all possible values for n for the cases r = 1 and r = 2 is left to 
the reader (see Proposition 4). We will now determine all possible values of n for 
the case r = 3. 

A graph which is regular of degree 3 is called cubic. The smallest cubic bipartite
graph is K_{3,3}, the ‘three houses and three utilities’ graph. (The complete bipartite
graph, denoted K_{m,n}, is the bipartite graph having m red vertices and n blue
vertices with every red vertex adjacent to every blue vertex.) K_{3,3} is not planar, so
the smallest cubic planar bipartite graph must have n ≥ 4.

Finding examples of cubic planar bipartite graphs is not hard. We now construct
two classes of cubic planar bipartite graphs, one for even n and one for odd n. If
n ≥ 4 is even, define a graph B_n with vertices {1, 2, ..., 2n} as follows: Form two
n-cycles, one with vertices {1, ..., n} and the other with vertices {n + 1, ..., 2n},
then add n edges by joining vertices k and n + k for each k, 1 ≤ k ≤ n. (B_4 is
isomorphic to the graph associated with a three-dimensional cube.) For odd n > 5,
define B_n by modifying B_{n−1} as follows: Delete edges (1, n + 1), (3, n + 3) and
(5, n + 5) from B_{n−1}, then add two new vertices x and y, joining vertex x to
vertices 1, 3 and 5 and vertex y to vertices n + 1, n + 3 and n + 5. See Figure 2
for planar drawings of B_6 and B_7. The red vertices are labeled 1, the blue ones are
labeled 2.
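
The construction can be checked mechanically for small n. The sketch below is an illustration under stated assumptions, not the author's code: the function names are hypothetical, the two new vertices are given the integer labels 2m + 1 and 2m + 2 in place of x and y, and for odd n the spokes of the smaller graph are indexed with m = n − 1 (so the deleted edges are (1, m + 1), (3, m + 3), (5, m + 5)). It confirms that the graphs are cubic and bipartite; it does not test planarity.

```python
def prism(n):
    """Edges of B_n for even n: cycles 1..n and n+1..2n, plus the n spokes."""
    edges = set()
    for k in range(1, n + 1):
        nxt = k % n + 1
        edges.add(tuple(sorted((k, nxt))))            # inner cycle
        edges.add(tuple(sorted((n + k, n + nxt))))    # outer cycle
        edges.add((k, n + k))                         # spoke
    return edges

def B(n):
    """Edges of B_n: the prism for even n >= 4, the modified prism for odd n >= 7."""
    if n % 2 == 0:
        return prism(n)
    if n < 7:
        raise ValueError("the modification needs spokes at 1, 3 and 5")
    m = n - 1
    edges = prism(m)
    x, y = 2 * m + 1, 2 * m + 2        # labels standing in for x and y
    for k in (1, 3, 5):
        edges.remove((k, m + k))       # delete a spoke ...
        edges.add((k, x))              # ... and attach its two ends to x and y
        edges.add((m + k, y))
    return edges

def degrees(edges):
    deg = {}
    for u, v in edges:
        deg[u] = deg.get(u, 0) + 1
        deg[v] = deg.get(v, 0) + 1
    return deg

def is_bipartite(edges):
    adj = {}
    for u, v in edges:
        adj.setdefault(u, []).append(v)
        adj.setdefault(v, []).append(u)
    color = {}
    for start in adj:
        if start in color:
            continue
        color[start] = 0
        stack = [start]
        while stack:
            u = stack.pop()
            for w in adj[u]:
                if w not in color:
                    color[w] = 1 - color[u]
                    stack.append(w)
                elif color[w] == color[u]:
                    return False
    return True

for n in (4, 6, 7, 8, 9):
    E = B(n)
    print(n, len(degrees(E)), len(E), set(degrees(E).values()), is_bipartite(E))
```

Each printed row should show 2n vertices, 3n edges, the single degree 3, and a successful two-coloring.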


Figure 2. Planar drawings of B_6 and B_7.


The above procedure fails when n = 5. In fact, there is no cubic planar 
bipartite graph with n = 5. If such a graph G existed, then it would have 10 
vertices, 15 edges and 7 regions. For a region R, let e(R) denote the number of 
edges which bound R. Then e(R) is even (since G is bipartite) and Σ e(R) = 30
(since each edge is counted twice), where this sum extends over all 7 regions of G. 
Thus, exactly one of the regions is bounded by a 6-cycle and the six remaining 
regions are all bounded by 4-cycles. The reader can now show that G cannot be 
planar; we omit the remaining details. 
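
The counting step in this argument is easy to confirm by brute force. The sketch below (the helper name `face_size_multisets` is hypothetical) lists every way to give 7 regions even boundary lengths of at least 4 that add up to 30. It only covers the counting; the final non-planarity step that the text leaves to the reader is not addressed.

```python
def face_size_multisets(total, faces, smallest=4):
    """All non-decreasing lists of `faces` even numbers >= smallest summing to total."""
    if faces == 0:
        return [[]] if total == 0 else []
    out = []
    size = smallest
    while size * faces <= total:
        for rest in face_size_multisets(total - size, faces - 1, size):
            out.append([size] + rest)
        size += 2
    return out

v, e = 10, 15
r = 2 - v + e                      # Euler's formula: v - e + r = 2
print(r, face_size_multisets(2 * e, r))
# 7 [[4, 4, 4, 4, 4, 4, 6]]
```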

The next proposition summarizes the results of this section and completely 
solves Problem B. 
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Proposition 4. If G is a (connected) planar bipartite graph which is regular of degree 
r and has n vertices in each color class, then the only possibilities for n and r are the 
following: 

(a) r = 1 and n = 1,

(b) r = 2 and n ≥ 2,

(c) r = 3 and n ≥ 4, with n ≠ 5.
Furthermore, each of these possibilities can be realized. 


3. SEMI-REGULAR PLANAR BIPARTITE GRAPHS AND ARCHIMEDIAN 
SOLIDS. When we allow a little more flexibility in our planar bipartite graphs, we 
can get some more interesting examples. In this section we consider semi-regular 
planar bipartite graphs. 


Problem C. Determine all possible positive integers a and b (with a < b) such that 
there is a semi-regular planar bipartite graph G in which every red vertex has degree a 
and every blue vertex has degree b. 


Suppose G is a semi-regular planar bipartite graph with m red vertices, each of 
degree a and n blue vertices, each of degree b, where a <b. Then G has 
v = m + n vertices and e = am = bn edges. By Corollary 3, am ≤ 2m + 2n − 4.
Substituting m = bn/a and simplifying gives the inequality

$$\frac{1}{a} + \frac{1}{b} \ge \frac{1}{2} + \frac{2}{e}. \qquad (*)$$

Thus, the only possible values for a and b are the following:

1. a = 1 and b > 1,

2. a = 2 and b > 2,

3. a = 3 and b = 4 or 5.
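
As a quick check of this case analysis, the necessary condition 1/a + 1/b > 1/2 with a < b can be enumerated with exact rational arithmetic. This is a sketch with hypothetical function names; the search bound `max_b` is an arbitrary choice made for the illustration. The second function solves 1/a + 1/b = 1/2 + 2/e for e, which is the equality case used for the smallest examples.

```python
from fractions import Fraction

def feasible_pairs(max_b=12):
    """Pairs a < b <= max_b satisfying 1/a + 1/b > 1/2."""
    return [(a, b)
            for a in range(1, max_b)
            for b in range(a + 1, max_b + 1)
            if Fraction(1, a) + Fraction(1, b) > Fraction(1, 2)]

def edges_at_equality(a, b):
    """Solve 1/a + 1/b = 1/2 + 2/e for the number of edges e."""
    return 2 / (Fraction(1, a) + Fraction(1, b) - Fraction(1, 2))

print([p for p in feasible_pairs() if p[0] >= 3])            # [(3, 4), (3, 5)]
print(edges_at_equality(3, 4), edges_at_equality(3, 5))      # 24 60
```

For a = 1 and a = 2 every larger b passes the test, so only the pairs with a ≥ 3 are printed.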

We again leave to the reader the construction of the appropriate graphs for the
first two cases above and turn our attention to the case where a = 3 and b = 4. In
the smallest possible example, the inequality in (*) will be replaced by equality.
Solving for the number of edges then gives e = 24, so m = 8 and n = 6 (since
3m = 4n = e). From Euler’s formula, G must have 12 regions. Furthermore, each
region of (any planar drawing of) G is bounded by exactly 4 edges. We can now
construct G by modifying the graph B_6 constructed above (see Figure 2). As in the
modification which produced B_7, we again add two new vertices and six new edges
to B_6, but this time we don’t delete any old edges (see Figure 3). We denote this
graph by C_{6,8}.

We can create an infinite family of semi-regular planar bipartite graphs with
a = 3 and b = 4 by modifying the graphs B_{6k} (for any k ≥ 1) in an analogous way.
Add 2k vertices to B_{6k} and join each of these new vertices to three ‘blue’ vertices
of B_{6k}. We denote this family by C_{6k,8k}, for k ≥ 1. See Figure 3 for a drawing of
C_{12,16}, which is a modification of B_{12}.
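
One concrete way to carry out this modification is sketched below. The text only says that each new vertex is joined to three ‘blue’ vertices, so the particular triples used here are an assumption made for illustration: k new vertices are joined to inner-cycle vertices 6j + 1, 6j + 3, 6j + 5, and k new vertices to outer-cycle vertices n + 6j + 2, n + 6j + 4, n + 6j + 6, where n = 6k. All function names are hypothetical. The code checks the vertex, edge and region counts, the degree pattern, and that the two degrees sit in opposite color classes; it does not test planarity.

```python
def prism(n):
    """Two n-cycles 1..n and n+1..2n joined by n spokes (n even)."""
    edges = set()
    for k in range(1, n + 1):
        nxt = k % n + 1
        edges.add(tuple(sorted((k, nxt))))
        edges.add(tuple(sorted((n + k, n + nxt))))
        edges.add((k, n + k))
    return edges

def C(k):
    """A candidate for the graph with 6k vertices of degree 4 and 8k of degree 3."""
    n = 6 * k
    edges = prism(n)
    new = 2 * n                          # new vertices get labels 2n+1, 2n+2, ...
    for j in range(k):
        new += 1                         # a vertex placed inside the inner cycle
        for i in (1, 3, 5):
            edges.add((6 * j + i, new))
        new += 1                         # a vertex placed outside the outer cycle
        for i in (2, 4, 6):
            edges.add((n + 6 * j + i, new))
    return edges

def two_color(edges):
    """Return a dict vertex -> 0/1 if the graph is bipartite, else None."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, []).append(v)
        adj.setdefault(v, []).append(u)
    color = {}
    for start in adj:
        if start in color:
            continue
        color[start] = 0
        stack = [start]
        while stack:
            u = stack.pop()
            for w in adj[u]:
                if w not in color:
                    color[w] = 1 - color[u]
                    stack.append(w)
                elif color[w] == color[u]:
                    return None
    return color

def summary(k):
    edges = C(k)
    color = two_color(edges)
    deg = {}
    for u, v in edges:
        deg[u] = deg.get(u, 0) + 1
        deg[v] = deg.get(v, 0) + 1
    counts, classes = {}, {}
    for vertex, d in deg.items():
        counts[d] = counts.get(d, 0) + 1
        classes.setdefault(d, set()).add(color[vertex])
    semi_regular = (len(classes) == 2
                    and all(len(c) == 1 for c in classes.values())
                    and classes[3] != classes[4])
    v, e = len(deg), len(edges)
    return v, e, 2 - v + e, counts, semi_regular

print(summary(1))   # (14, 24, 12, {...}, True)
print(summary(2))
```

For k = 1 this gives 14 vertices, 24 edges and 12 regions, matching the counts derived from Euler’s formula; for k = 2 it gives 28 vertices, 48 edges and 22 regions.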

Since we are considering semi-regular graphs, it is not surprising to discover a 
connection between the graphs we have constructed and another family of semi- 
regular graphs, the Archimedian solids. An Archimedian, or semi-regular solid has 
the property that its faces are all regular polygons, but of two or more kinds, all its 
vertices are identical and it can be circumscribed by a regular tetrahedron so that 
four of its faces lie on the four faces of the tetrahedron. (Dropping the last 
requirement concerning the circumscribed tetrahedron allows infinite families of 
prisms and antiprisms and the pseudo rhombicuboctahedron. See Figures 2.10 and 
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1 


Figure 3 


2.13 of [4].) There are 13 Archimedian solids, 11 of which can be obtained from the 
five Platonic solids by truncation. Archimedes’ account of the 13 solids which bear 
his name is lost, presumably in the great fire of Alexandria. Heron writes that 
Archimedes ascribed the cuboctahedron to Plato. (See the historical notes in 
Chapter 2 of [2] and the Foreword of [6].) Kepler’s book The six-cornered snowflake, 
published in 1609, includes what is believed to be the first complete list of the 13 
Archimedian solids, giving them the names by which they are still known. 

What properties will the dual graph (C_{6,8})* have? (For an introduction to
duality for planar graphs, see Chapter 15 of [7], for example.) Since every vertex in
C_{6,8} has degree 3 or 4, every region in (C_{6,8})* will be bounded by 3 or 4 edges.
Further, since each region of C_{6,8} has 4 bounding edges, every vertex of (C_{6,8})*
will have degree 4. Finally, since C_{6,8} is bipartite, every edge of (C_{6,8})* will
separate a triangle from a quadrilateral, so each triangle is surrounded by 3
quadrilaterals and each quadrilateral is surrounded by 4 triangles. Drawing (C_{6,8})*
on a sphere, we find (C_{6,8})* is isomorphic to the cuboctahedron, an Archimedian
solid (see Figure 4). The faces of the cuboctahedron are two-colorable (so that the
triangles can be colored red and the squares can be blue) and it can be constructed


Figure 4. (C_{6,8})*, the cuboctahedron, and (D_{12,20})*, the icosidodecahedron.
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by truncating the eight vertices of an ordinary cube. More on building models of 
this solid (and many others) can be found in [4] or [6], both of which have very nice 
pictures. 

We can also interpret (C_{6k,8k})* as the graphs of 3-dimensional solids for larger
values of k. Again, each region will be bounded by either a triangle or a
quadrilateral and each edge will separate a triangle from a quadrilateral. But two
of the regions of C_{6k,8k} are bounded by more than 4 edges, so two vertices of
(C_{6k,8k})* will have degree larger than 4 (and so these solids will not be
Archimedian). For example, in (C_{12,16})*, there will be two vertices of degree 8 and 20
vertices of degree 4. The vertices of degree 8 (which correspond to the internal and
external regions of C_{12,16}) will be incident to 4 triangles and 4 quadrilaterals each,
while the remaining 20 vertices will have 2 triangles and 2 quadrilaterals surrounding
them. The reader is encouraged to draw a picture (or even build a model) of
(C_{12,16})*, which is ‘close to Archimedian.’

We can now apply the ideas developed above to the case where a = 3 and 
b = 5. Again, in the smallest possible example, the inequality in (*) will be
replaced by equality. Thus, we are looking for a planar bipartite graph in which
there are 60 edges, and so m = 20 and n = 12. From Euler’s formula, this graph
will have 30 regions, each of which is bounded by exactly 4 edges. We can construct
such a graph using some of the ideas developed above; we denote the graph
D_{12,20}. See Figure 5 for a drawing of this graph.


Figure 5. The graph D_{12,20}.


What does the dual graph look like this time? Proceeding as we did in
examining (C_{6,8})*, we note that every region in (D_{12,20})* will be bounded by 3 or 5
edges and every vertex of (D_{12,20})* will have degree 4. Further, every edge of
(D_{12,20})* will separate a triangle from a pentagon (and so each triangle will be
adjacent to 3 pentagons and each pentagon will be adjacent to five triangles).
Thus, (D_{12,20})* is also isomorphic to an Archimedian solid in which the regions
(faces) are two-colorable (see Figure 4). This Archimedian solid is the
icosidodecahedron, and it can be constructed by truncating the twenty vertices of a
dodecahedron.

As in the a = 3, b = 4 case, we can generalize the procedure to construct an
infinite family of planar bipartite graphs, denoted D_{12k,20k}, with a = 3 and b = 5.
We modify D_{12,20} in a way which is similar to the way we modified C_{6,8} above to
produce this family. See Figure 6 for a drawing of D_{24,40}, which has 64 vertices,
120 edges and 58 regions. (To simplify the picture, we have only labeled the 24
vertices of degree 5. All unlabeled vertices have degree 3 and should be labeled 1.)
A model of the dual solid (D_{24,40})* will have 24 pentagons and 40 triangles. 56 of
the 58 vertices of this solid will be incident to 2 triangles and 2 pentagons; the
other 2 vertices will be incident to 4 triangles and 4 pentagons apiece. Again, this
solid is ‘almost’ Archimedian in the same way (C_{12,16})* is almost Archimedian.


The two Archimedian solids used here, the cuboctahedron and
icosidodecahedron, are quasi-regular in the sense that the two kinds of faces are all regular and
a face of one kind is entirely surrounded by faces of the other kind. In fact, these
two solids are the only convex quasi-regular solids. (This follows from our work
above, or from Chapter 2 of [2].) We also remark that the graphs C_{6,8} and D_{12,20}
are interesting enough to have their own names: C_{6,8} is the rhombic dodecahedron
and D_{12,20} is the triacontahedron. Models of these two graphs (on spheres,
considered as solids) can be constructed so that each face is a rhombus.

These two solids were discovered by Kepler around 1611; in fact, the rhombic 
dodecahedron occurs in nature as a garnet crystal. See Plate I in [2] for pictures of 
models of these solids. Coxeter [2] gives an elegant construction to create a model 
of the rhombic dodecahedron. Take two solid cubes and cut one of them into 6 
square pyramids, based on the six faces, all sharing the center of the cube as the 
common apex of the pyramids. Now glue the bases of these 6 pyramids to the 6 
faces of the other cube. The resulting solid is the rhombic dodecahedron. (See also
Table 4.12 of [4].) As far as I know, the models for the higher order graphs C_{6k,8k}
and D_{12k,20k} and their duals (for k > 1) have not been considered before.
Finally we remark that the question concerning which of these graphs can 
actually be realized as gear arrangements, which was the motivation behind these 
examples, remains open. Many of the graphs in this paper can easily be shown to 
correspond to gear arrangements. 
We conclude by summarizing the results of this section. 


Proposition 5. If G is a (connected) semi-regular planar bipartite graph in which 
every red vertex has degree a and every blue vertex has degree b (with a < b), then the 
only possibilities for a and b are the following: 

(a) a = 1 and b > 1,

(b) a = 2 and b > 2,

(c) a = 3 and b = 4,

(d) a = 3 and b = 5.
Furthermore, each of these possibilities can be realized. 
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A sine qua non for making mathematics 
exciting to a pupil is for the teacher to be 
excited about it himself; if he is not, no 


amount of pedagogical training will make 
up for the defect.” 


—R. L. Wilder 
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On the Kummer Solutions of the 
Hypergeometric Equation 


Reese T. Prosser 


One of the oldest, and still one of the most interesting, applications of group 
theory arises in the study of the transformations of an ordinary differential 
equation. If we know that a given differential equation admits a group of transfor- 
mations, then we know that the solution set must admit that same group of 
transformations, and we can deduce properties of all the solutions from the 
properties of any one of them. 

A case in point is offered by the celebrated hypergeometric equation (See 
Eq. (1) below), whose solutions include many of the most interesting special 
functions of mathematical physics. In his book [3], Einar Hille notes that this 
equation has a venerable history associated with such names as Gauss, Euler, 
Riemann, and Kummer. The hypergeometric equation is in fact a prototype: every 
ordinary differential equation of second order with at most three regular singular 
points can be brought to the hypergeometric equation by means of suitably chosen 
changes of variable [5]. 

In 1836 Kummer published a set of 6 distinct solutions of the hypergeometric 
equation. These include the hypergeometric function of Gauss, and all of them 
could be expressed in terms of Gauss’s function (See Table 1 below). A useful 
summary of their basic properties is found in [1, p. 105ff.]. A glance at the list of 
these Kummer solutions reveals a rather complicated set of relationships which 
pleads for some simple explanation. We show here that the Kummer solutions are 
related by a finite group of transformations which serve to explain their relation- 
ships and to exemplify the use of transformation groups in the study of differential 
equations. 


1. THE HYPERGEOMETRIC EQUATION. The hypergeometric equation has the 
form (See [1]) 


$$z(1 - z)\,u''(z) + [c - (a + b + 1)z]\,u'(z) - ab\,u(z) = 0. \qquad (1)$$


Here a, b, and c are real or complex parameters which are independent of z. This
equation has three regular singular points, at z = 0, 1, and ∞, so it belongs to the
Fuchsian class. The roots of the corresponding indicial equations are:


0, 1 − c;   0, c − a − b;   a, b.


Logarithmic singularities are to be expected at these singular points if c, b − a, or
c − a − b is an integer. We shall exclude these exceptional cases from our
discussion here. It is known that every second order homogeneous linear differen- 
tial equation with at most three singularities of regular type can be brought to this 
form by a suitable change of variable (See [5] for details).

$$\begin{aligned}
u_1 &= F(a, b; c; z) \\
T_1 u_1 &= (1 - z)^{-a} F(a, c - b; c; z/(z - 1)) \\
T_2 u_1 &= (1 - z)^{-b} F(c - a, b; c; z/(z - 1)) \\
T_3 u_1 &= (1 - z)^{c-a-b} F(c - a, c - b; c; z) \\[4pt]
u_2 &= F(a, b; a + b + 1 - c; 1 - z) \\
T_1 u_2 &= z^{-a} F(a, a + 1 - c; a + b + 1 - c; 1 - 1/z) \\
T_2 u_2 &= z^{-b} F(b + 1 - c, b; a + b + 1 - c; 1 - 1/z) \\
T_3 u_2 &= z^{1-c} F(b + 1 - c, a + 1 - c; a + b + 1 - c; 1 - z) \\[4pt]
u_3 &= z^{-a} F(a, a + 1 - c; a + 1 - b; 1/z) \\
T_1 u_3 &= z^{-a} (1 - 1/z)^{-a} F(a, c - b; a + 1 - b; 1/(1 - z)) \\
T_2 u_3 &= z^{-a} (1 - 1/z)^{c-a-1} F(1 - b, a + 1 - c; a + 1 - b; 1/(1 - z)) \\
T_3 u_3 &= z^{-a} (1 - 1/z)^{c-a-b} F(1 - b, c - b; a + 1 - b; 1/z) \\[4pt]
u_4 &= z^{-b} F(b + 1 - c, b; b + 1 - a; 1/z) \\
T_1 u_4 &= z^{-b} (1 - 1/z)^{c-b-1} F(b + 1 - c, 1 - a; b + 1 - a; 1/(1 - z)) \\
T_2 u_4 &= z^{-b} (1 - 1/z)^{-b} F(c - a, b; b + 1 - a; 1/(1 - z)) \\
T_3 u_4 &= z^{-b} (1 - 1/z)^{c-a-b} F(c - a, 1 - a; b + 1 - a; 1/z) \\[4pt]
u_5 &= z^{1-c} F(b + 1 - c, a + 1 - c; 2 - c; z) \\
T_1 u_5 &= z^{1-c} (1 - z)^{c-b-1} F(b + 1 - c, 1 - a; 2 - c; z/(z - 1)) \\
T_2 u_5 &= z^{1-c} (1 - z)^{c-a-1} F(1 - b, a + 1 - c; 2 - c; z/(z - 1)) \\
T_3 u_5 &= z^{1-c} (1 - z)^{c-a-b} F(1 - b, 1 - a; 2 - c; z) \\[4pt]
u_6 &= (1 - z)^{c-a-b} F(c - a, c - b; c + 1 - a - b; 1 - z) \\
T_1 u_6 &= z^{a-c} (1 - z)^{c-a-b} F(c - a, 1 - a; c + 1 - a - b; 1 - 1/z) \\
T_2 u_6 &= z^{b-c} (1 - z)^{c-a-b} F(1 - b, c - b; c + 1 - a - b; 1 - 1/z) \\
T_3 u_6 &= z^{1-c} (1 - z)^{c-a-b} F(1 - b, 1 - a; c + 1 - a - b; 1 - z)
\end{aligned}$$

TABLE 1. The Kummer Solutions.


Gauss in his Göttingen thesis (1812) showed that the function

$${}_2F_1(z) = F(a, b; c; z) = \sum_{m=0}^{\infty} \frac{(a)_m (b)_m}{(c)_m (1)_m}\, z^m \qquad (2)$$

is a solution of Eq. (1) which is regular at the origin. Here (a)_m denotes the form

$$(a)_m = a(a + 1) \cdots (a + m - 1) = \frac{\Gamma(a + m)}{\Gamma(a)}, \qquad (3)$$

and the series converges for |z| < 1. Note the symmetry:

$$F(a, b; c; z) = F(b, a; c; z).$$

In 1748, Euler had already shown that the function

$$u_1(z) = \int_0^1 \frac{t^{b-1}(1 - t)^{c-b-1}}{(1 - tz)^{a}}\, dt \qquad (4)$$

is also a solution of Eq. (1), provided that

$$0 < \operatorname{Re}(b) < \operatorname{Re}(c). \qquad (5)$$

This solution is regular at the origin, and hence is a multiple of Gauss’s solution
₂F₁(z). Evaluating these functions at z = 1, we see that

$$u_1(z) = B(b, c - b)\, {}_2F_1(z), \qquad (6)$$

where B(x, y) is the Beta function

$$B(x, y) = \int_0^1 t^{x-1}(1 - t)^{y-1}\, dt = \frac{\Gamma(x)\Gamma(y)}{\Gamma(x + y)}. \qquad (7)$$


We can verify Eq. (6) directly by expanding Eq. (4) in powers of z and integrating 
term by term. 
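
The relation between Euler's integral and Gauss's series can also be spot-checked numerically. The sketch below is only an illustration: the function names are hypothetical, the series is truncated after a fixed number of terms, the integral is approximated by the midpoint rule, and the parameter values are arbitrary choices with Re(c) > Re(b) > 1 so that the integrand stays bounded at both endpoints.

```python
import math

def hyp2f1(a, b, c, z, terms=200):
    """Partial sum of Gauss's series F(a, b; c; z); meaningful only for |z| < 1."""
    total, term = 0.0, 1.0
    for m in range(terms):
        total += term
        term *= (a + m) * (b + m) / ((c + m) * (m + 1)) * z
    return total

def euler_integral(a, b, c, z, steps=20000):
    """Midpoint-rule value of the integral of t^(b-1) (1-t)^(c-b-1) (1-tz)^(-a) over [0, 1]."""
    h = 1.0 / steps
    s = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        s += t ** (b - 1) * (1 - t) ** (c - b - 1) * (1 - t * z) ** (-a)
    return s * h

def beta(x, y):
    return math.gamma(x) * math.gamma(y) / math.gamma(x + y)

a, b, c, z = 0.5, 2.0, 4.5, 0.3
lhs = euler_integral(a, b, c, z)
rhs = beta(b, c - b) * hyp2f1(a, b, c, z)
print(lhs, rhs, abs(lhs - rhs))
```

The two printed values should agree to well within the accuracy of the quadrature.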
If we make a simple change of variable in Eq. (4), of the form 


t → 1 − t,
t → t/(1 − z + tz), or
t → (1 − t)/(1 − tz),


respectively, the resulting integral can be written again in the form of Eq. (4), and 
in this way we arrive at the Euler transformations: 


$$\begin{aligned}
F(a, b; c; z) &= (1 - z)^{-a} F(a, c - b; c; z/(z - 1)) \\
F(a, b; c; z) &= (1 - z)^{-b} F(c - a, b; c; z/(z - 1)) \\
F(a, b; c; z) &= (1 - z)^{c-a-b} F(c - a, c - b; c; z). \qquad (8)
\end{aligned}$$


These transformations tell us that the solution u_1(z) can be written in any of four
equivalent forms. 
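
A numerical spot check of the four equivalent forms is straightforward as long as both z and z/(z − 1) lie inside the unit disk. The sketch below uses a truncated power series for F; the function names and the sample parameter values are assumptions made for the illustration.

```python
def hyp2f1(a, b, c, z, terms=200):
    """Partial sum of Gauss's series F(a, b; c; z); meaningful only for |z| < 1."""
    total, term = 0.0, 1.0
    for m in range(terms):
        total += term
        term *= (a + m) * (b + m) / ((c + m) * (m + 1)) * z
    return total

def euler_forms(a, b, c, z):
    """F(a, b; c; z) together with its three Euler-transformed expressions."""
    w = z / (z - 1)
    return [
        hyp2f1(a, b, c, z),
        (1 - z) ** (-a) * hyp2f1(a, c - b, c, w),
        (1 - z) ** (-b) * hyp2f1(c - a, b, c, w),
        (1 - z) ** (c - a - b) * hyp2f1(c - a, c - b, c, z),
    ]

forms = euler_forms(0.3, 0.6, 1.7, 0.2)    # here z/(z - 1) = -0.25
spread = max(forms) - min(forms)
print(forms, spread)
```

All four numbers should coincide up to rounding error.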


2. KUMMER SOLUTIONS. In 1836 Kummer observed that any function of the
form

$$u(z) = \int_{\Gamma} I(s, z)\, ds, \qquad (9)$$

where

$$I(s, z) = \frac{s^{a-c}(s - 1)^{c-b-1}}{(s - z)^{a}}, \qquad (10)$$

is a solution of Eq. (1) provided that the integrand is integrable along a simple
path Γ joining any two of its singularities. This will be the case provided that

$$0 < \operatorname{Re}(b) < \operatorname{Re}(c) < \operatorname{Re}(a + 1) < 2. \qquad (11)$$

If we assume these conditions for the moment, then there are in general singularities
of the integrand only at the points s = 0, 1, ∞, and z, and hence there are only
$\binom{4}{2} = 6$ possible paths:

$$\begin{aligned}
u_1(z) &= \int_1^{\infty} I(s, z)\, ds, & u_4(z) &= \int_z^{\infty} I(s, z)\, ds, \\
u_2(z) &= \int_{-\infty}^{0} I(s, z)\, ds, & u_5(z) &= \int_0^{z} I(s, z)\, ds, \\
u_3(z) &= \int_0^{1} I(s, z)\, ds, & u_6(z) &= \int_1^{z} I(s, z)\, ds. \qquad (12)
\end{aligned}$$


Each of these integrals reduces to the Euler form (Eq. (4)) by a suitable change of 
variable: By setting s equal, respectively, to 


s = 1/t,          s = z/t,
s = 1 − 1/t,      s = zt,
s = t,            s = 1 − t(1 − z),


we may write these six solutions, apart from inessential factors of the form (— 1)*, 
in terms of Gauss’s function F(a, b; c; z) as follows: 


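$$\begin{aligned}
u_1(z) &= F(a, b; c; z) \\
u_2(z) &= F(a, b; a + b + 1 - c; 1 - z) \\
u_3(z) &= z^{-a} F(a, a + 1 - c; a + 1 - b; 1/z) \\
u_4(z) &= z^{-b} F(b + 1 - c, b; b + 1 - a; 1/z) \\
u_5(z) &= z^{1-c} F(b + 1 - c, a + 1 - c; 2 - c; z) \\
u_6(z) &= (1 - z)^{c-a-b} F(c - a, c - b; c + 1 - a - b; 1 - z). \qquad (13)
\end{aligned}$$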


These are Kummer’s six distinct solutions of the hypergeometric equation. Each of 
them has four forms, which are related by the Euler transformations of Eq. (8), 
giving 24 forms in all (See Table 1 on page 536). 
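
One way to gain confidence in the six expressions is to substitute each into the hypergeometric equation numerically. The sketch below is an illustration only: F is a truncated power series, derivatives are approximated by central differences, the function names and parameter values are arbitrary choices (with c, b − a and c − a − b non-integral), and each solution is tested at a real point where its own series converges.

```python
def hyp2f1(a, b, c, z, terms=200):
    """Partial sum of Gauss's series F(a, b; c; z); meaningful only for |z| < 1."""
    total, term = 0.0, 1.0
    for m in range(terms):
        total += term
        term *= (a + m) * (b + m) / ((c + m) * (m + 1)) * z
    return total

def kummer_solutions(a, b, c):
    """The six Kummer solutions as functions of a real variable z."""
    F = hyp2f1
    return {
        1: lambda z: F(a, b, c, z),
        2: lambda z: F(a, b, a + b + 1 - c, 1 - z),
        3: lambda z: z ** (-a) * F(a, a + 1 - c, a + 1 - b, 1 / z),
        4: lambda z: z ** (-b) * F(b + 1 - c, b, b + 1 - a, 1 / z),
        5: lambda z: z ** (1 - c) * F(b + 1 - c, a + 1 - c, 2 - c, z),
        6: lambda z: (1 - z) ** (c - a - b) * F(c - a, c - b, c + 1 - a - b, 1 - z),
    }

def residual(u, a, b, c, z, h=1e-4):
    """z(1-z)u'' + [c - (a+b+1)z]u' - ab u, with finite-difference derivatives."""
    d1 = (u(z + h) - u(z - h)) / (2 * h)
    d2 = (u(z + h) - 2 * u(z) + u(z - h)) / h ** 2
    return z * (1 - z) * d2 + (c - (a + b + 1) * z) * d1 - a * b * u(z)

a, b, c = 0.3, 0.6, 1.7
sols = kummer_solutions(a, b, c)
points = {1: 0.5, 2: 0.5, 5: 0.5, 6: 0.5, 3: 3.0, 4: 3.0}
residuals = {k: residual(sols[k], a, b, c, z) for k, z in points.items()}
print(residuals)
```

Each residual should be small compared with the size of the individual terms, limited only by the finite-difference error.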


3. KUMMER TRANSFORMATIONS. Anyone looking for properties of the solu- 
tions of the hypergeometric equation and coming upon this table of the Kummer 
solutions for the first time might well be puzzled by the bewildering profusion it 
offers, and might well wonder if there were some way to organize this table so that 
the relations among the entries become clear. 

One way to bring order out of chaos here is to introduce a finite group Γ_6 (the
Kummer group) of six linear fractional transformations of the complex z-plane
which transform the arguments of the Kummer solutions into one another. This
group is generated by the elements

$$A(z) = 1 - z, \qquad B(z) = 1/z. \qquad (14)$$

If we represent the transformation C(z) = (az + b)/(cz + d) as usual by the
matrix

$$C = \begin{pmatrix} a & b \\ c & d \end{pmatrix},$$

then

$$A = \begin{pmatrix} -1 & 1 \\ 0 & 1 \end{pmatrix}, \qquad B = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}. \qquad (15)$$

Note that A, B ∈ GL(2, Z), A² = B² = I, and det A = det B = −1. The derivative
C'(z) of C(z) = (az + b)/(cz + d) is

$$C'(z) = \det C/(cz + d)^2,$$

so that det C · C'(z) is a square, whose root we denote by W_C(z):

$$W_C(z) = \pm(\det C \cdot C'(z))^{1/2}, \qquad (16)$$

where the sign is chosen so that W_C(z) > 0 when 0 < z < 1. In particular,

$$W_A(z) = 1, \qquad W_B(z) = 1/z. \qquad (17)$$

The elements A and B generate a six-element subgroup Γ_6 of GL(2, Z) consisting
of


Γ_6 = {I, A, B, AB, BA, ABA = BAB}. (18)
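
The group can be generated by machine from the two matrices. The sketch below assumes the matrix forms A = (−1 1; 0 1) and B = (0 1; 1 0) for z → 1 − z and z → 1/z, and the function names are hypothetical. One point worth noting, which is an observation of this sketch and not a claim of the text: a matrix and its negative define the same fractional linear transformation, and with these particular matrices ABA and BAB differ by a sign, so the count of six is obtained when matrices are identified up to sign.

```python
def matmul(X, Y):
    n = len(X)
    return tuple(tuple(sum(X[i][k] * Y[k][j] for k in range(n))
                       for j in range(n)) for i in range(n))

def up_to_sign(M):
    """Pick one representative of {M, -M}; both act identically on z."""
    first = next(x for row in M for x in row if x != 0)
    return M if first > 0 else tuple(tuple(-x for x in row) for row in M)

def generate(gens, projective):
    """Closure of the generators under multiplication (optionally modulo sign)."""
    key = up_to_sign if projective else (lambda M: M)
    n = len(gens[0])
    identity = tuple(tuple(int(i == j) for j in range(n)) for i in range(n))
    seen = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = key(matmul(g, s))
            if h not in seen:
                seen.add(h)
                frontier.append(h)
    return seen

A = ((-1, 1), (0, 1))    # z -> 1 - z
B = ((0, 1), (1, 0))     # z -> 1/z
ABA = matmul(matmul(A, B), A)
BAB = matmul(matmul(B, A), B)
print(len(generate([A, B], projective=True)))     # 6
print(len(generate([A, B], projective=False)))    # 12
print(ABA, BAB)
```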


This group (the anharmonic group) can be visualized as the group of rotations of
the regular hexagon (See below). The group Γ_6 admits a fundamental domain in
the full complex plane C which we can describe as follows. We define the domains
D_1, D_2, D_3 and their conjugates D_1', D_2', D_3' by:

$$\begin{aligned}
D_1 &= \{z : |z| < 1 \ \&\ |1 - z| > 1\} \\
D_2 &= \{z : |z| > 1 \ \&\ |1 - z| < 1\} \\
D_3 &= \{z : |z| > 1 \ \&\ \operatorname{Re}(z) < 1/2\} \\
D_1' &= \{z : |1 - z| < 1 \ \&\ \operatorname{Re}(z) < 1/2\} \\
D_2' &= \{z : |1 - z| > 1 \ \&\ \operatorname{Re}(z) > 1/2\} \\
D_3' &= \{z : |z| < 1 \ \&\ \operatorname{Re}(z) > 1/2\}.
\end{aligned}$$

Then these domains are disjoint, and, together with their boundaries, they fill up
the plane (See Figure 1).


Figure 1. Kummer Domains

Moreover, they are related by the elements of the group Γ_6:

[Diagram: the six domains D_1, D_2, D_3, D_1', D_2', D_3' arranged around a hexagon.]

(Note that D_i' = T_{ABA} D_i for each value of i.)
At the same time we introduce a finite group of linear transformations of the
parameter space C^3 generated by the elements A and B of Γ_6:


$$A(a, b, c) = (a, b, a + b + 1 - c), \qquad B(a, b, c) = (a, a + 1 - c, a + 1 - b). \qquad (19)$$


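If we denote by p the vector (a, b, c, 1) in C^4, then we can represent A and B
by matrices:

$$Ap = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 1 & -1 & 1 \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} a \\ b \\ c \\ 1 \end{pmatrix}, \qquad
Bp = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 1 & 0 & -1 & 1 \\ 1 & -1 & 0 & 1 \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} a \\ b \\ c \\ 1 \end{pmatrix}. \qquad (20)$$
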

Note that the entries a and 1 in p are fixed under A and B, and that if the entries 
of p satisfy the conditions of Eq. (5), then so do those of Ap and Bp. A modest 
calculation shows that A and B generate a six-element subgroup of GL(4, Z) 
isomorphic to Γ_6, as is already implicit in our notation.
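
The “modest calculation” can be delegated to a few lines of code. In this sketch the 4 × 4 matrices are written out from Eq. (19) acting on p = (a, b, c, 1); the variable and function names are hypothetical.

```python
def matmul(X, Y):
    n = len(X)
    return tuple(tuple(sum(X[i][k] * Y[k][j] for k in range(n))
                       for j in range(n)) for i in range(n))

def closure(gens):
    """All products of the generators, found by repeated right multiplication."""
    n = len(gens[0])
    identity = tuple(tuple(int(i == j) for j in range(n)) for i in range(n))
    group, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = matmul(g, s)
            if h not in group:
                group.add(h)
                frontier.append(h)
    return group

# Rows read off from A(a,b,c) = (a, b, a+b+1-c) and B(a,b,c) = (a, a+1-c, a+1-b).
A4 = ((1, 0, 0, 0), (0, 1, 0, 0), (1, 1, -1, 1), (0, 0, 0, 1))
B4 = ((1, 0, 0, 0), (1, 0, -1, 1), (1, -1, 0, 1), (0, 0, 0, 1))

group = closure([A4, B4])
ABA4 = matmul(matmul(A4, B4), A4)
BAB4 = matmul(matmul(B4, A4), B4)
print(len(group), ABA4 == BAB4)    # 6 True
```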

Using these matrices, we can now define the group Γ_6 of transformations (the
Kummer transformations) of the Kummer solutions in Eq. (13). If G(a, b; c; z) is
any function of z with parameters a, b, and c, not necessarily a solution of the
hypergeometric equation, then we define for any C ∈ Γ_6 the transformation T_C by
the formula

$$T_C G(p, z) = W_C(z)^{p_1}\, G(Cp, C(z)), \qquad (21)$$

where p_1 = a is the first entry in p, and the weight function W_C(z) is given by




Eq. (16). With this definition it is straightforward to verify that

$$\begin{aligned}
T_A u_1(z) &= T_A F(p; z) \\
&= F(a, b; a + b + 1 - c; 1 - z) \\
&= u_2(z), \\
T_B u_1(z) &= T_B F(p; z) \\
&= z^{-a} F(a, a + 1 - c; a + 1 - b; 1/z) \\
&= u_3(z).
\end{aligned}$$

More generally, one can verify directly that the following diagram is commutative: 


[Diagram (22): commutative diagram relating u_1(z) to its images under the transformations T_C.]


To complete our analysis of the symmetries of the Kummer solutions, we
introduce the transformation T_J which realizes the symmetry F(a, b; c; z) =
F(b, a; c; z) of Eq. (2). Here

$$J(z) = z, \qquad J(a, b, c) = (b, a, c), \qquad W_J(z) = 1. \qquad (24)$$

Then J² = I and det J = −1, just as for A and B, and

$$T_J F(a, b; c; z) = F(b, a; c; z).$$
Then we can verify that 
I, = 7,1,1;, (25) 


T; = TT, = TT, 


where T_2 and T_3 realize the second and third of the Euler transformations in
Eq. (8); and that


TTT; = 7), 
T,T,T, = T,, 26 
yA2ty 1 
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Finally, we can verify that 


Ty = T,T3T,, (27) 
where the transformation T_D interchanges the independent solutions:
[pu = Us, 
TpUuy = Ug, (28) 
Tp; = Ug. 


We denote by Γ_48 the 48-element subgroup of GL(4, Z) generated by A, B, and
J. We can now verify that the elements T_A, T_B, T_D, and T_J, as well as the elements
T_1, T_2, and T_3, all have order 2, and that the following master diagram is
commutative.


[Master diagram: the Kummer solutions u_1, ..., u_6 and their images under T_1, T_2, T_3, connected by the transformations introduced above.]

In all, there are 8 orbits of the 6-element group Γ_6, making a total of 48 possible
forms for the Kummer solutions, consisting of those listed in Table 1 and their
images under T_J. The transformation rule for any element C in the group Γ_48 is
given by Eq. (21).


Note added in proof. The referee, who is a much better historian than I, con- 
tributed the following remarks. “Euler not only knew that Eq. (4) is a solution of 
Eq. (1), he knew that the series Eq. (2) is a different form of this solution. Also 
Gauss claimed that Eq. (2) is a solution to an equation equivalent to Eq. (1) in his 
published paper from 1812 to 1813, but it was only in his unpublished work that he 
explicitly states the differential equation Eq. (1). He wrote it in different form. The 
most complete treatment of this equation to appear in print before Kummer’s 
paper was in Pfaff’s 1797 book. Euler found the third transformation in Eq. (8), in 
a posthumous paper which Pfaff edited (1797), and possibly in earlier work, 
although I have not found it earlier, but I have not found the first two transforma- 
tions in Eq. (8) in work of Euler. A special case of them was given by Stirling in his 
1730 book.” 
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From Richard Feynman 


The following is a letter I received from Richard Feynman while I was a high school student at 
La Mirada High School in 1962. I was a Sophomore then, and Dr. Feynman had been invited to 
give a talk to our Math Club. The title of his talk that day was “What is one-half factorial?”.


I had no idea who Richard Feynman was, except that he worked at Cal Tech. Thinking at that 
time that I wanted to be a physicist, I wrote him a letter the next week asking about what 
colleges to attend and what mathematics classes to take. He kindly answered this letter. 


September 26, 1962 


Dear Mr. Farnum: 


This is to answer your letter requesting information on colleges having good 
departments in physics. I am very poor at giving such advice, but I will try. It is best 
not to go to the same school for both undergraduate and graduate work, to 
develop a wider range of points of view. The schools that I can think of which 
have excellent physics graduate schools are Massachusetts Institute of Technology, 
California Institute of Technology, Columbia, Harvard, Princeton, Cornell, Univer- 
sity of California at Berkeley, University of Chicago (and others which may not 
have come to mind). Don’t worry about which is “best”—I don’t know—they are 
all very good and each will give you a complete opportunity to learn and do as 
much as you can. For undergraduate work there is the above list of course, plus 
very many others, too numerous to mention. (If you have in mind any one in 
particular, not on the list above, let me know and I will give you my opinion of it.) 


What mathematics to study? Whatever interests you the most at the moment 
is the best rule. Perhaps you will become fascinated, say, with number theory or 
topology, neither of which is used at all in physics. Then you will become a 
mathematician. Should we recommend that you don’t study number theory or 
topology simply because it is not needed for preparation for theoretical physics? It 
is not necessary now to decide too closely your specialty. For your information, 
however, the mathematics of use in physics is the calculus and its extensions (as in 
the book “Advanced Calculus” by Woods) and later on vector analysis, partial 
differential equations, and other branches of analysis described in books with titles 
like “Mathematical methods of physics”. However, the emphasis should be
somewhat more on how to do the mathematics quickly and easily, and what 
formulas are true, rather than the mathematicians’ interest in methods of rigorous 
proof. 

Sincerely, 
Richard P. Feynman 


Submitted by Nicholas R. Farnum 
Department of Management Science 
California State University, Fullerton 
Fullerton, CA 92634

Reflections on a Mira


John W. Emert, Kay I. Meeks, and Roger B. Nelson 


Reflective devices such as the Mira¹ are beginning to be used as a replacement for
the compass and (unmarked) straightedge as tools for performing geometric
constructions. These devices (see Figure 1) are generally constructed from a dark
translucent plastic, producing an effect similar to that of looking into a car window 
on a bright, sunny day. When placed on a sketch, the device allows one to see both 
the figure and its reflection. 


Figure 1 


All isometries of the plane can be generated by reflections, and the Mira 
exploits this transformational approach more than the compass and straightedge. 
However, concerns over the mathematical basis of its replacement of the compass 
and straightedge are frequently expressed. 

We address two issues related to these concerns. First, we formally describe the 
actions used in Mira constructions. Second, we algebraically characterize the set of 
Mira constructibles and note that the set of compass and straightedge constructible 
lengths is a proper subset of this set. 


¹Mira is a registered trademark of the Mira-Math Company, Willowdale, Ontario, Canada. Any thin
piece of colored, transparent hard plastic with a smooth finish is usable.

1. DECLARING PRIMITIVE ACTIONS. It is customary to assume that the primitive
actions of the compass and straightedge are to draw the line through two given
points and to draw the circle with given center and radius. When using the Mira, 
we identify the following as primitive actions: 


(1) Draw the line which reflects a given point to a given point. 

(2) Mark the reflection (i.e. mirror image) of a given point across a given line. 

(3) Draw a line passing through a given point which reflects a second given 
point to a given line, if such a line exists. 

(4) Draw a line which reflects two given points to respective given lines, if such 
a line exists. 


Note that in Mira constructions, a circle is known by its center and radius, rather
than by its graph. Likewise, a parabola is known by its focus and directrix. 
Therefore, primitive actions (3) and (4) above can be rephrased in the following 
manner. 


(3') Mark the intersections of a given line and the circle with given center and 
given radius. 

(4’) Draw any lines which are simultaneously tangent to two parabolas, each 
determined by a given point and given line. 


The following standard compass and straightedge constructions are among basic 
Mira constructions obtainable from the above primitive actions: 


(5) Draw the line through two given points. 

(6) Copy a segment onto a line, starting at a given point on that line. 
(7) Copy an angle to any given location and position. 

(8) Draw a line which reflects a given line to a given line. 


For example, (5) can be obtained by drawing the diagonals of a rhombus having 
the given points as opposite vertices. 


2. COMPARING CONSTRUCTIBLE RANGES. We will characterize those fig- 
ures of classical Euclidean geometry which are constructible with a finite number 
of primitive actions, either using a compass and straightedge or using a Mira. We 
refer to such figures as being C-constructible or M-constructible, respectively. In 
either case, we note that a given angle $\alpha$ is constructible if and only if it is an angle
of some triangle whose sides have constructible lengths. In fact, we may construct
it as an angle of a right triangle whose legs have lengths $\cos\alpha$ and $\sin\alpha$. Since in
either type of construction, segments can be transported (by (6)) and angles can be
copied (by (7)), constructible figures may be formally described by characterizing
constructible lengths, after designating a unit. We denote the set of M-constructible
lengths by $\mathcal{M}$ and the set of C-constructible lengths by $\mathcal{C}$.


Theorem 1. $\mathcal{C}$ and $\mathcal{M}$ are both subfields of the real numbers, $\mathbb{R}$.


Proof: One must only establish that if $\alpha$ and $\beta$ are both constructible lengths then
$\alpha - \beta$ and $\alpha/\beta$ are also constructible. The standard constructions [5, p. 428; 7,
p. 400] involve only transporting lengths and copying angles and can therefore be
carried out with either type of construction.

Recall that $F(\alpha)$ denotes the smallest subfield of a fixed algebraic closure of $F$
containing both $F$ and $\alpha$. $F(\alpha)$ is a vector space over $F$ with dimension equal to
the degree of the unique monic irreducible polynomial over $F$ of which $\alpha$ is a root.
This is also called the degree of $\alpha$ over $F$ and is denoted $[F(\alpha) : F]$.

The classical result characterizing C-constructibles is [8, p. 213]:


Theorem 2. A real number $\alpha$ is in $\mathcal{C}$ if and only if there is a finite sequence of reals
$r_1, r_2, \ldots, r_n = \alpha$ for which

$$[\mathbb{Q}(r_1, r_2, \ldots, r_i) : \mathbb{Q}(r_1, r_2, \ldots, r_{i-1})] = 2$$

for each integer $i$, $1 \le i \le n$.


Necessity follows from the observation that any newly C-constructed length 
results from the intersection of lines or circles determined by already C-con- 
structed lengths and therefore is a root of some quadratic equation with these 
previously C-constructed coefficients. The algebraic condition is sufficient since 
the square root of any C-constructible length is also C-constructible. [5, p. 402] 

The compass and straightedge construction of square roots which appears in [5]
is easily modified into a Mira construction. Let $a > 0$ be M-constructible. Construct
OA with length $a$ and locate P on the extension of AO so that |PO| = 1.
Let C be the midpoint of PA, and erect a line $l$ perpendicular to PA through
point O. Locate Q on $l$ such that |CQ| = |CA| (see Figure 2). Then angle PQA
is a right angle since the triangle PQA is inscribed in the semicircle with radius
|CA| centered at C. It follows from the resulting similarity of triangles OPQ and
OQA that $|OQ| = \sqrt{a}$.
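The square-root construction can be checked numerically by placing it in coordinates. The sketch below is only an illustration under assumed coordinates (O at the origin, A = (a, 0), P = (-1, 0)); the function name `mira_sqrt` is hypothetical, and the code uses `math.sqrt` to locate Q on the circle, so it verifies the geometry rather than replacing the Mira.

```python
import math


def mira_sqrt(a: float) -> float:
    """Return |OQ| for the construction, with O = (0, 0), A = (a, 0), P = (-1, 0)."""
    ax, px = a, -1.0
    cx = (ax + px) / 2.0        # C is the midpoint of PA (it lies on the x-axis)
    radius = abs(ax - cx)       # |CA|, the radius of the semicircle
    # Q lies on the perpendicular to PA through O (the y-axis) with |CQ| = |CA|,
    # so its height h satisfies cx^2 + h^2 = radius^2.
    return math.sqrt(radius * radius - cx * cx)


for a in (0.25, 2.0, 9.0):
    print(a, mira_sqrt(a), math.sqrt(a))
```

In these coordinates radius² - cx² simplifies to a, which is the algebraic form of the similar-triangles argument.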


Figure 2 


Since all roots of quadratics with already constructed coefficients are M-con- 
structible it follows that all C-constructible lengths are M-constructible. That is: 


Theorem 3. $\mathcal{C}$ is a subfield of $\mathcal{M}$.


Note that the above theorem does not depend on use of the fourth primitive 
Mira action, since this action is not used in constructing quadratic roots. If the 
fourth action is not redundant, then the above containment must be proper. In
fact, we will show in Sections 3 and 4 that $\mathcal{M}$ consists of all $\alpha$ which result from a
finite sequence of degree 2 or 3 extensions.


3. SOLVING ANY CUBIC EQUATION WITH A MIRA. We showed in Section 2 
that the roots of any quadratic polynomial with Mira constructible coefficients can
be constructed by using combinations of Mira actions (1) through (3). With the
inclusion of Mira action (4), we show in this section that the roots of any cubic 
polynomial can be found, a task not possible with compass and straightedge. We 
employ the method of Vieta (1591) (see [3]) as applied to the analysis of compass 
and marked ruler constructions in [9]. 

Any cubic equation can be transformed, by a rational transformation, to
$x^3 + px + q = 0$, with discriminant $\delta = -4p^3 - 27q^2$. Recall that the sign of the
discriminant indicates the order and number of real roots. When $\delta < 0$, the cubic
has exactly one real root and can be transformed to a quadratic in $w^3$ by the
substitution $x = w - p/(3w)$. When $\delta \ge 0$, all roots are real and the cubic can be
reduced to the form $4z^3 - 3z - r = 0$, where $|r| \le 1$. Expanding $r = \cos 3\alpha$ and
substituting $z = \cos\alpha$ also leads to this form. Hence, the ability to construct cube
roots and trisect angles allows the construction of real solutions to cubic equations
with constructible coefficients.
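The two cases can be traced numerically. The following is a minimal sketch, not the authors' procedure: the function name `real_roots_depressed_cubic` and the scaling $x = 2\sqrt{-p/3}\,z$ used to reach $4z^3 - 3z - r = 0$ are assumptions chosen for illustration. The only operations beyond field arithmetic and square roots are one cube root (when $\delta < 0$) and one angle trisection (when $\delta \ge 0$).

```python
import math


def real_roots_depressed_cubic(p: float, q: float) -> list:
    """Real roots of x^3 + p x + q = 0, following the two cases in the text."""
    delta = -4.0 * p**3 - 27.0 * q**2
    if delta < 0:
        # One real root. With x = w - p/(3w), t = w^3 solves t^2 + q t - p^3/27 = 0.
        root = math.sqrt(q * q / 4.0 + p**3 / 27.0)
        t = -q / 2.0 - math.copysign(root, q)        # the root that avoids cancellation
        w = math.copysign(abs(t) ** (1.0 / 3.0), t)  # real cube root
        return [w - p / (3.0 * w)]
    if p == 0.0:
        return [0.0]                                 # delta >= 0 with p = 0 forces q = 0
    # All roots real. With x = m z, m = 2 sqrt(-p/3), the cubic becomes 4z^3 - 3z - r = 0.
    m = 2.0 * math.sqrt(-p / 3.0)
    r = max(-1.0, min(1.0, -4.0 * q / m**3))         # clamp against rounding
    alpha = math.acos(r) / 3.0                       # trisect the angle with cosine r
    return sorted(m * math.cos(alpha + 2.0 * math.pi * k / 3.0) for k in range(3))


print(real_roots_depressed_cubic(-7.0, 6.0))  # the cubic (x - 1)(x - 2)(x + 3)
print(real_roots_depressed_cubic(1.0, -2.0))  # the cubic (x - 1)(x^2 + x + 2)
```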

The Mira action (4) provides a method to trisect an arbitrary angle. Such a 
construction can be found in [2], accompanied by a geometric justification. This 
construction can be modified to directly construct a root of $4z^3 - 3z - r$. It
remains to show that the cube root of a constructible can be constructed. 

Given the unit and a segment OY of length $a$, construct a perpendicular
segment OX of unit length. Reflect Y across line OX to locate point Y′, and
construct the line $l_1$ through this point, parallel to OX. Similarly, reflect X across
line OY to locate point X′, and construct the line $l_2$ through this point, parallel to
OY. Construct the Mira line which reflects X to $l_2$ and Y to $l_1$, and mark the
intersection of this line with OY as Z (see Figure 3). The segment OZ has length
$\sqrt[3]{a}$.



Figure 3 


*Thanks to M. H. Motto for his suggestions.

To justify this construction, first note that the pairs $X, l_2$ and $Y, l_1$ determine
parabolas $x = y^2/4$ and $y = x^2/(4a)$, respectively. As these parabolas meet in two
points, they have a unique common tangent. Comparing the general forms for the
tangent line of each parabola, we obtain the common tangent line
$y = -a^{-1/3}x - a^{1/3}$. This line meets the parabola $x = y^2/4$ at the point
$(a^{2/3}, -2a^{1/3})$ and the parabola $y = x^2/(4a)$ at the point $(-2a^{2/3}, a^{1/3})$.
As a tangent line common to two parabolas, its constructibility is guaranteed by (4’).
This relationship between conics and the extraction of cube roots has ancient roots.
The study of conic sections by the Greek mathematicians Hippocrates and Menaechmus
arose from an attempt to duplicate the cube. See [1, pp. 106-109].
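The common-tangent argument can be checked without taking a cube root. The sketch below assumes the coordinates used in the justification (X = (1, 0), Y = (0, a)) and uses a hypothetical helper `common_tangent_slope`. A line of slope m tangent to $x = y^2/4$ has y-intercept 1/m, and one tangent to $y = x^2/(4a)$ has y-intercept $-am^2$; bisection finds the slope at which the two intercepts agree.

```python
def common_tangent_slope(a: float) -> float:
    """Slope m < 0 of the line tangent to both x = y^2/4 and y = x^2/(4a), for a > 0."""

    def gap(m: float) -> float:
        # Difference of the two y-intercepts, 1/m - (-a m^2); zero for the common tangent.
        return 1.0 / m + a * m * m

    lo, hi = -1.0e6, -1.0e-6      # gap(lo) > 0 > gap(hi) for moderate a; gap decreases on m < 0
    for _ in range(200):
        mid = (lo + hi) / 2.0
        if gap(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def length_OZ(a: float) -> float:
    """Distance from O to the point Z where the common tangent meets line OY (the y-axis)."""
    return abs(1.0 / common_tangent_slope(a))


for a in (2.0, 8.0, 27.0):
    print(a, length_OZ(a), a ** (1.0 / 3.0))
```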


4. IDENTIFYING THE MIRA CONSTRUCTIBLE NUMBERS. We have just 
shown that a real number is M-constructible if it is a root of a quadratic or cubic 
equation with M-constructible coefficients. Are any other reals Mira constructible? 
No. 

Since the first three primitive Mira actions can be accomplished with compass 
and straightedge, all M-constructibles involving only these actions are C-construct- 
ible. Therefore, if additional M-constructibles exist, they must result from applica- 
tions of the fourth primitive action. 

Recall that the fourth primitive action can be characterized as finding the 
common tangent to two known parabolas. If these parabolas are described by 
equations with coefficients in some subfield F of R, we show that the coordinates 
of the points of tangency satisfy either cubic or quadratic equations with coeffi- 
cients in F. 

Any pair of parabolas with coefficients in a field F may be linearly transformed 
to another pair of parabolas whose central axes are either perpendicular or 
parallel. This linear transformation may be represented by a matrix with entries 
from $F$. $F(\alpha)$ and its image under such a transformation will have the same degree
over F. Hence, we need to consider only those cases where the central axes are 
parallel or perpendicular. 

When the central axes are perpendicular, we may assume that the axes intersect
at the origin and express the parabolas as $y^2 = a(x - h)$ and $x^2 = b(y - k)$,
where $a$, $b$, $h$ and $k$ are all in $F$. These parabolas meet the common tangent in
points $(x_1, y_1)$ and $(x_2, y_2)$, respectively. The common tangent has slope
$a/(2y_1) = 2x_2/b$. Computing the y-intercept in two ways yields the relationship
$2k - y_2 = (a/(2y_1))(x_1 - 2h)$. It follows from the resulting four equations in four
unknowns that $y_1$ satisfies

$$8y^3 - 16ky^2 - 8ahy + a^2b = 0$$

and $x_1$ satisfies

$$64a(x - h)^3 - 128(ah + 2k^2)(x - h)^2 + 32(2ah^2 + abk)(x - h) - a^2b^2 = 0.$$

Coordinates $x_2$ and $y_2$ satisfy analogous cubics having coefficients in $F$. A similar
attack on the case when the central axes of the parabolas are parallel shows that
all coordinates $x_1$, $y_1$, $x_2$, $y_2$ satisfy quadratic equations.
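The two cubics can be spot-checked numerically. The sketch below uses hypothetical helper names (`real_root_of_cubic`, `check_common_tangent`) and assumes nonzero $a$ and $b$. It solves the cubic for $y_1$ by bisection, builds the tangent to $y^2 = a(x - h)$ at that point, and reports two residuals that should both vanish: the discriminant measuring tangency to $x^2 = b(y - k)$, and the value of the cubic in $x - h$.

```python
def real_root_of_cubic(c3: float, c2: float, c1: float, c0: float) -> float:
    """One real root of c3 t^3 + c2 t^2 + c1 t + c0 (c3 > 0), found by bisection."""

    def f(t: float) -> float:
        return ((c3 * t + c2) * t + c1) * t + c0

    bound = 1.0 + max(abs(c2), abs(c1), abs(c0)) / c3   # Cauchy bound on the roots
    lo, hi = -bound, bound                              # f(lo) < 0 < f(hi)
    for _ in range(200):
        mid = (lo + hi) / 2.0
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def check_common_tangent(a: float, b: float, h: float, k: float):
    """Return (tangency residual, residual of the cubic in x - h)."""
    y1 = real_root_of_cubic(8.0, -16.0 * k, -8.0 * a * h, a * a * b)
    x1 = h + y1 * y1 / a                  # the point (x1, y1) lies on y^2 = a(x - h)
    slope = a / (2.0 * y1)
    intercept = y1 - slope * x1
    # Substituting y = slope*x + intercept into x^2 = b(y - k) gives a quadratic in x;
    # the line is tangent exactly when its discriminant is zero.
    tangency = (b * slope) ** 2 + 4.0 * b * (intercept - k)
    u = x1 - h
    x_cubic = (64.0 * a * u**3 - 128.0 * (a * h + 2.0 * k * k) * u**2
               + 32.0 * (2.0 * a * h * h + a * b * k) * u - (a * b) ** 2)
    return tangency, x_cubic


print(check_common_tangent(2.0, 3.0, 1.0, -1.0))
```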

We see, therefore, that there are no other M-constructible numbers. That is: 


Theorem 4. A real number $\alpha$ is in $\mathcal{M}$ if and only if there is a finite sequence of
reals $r_1, r_2, \ldots, r_n = \alpha$ for which

$$[\mathbb{Q}(r_1, r_2, \ldots, r_i) : \mathbb{Q}(r_1, r_2, \ldots, r_{i-1})] = 2 \text{ or } 3$$

for each integer $i$, $1 \le i \le n$.

Corollary. A regular n-gon is Mira-constructible if and only if n has the factorization
$2^a 3^b p_1 \cdots p_k$, where $p_1, \ldots, p_k$ are distinct primes of the form $2^u 3^v + 1$.


From the classical result for $\mathcal{C}$ (Theorem 2), Gauss showed that a regular n-gon
is constructible with compass and straightedge if and only if n has the factorization
$2^a p_1 \cdots p_k$, where $p_1, \ldots, p_k$ are distinct primes of the form $2^m + 1$ (Fermat primes)
[8, p. 265]. Gleason recently showed that a regular n-gon is constructible with
compass, straightedge, and angle-trisector if and only if n has the factorization as
stated in the Corollary above [6].
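The criterion of the Corollary is easy to tabulate. The sketch below is an illustrative reading of it with hypothetical function names: strip all factors 2 and 3 from n, then require that what remains be a product of distinct primes p with p - 1 of the form $2^u 3^v$.

```python
def is_three_smooth(m: int) -> bool:
    """True if m has the form 2^u 3^v with u, v >= 0."""
    for f in (2, 3):
        while m % f == 0:
            m //= f
    return m == 1


def mira_constructible_ngon(n: int) -> bool:
    """Test the Corollary's factorization condition for a regular n-gon, n >= 3."""
    if n < 3:
        return False
    for f in (2, 3):                   # the factor 2^a 3^b is unrestricted
        while n % f == 0:
            n //= f
    d = 5
    while d * d <= n:                  # trial division over the remaining odd part
        if n % d == 0:
            n //= d
            if n % d == 0:             # a repeated prime larger than 3 is not allowed
                return False
            if not is_three_smooth(d - 1):
                return False
        d += 2
    return n == 1 or is_three_smooth(n - 1)


print([n for n in range(3, 31) if mira_constructible_ngon(n)])
```

For instance, the heptagon (7 = 2·3 + 1) and the nonagon (9 = 3²) pass the test although neither is constructible with compass and straightedge, while the 11-gon fails because 11 - 1 = 10 has the prime factor 5.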

The construction of a regular n-gon corresponds to adjoining a primitive n-th 
root of unity, resulting in a normal extension. Gleason shows that any normal 
extension of degree 3 of a constructible field is constructible by use of straightedge, 
compass, and angle-trisector, by observing that the corresponding irreducible 
polynomial has all real roots. Since elementary Mira constructions correspond to 
extensions (not necessarily normal) of degree 2 or 3 only, the Mira-constructible 
regular polygons are precisely those constructible by straightedge, compass, and 
angle trisector. 


CONCLUSION. We have shown that constructions performed with reflective 
devices such as the Mira can be formally defined, analyzed and justified mathemat- 
ically. In addition, the set of Mira constructibles extends the set of compass and 
straightedge constructibles in the same manner that the compass and straightedge 
constructibles extend the rationals.

Buffon Noodles


Ed Waymire 


1. INTRODUCTION. During the 1992 spring term, physicist Corinne Manogue 
gave a colloquium talk to our mathematics department in which she surveyed 
various mathematical notions which arise in a string theorist’s approach to quan- 
tum field theory. All this talk of strings and probability by Corinne made an 
impression on my colleague Robby Robson, who had an interesting hallway 
question the following day. Robby was wondering if I knew what would happen if 
Buffon had tossed a noodle in place of a needle (actually Buffon tossed baguettes)? 
Here the image of noodles is that of the long thin stringy type which always seem 
an entangled mess. So this story will feature entangled noodles. 


2. BUFFON NEEDLE. Let’s recall the Buffon needle problem. Snell ([7]) has a
very nice treatment together with original references and a wonderful dose of
historical perspective blended with modern computer simulations; that’s where I
learned about the baguette. One tosses a needle of length $L$ onto a plane surface
(floor) marked by parallel lines of width $D \ge L$ units apart. One asks for the
probability of the event $C$ that the needle intersects its closest line. The answer is
a (linear) function

$$p_0(x) = P(C) = \frac{2L}{\pi D} = \gamma_0 x \qquad (1)$$

of the ratio $x = L/D$ with slope $\gamma_0 = 2/\pi$ ($\approx .636$). A particularly interesting
aspect of this example is that one obtains a statistical method of estimating
numerical values of $\pi$ from the crossing frequency of repeated needle tossings.

One needs a model of the needle tossing experiment to compute the probability
in (1). For this, let $Y$ denote the vertical distance from the midpoint of the needle
to the nearest parallel and let $\Theta$ denote the acute angle made by the needle and
the nearest parallel. Then one assumes that $(Y, \Theta)$ is uniformly distributed over
the range of values $[0, D/2] \times [0, \pi/2]$. Now the problem is a geometry problem.
It is interesting for the ensuing discussion that the crossing probability (1) does not
change if one assumes that an endpoint of the needle is randomly (uniformly)
selected on a fixed vertical line and, independently, the angle made by the needle
with the vertical line is obtained by a random rotation from 0 to $2\pi$.
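The model just described translates directly into a simulation. The sketch below is an assumption-laden illustration (the function name, toss count, and seed are arbitrary): it draws $(Y, \Theta)$ uniformly on $[0, D/2] \times [0, \pi/2]$ and counts a crossing whenever the half-needle's vertical reach $(L/2)\sin\Theta$ is at least $Y$.

```python
import math
import random


def buffon_crossing_frequency(L: float, D: float, tosses: int = 200_000, seed: int = 0) -> float:
    """Fraction of simulated tosses in which a needle of length L <= D crosses a line."""
    rng = random.Random(seed)
    crossings = 0
    for _ in range(tosses):
        y = rng.uniform(0.0, D / 2.0)            # midpoint distance to the nearest line
        theta = rng.uniform(0.0, math.pi / 2.0)  # acute angle with the lines
        if y <= (L / 2.0) * math.sin(theta):
            crossings += 1
    return crossings / tosses


freq = buffon_crossing_frequency(L=1.0, D=2.0)
print("simulated frequency:", freq)
print("formula (1):        ", 2.0 * 1.0 / (math.pi * 2.0))
print("estimate of pi:     ", 2.0 * 1.0 / (freq * 2.0))
```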


3. BROWNIAN NOODLE. While others (e.g., see [1], [5]) have considered “noodle”
extensions defined by bending needles into various specific convex shapes to be
tossed, we shall allow the noodles to become randomly tangled. To begin, let us
first consider what happens in the case of a two-dimensional standard Brownian
motion $\{B(t) = (B_1(t), B_2(t))\}$ over a time interval of unit length and started at
random along a vertical line of length $D$, Figure 1a. Although the total length of
the Brownian noodle is $\infty$, one may introduce $L$ as a scale parameter by taking

$$S(t) = \frac{\sqrt{2}\,L}{2}\,B(t), \qquad 0 \le t \le 1. \qquad (2)$$

The answer in this case is

$$p(x) = P(C) = \frac{8}{\pi^2} \sum_{m=1}^{\infty} \frac{1}{(2m-1)^2}\left(1 - e^{-(2m-1)^2(\pi^2/4)(L/D)^2}\right). \qquad (3)$$

Now the parameter $L$ is a scale parameter and the gap is $D$. The crossing
probability in (3) depends only on the ratio $L/D$. The extra factor $\sqrt{2}/2$ is being
introduced here for the convenience of making comparisons with computations
later. For convenience now let $L' = \sqrt{2}L/2$.

Figure 1a. Sample Realization of Brownian Noodle Toss.

Figure 1b. Sample Realization of 100 Strand Noodle Toss.

To obtain the solution (3) one first observes that it is enough to consider the
one-dimensional Brownian motion $\{B_1(t)\}$ on the interval $[0, D/L']$ with uniform
initial distribution on $[0, D/L']$. We will need the joint distribution of $(M_1, m_1)$
where

$$M_1 = \max_{0 \le t \le 1} B_1(t), \qquad m_1 = \min_{0 \le t \le 1} B_1(t). \qquad (4)$$

For then the answer is simply $1 - P(0 < m_1 \le M_1 < D/L')$. There are various
ways to compute the joint distribution of the maximum and minimum in (4). One
approach is to first compute the transition probabilities of the Brownian motion on
$[0, D/L']$ with absorbing boundaries at $0$, $D/L'$ and starting at $0 < x < D/L'$.
Since these transition probabilities satisfy the heat equation on the interior of the
interval with Dirichlet boundary conditions one may compute the following
eigenfunction expansion given in ([2], p. 412-413):

$$p(t; x, y) = \frac{2L'}{D} \sum_{m=1}^{\infty} \exp\left(-\frac{m^2\pi^2 L'^2}{2D^2}\,t\right) \sin\left(\frac{m\pi L' x}{D}\right) \sin\left(\frac{m\pi L' y}{D}\right), \qquad 0 \le x, y \le \frac{D}{L'}. \qquad (5)$$

Now the desired probability (3) is the complementary probability to that obtained
by performing the indicated integrations in

$$P\left(0 < m_1 \le M_1 < \frac{D}{L'}\right) = \frac{L'}{D} \int_0^{D/L'} \int_0^{D/L'} p(1; x, y)\,dy\,dx. \qquad (6)$$

Here one also uses the familiar fact that the sum of reciprocal squares is $\pi^2/6$, so
that restricting the sum to reciprocals of squares of odd numbers yields $\pi^2/8$.

Figure 2a contains a plot of the two probabilities (1) and (3) as a function of the
ratio $x = L/D$. The corresponding result of a computer simulation is given in
Figure 2b.
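The comparison between (1) and (3) can be tabulated directly. The sketch below truncates the series in (3) at an arbitrary number of terms; the function names and the truncation level are assumptions made for illustration.

```python
import math


def needle_probability(x: float) -> float:
    """Formula (1): crossing probability 2x/pi for a needle, x = L/D <= 1."""
    return 2.0 * x / math.pi


def brownian_noodle_probability(x: float, terms: int = 200_000) -> float:
    """Formula (3), with the sum over odd integers 2m - 1 truncated after `terms` terms."""
    total = 0.0
    for m in range(1, terms + 1):
        k2 = (2 * m - 1) ** 2
        total += (1.0 - math.exp(-k2 * (math.pi**2 / 4.0) * x * x)) / k2
    return 8.0 / math.pi**2 * total


for x in (0.2, 0.4, 0.6, 0.8, 1.0):
    print(x, round(needle_probability(x), 4), round(brownian_noodle_probability(x), 4))
```

The truncation error is of order 1/(4 · terms) times the prefactor, so the default is accurate to roughly six decimal places.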


Figure 2a. Theoretical Brownian Noodle Crossing Probabilities $p(x)$ (plot of $p(x)$ and $p_0(x)$ against $x$).

Figure 2b. 5000 Toss Simulated Crossing Frequencies for 100 Strand Brownian Noodle.


4. “REAL” NOODLES. Next let us consider a real string of $n$ needles of lengths
$L/\sqrt{n}$ each, and strung together in independent random directions. The total
length is then $n(L/\sqrt{n}) = L\sqrt{n}$. In order to preserve the ratio as $x = L/D$, the
gap between parallel lines is chosen as $D\sqrt{n}$. Since we already know the answer
for $n = 1$ by (1), let’s do the problem for $n > 1$. In this case we first note that the
functional central limit theorem provides an approximation to the distribution of
the string of needles by the Brownian noodle. In particular, observe that the
lengths of the projections of the needle are i.i.d. with, up to the scale factor
$L/\sqrt{n}$, the arcsine distribution $P(Y \in dy) = \frac{1}{\pi\sqrt{1 - y^2}}\,dy$. This distribution
has mean 0 and variance 1/2. Thus the Brownian motion approximation has
diffusion coefficient $L^2/2$. This explains the scaling factor $(\sqrt{2}/2)L$ introduced
earlier.


5. FIRST ROUND APPROXIMATIONS. In the limit the approximation involves a
noodle $\{S(t): 0 \le t \le 1\}$ of length $\infty$ in a gap of width $\infty$, but in the ratio $x = L/D$.
This is how string theorists discussing quantum field theory can sound! One might
“expect” the noodle crossing frequency to be estimated by $p(x/\sqrt{n})$ as this
corresponds to formally replacing $D$ by $\sqrt{n}\,D$ in (3). This curve is given in Figure
3a with $n = 100$. Figure 3b provides the crossing frequencies $p(x)$ obtained from
a simulation of 5000 tosses with $n = 100$ for comparison. However, since the effect
of fixing the gap to length ratio in the above computation is a second rescaling of
the string of noodles by $1/\sqrt{n}$, the central limit theorem approximation by the
Brownian noodle is not justified. On the other hand, the formula (3) is applicable
to strings composed of a large number $n$ of needles of (fixed) lengths $x$ each
tossed onto a surface with gap width $\sqrt{n}$. This fact is illustrated by the simulation
in Figure 2b of the corresponding crossing frequencies $p(x)$ for a Brownian
noodle with $n = 100$, $N = 5000$; i.e. these are the crossing frequencies of $-\sqrt{n}/2$
or $\sqrt{n}/2$ by a string of $n$ randomly (uniformly) oriented needles of length $x$ each
with endpoint randomly distributed over $[-\sqrt{n}/2, \sqrt{n}/2]$.
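The simulation described in the last sentence can be sketched as follows. This is an illustration under stated assumptions rather than the program behind the figures: the function name, toss count, and seed are hypothetical, and only the coordinate perpendicular to the lines is tracked, since a straight needle crosses a line exactly when one of its endpoints lies beyond it.

```python
import math
import random


def noodle_crossing_frequency(x: float, n: int = 100, tosses: int = 2000, seed: int = 0) -> float:
    """Fraction of tosses in which a string of n needles of length x leaves a gap of width sqrt(n)."""
    rng = random.Random(seed)
    half_gap = math.sqrt(n) / 2.0
    crossings = 0
    for _ in range(tosses):
        pos = rng.uniform(-half_gap, half_gap)      # starting endpoint of the string
        for _ in range(n):
            # Projection of a uniformly oriented needle onto the direction across the lines.
            pos += x * math.cos(rng.uniform(0.0, 2.0 * math.pi))
            if abs(pos) >= half_gap:
                crossings += 1
                break
    return crossings / tosses


print(noodle_crossing_frequency(0.5))
```

For moderate $x$ the result should lie near the value of (3) at the same $x$, though slightly below it, since the string is only checked at its $n$ joints.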

Figure 3a. Theoretical Brownian Noodle Crossing Probabilities $p(x/\sqrt{100})$.

Figure 3b. 5000 Toss Simulated Crossing Frequencies for 100 Strand Noodle.

Due to the additional rescaling of the gap width, there will be no loss in
generality to consider the crossing probability of strings composed of $n$ needles of
length $x \le 1$ each and a gap of width $n$. A quick and clean (rigorous) upper bound
on the crossing probability can be obtained with Doob’s maximal inequality ([2],
pp. 49-52). In particular one obtains, using translation invariance of the problem
to pose it with one end started uniformly in $[-n/2, n/2]$, that

$$P(C) \le \frac{4x^2}{n} + \frac{2}{3}. \qquad (7)$$

For large $n$ this bounds the probability by the essentially constant value 2/3+.
Recall that the maximum crossing probability for the needle (1) is $2/\pi$ = 2/3−.
To make this estimate, let $\{S_k\}$ denote the random walk starting at 0 with
displacement distribution $P(Y \in dy) = \frac{1}{\pi\sqrt{1 - y^2}}\,dy$, and let $Z^{(n)} = nU$ be
uniformly distributed on $[-n/2, n/2]$ independently of the random walk. The
details of the distribution of $Y$ do not play a role beyond first and second
moments. The essential observation is that by symmetry $\{xS_k + Z^{(n)}: k = 0, 1, \ldots\}$
is a martingale. Thus, Doob’s inequality gives (7), after conditioning on $Z^{(n)}$ and
using symmetry, as follows:

$$
\begin{aligned}
P(C) &= E\,P\Big(\Big\{\max_{k \le n}(xS_k + Z^{(n)}) \ge \tfrac{n}{2}\Big\} \cup \Big\{\min_{k \le n}(xS_k + Z^{(n)}) \le -\tfrac{n}{2}\Big\} \,\Big|\, Z^{(n)}\Big) \\
&\le E\,P\Big(\max_{k \le n}(xS_k + Z^{(n)}) \ge \tfrac{n}{2} \,\Big|\, Z^{(n)}\Big) + E\,P\Big(\min_{k \le n}(xS_k + Z^{(n)}) \le -\tfrac{n}{2} \,\Big|\, Z^{(n)}\Big) \\
&= 2E\,P\Big(\max_{k \le n}(xS_k + Z^{(n)}) \ge \tfrac{n}{2} \,\Big|\, Z^{(n)}\Big) \le 2 \cdot \frac{4}{n^2}\,E(xS_n + Z^{(n)})^2 \\
&= \frac{8}{n^2}\big(x^2 ES_n^2 + E(Z^{(n)})^2\big) = \frac{8}{n^2}\Big(\frac{x^2 n}{2} + \frac{n^2}{12}\Big) = \frac{4}{n}x^2 + \frac{2}{3}. \qquad (8)
\end{aligned}
$$

A bound as universal as (7) will be much too large for many cases of interest. More
precise estimates of the crossing probability upper bound are given in the next
section.

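For a lower bound, let $D(\beta)$ denote the event that the initial end of the string
falls within $\beta x$, $\beta \le 1$, units from the boundary. Then $D(\beta)$ has probability
$2\beta x/n$. Also, from the geometry $P(C \mid D(\beta)) \ge \mathrm{Cos}^{-1}(\beta)/\pi$ since, for any $n$,
crossing is implied by the occurrence of the angle of the first needle between
$\pm\mathrm{Cos}^{-1}(\beta)$. Thus, for any $0 < \beta < 1$, $P(C) \ge (2\beta x/n)(\mathrm{Cos}^{-1}(\beta)/\pi)$ so that

$$P(C) \ge \lambda(n)x, \quad \text{where } \lambda(n) = \frac{2}{n\pi} \max_{0 \le \beta \le 1} \beta\,\mathrm{Cos}^{-1}(\beta) \approx \frac{.357}{n}. \qquad (9)$$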
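The numerical constant in (9) can be reproduced with a simple grid search. The function name and grid size below are arbitrary choices made for illustration.

```python
import math


def lower_bound_constant(grid: int = 200_000) -> float:
    """Approximate (2/pi) * max over 0 <= beta <= 1 of beta * arccos(beta)."""
    best = max((i / grid) * math.acos(i / grid) for i in range(grid + 1))
    return 2.0 * best / math.pi


print(lower_bound_constant())  # compare with the constant .357 in (9)
```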


6. LARGE DEVIATIONS. As noted above, the functional central limit theorem
approximation involved in replacing the string of needles by a Brownian motion for
large $n$ provides probabilities of events involving $O(1)$ fluctuations in the shape of
the string of needles; the individual needle lengths already being $O(1/\sqrt{n})$.
However, the crossing probabilities of $\sqrt{n}\,D$ involve $O(\sqrt{n})$ deviations from this
picture which are not obtained by this approximation. These fluctuations have
probabilities which also depend on the details of the angular distribution of the
needles in the string. The precise probability computation desired here is a large
deviation problem of the type which arises in the actuarial mathematics of insurance
risk. In fact, the classic work of Cramér on large deviations was motivated by
actuarial problems. Posed this way, the noodle problem also has an interesting
twist in that it involves a “two-sided ruin event.”

Figure 4a. Sample 0 Degree Single Strand Noodle Realization.

Figure 4b. Sample 100 Strand 45 Degree Noodle Realization.

To see how the distribution will depend on the angle distribution, consider the
degenerate case in which the angles made by the needles comprising the string
with the horizontal are each $\pi/2$ (nonrandom). Then the noodle is a needle with
fixed orientation and the only randomness is in the distribution of the endpoint;
see Figure 4a. For this case one readily finds that $p(x) = P(C) = x$. (This
probability is unchanged if one randomizes the choice of endpoint.) To see how
large deviation estimates go in a simple but less trivial situation for which explicit
computations are still possible, consider the case in which the needles comprising
the string make angles $\pm\pi/4$ with the horizontal with equal probabilities; see
Figure 4b for a sample outcome of a simulation of a single toss with $n = 100$.
Since the variance in step size is one in this case, the rescaling by $\sqrt{2}/2$ is
unnecessary for the comparison with the (nonrigorous) central limit theorem
prediction; i.e. replace x by x/2 in Figure 3a to correct for the $\sqrt{2}/2$ before
comparing with the observed Figure 5. In this case the lengths of the projections
of the needle are i.i.d. with, up to the scale factor $L$, a Bernoulli distribution


$P(Y_1 = +1) = P(Y_1 = -1) = 1/2$. Let $S_n = Y_1 + Y_2 + \cdots + Y_n$, $S_0 = 0$. More
generally, assume $\phi(s) = Ee^{sY_1}$ finite and note for i.i.d. $\{Y_k\}$ and $s \ge 0$,
$(\phi(s))^n = Ee^{sS_n} \ge e^{snb}P(S_n \ge nb)$. That is, for arbitrary $s \ge 0$

$$P(S_n \ge nb) \le e^{n(\log\phi(s) - sb)}. \qquad (10)$$

In particular,

$$P(S_n \ge nb) \le e^{n\chi^*(b)}, \quad \text{where } \chi^*(b) = \inf_s\,(\log Ee^{sY_1} - sb). \qquad (11)$$

The reverse (asymptotic) inequality for $b > EY_1$, yielding

$$\frac{1}{n}\log P(S_n \ge nb) \to \chi^*(b) \quad \text{as } n \to \infty, \qquad (12)$$

is the often cited Cramér-Chernoff theorem on large deviations of $S_n/n$ from $EY_1$.
That is to say, $P(S_n \ge nb) \sim e^{n\chi^*(b)}$ as $n \to \infty$; the rate $-\chi^*(b)$ is a large deviation
rate. The reader may consult the developments given in ([3], pp. 147-148) and ([4],
pp. 57-63) for some basics of large deviation theory.

Using the reflection principle for simple symmetric random walk one may also
check that ([2], p. 10, 70), $P(\max_{k \le n} S_k \ge b) = 2P(S_n \ge b) - P(S_n = b)$. In
particular one obtains from this,

$$P\Big(\max_{k \le n} S_k \ge b\Big) \le 2P(S_n \ge b). \qquad (13)$$

These results will serve as our basic tools. Notice that for large $n$ it is no more than
a venial sin to think of the inequality (13) as equality; i.e. the resulting probability
bound may be expected to be close to the actual probability.
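The reflection identity behind (13) can be confirmed by brute force for small $n$. The sketch below enumerates all $2^n$ paths of the simple symmetric walk; the function name is hypothetical, and the check is restricted to integer levels $b \ge 1$ (the walk starts at $S_0 = 0$).

```python
from itertools import product


def reflection_counts(n: int, b: int):
    """Count paths of length n with (max_k S_k >= b, S_n >= b, S_n == b)."""
    hit = end_ge = end_eq = 0
    for steps in product((-1, 1), repeat=n):
        s = running_max = 0
        for y in steps:
            s += y
            running_max = max(running_max, s)
        hit += running_max >= b
        end_ge += s >= b
        end_eq += s == b
    return hit, end_ge, end_eq


n = 10
for b in range(1, n + 1):
    hit, end_ge, end_eq = reflection_counts(n, b)
    # Dividing each count by 2**n gives the probabilities in the identity.
    print(b, hit, 2 * end_ge - end_eq)
```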

First let us compute the so-called Legendre transform $\chi^*(b)$ of the function
$\chi(s) = \log Ee^{sY_1}$ required in (11). One may easily check by calculating limits as
$s \to \pm\infty$ that $\chi^*(b) = -\infty$ if $|b| > 1$. For $|b| < 1$, equating the derivative with
respect to $s$ of $\{\log((e^s + e^{-s})/2) - sb\}$ to 0 and solving for $s$ gives
$e^{2s} = (1 + b)/(1 - b)$ and therefore

$$\chi^*(b) = -\frac{1 + b}{2}\log(1 + b) - \frac{1 - b}{2}\log(1 - b). \qquad (14)$$

As before let $Z^{(n)} = nU$ be uniformly distributed over $[-n/2, n/2]$ and independent
of $\{S_k\}$. Then one has along the lines of (8) that

$$P(C) \le 2P\Big(\max_{k \le n} S_k \ge \frac{n}{2x} + \frac{nU}{x}\Big) \le 4P\Big(S_n \ge \frac{n}{2x} + \frac{nU}{x}\Big), \qquad (15)$$

first using the symmetry of the distribution of both $\{S_k\}$ and $U$, and then the
reflection principle. By conditioning on $U$, using (15), the argument for (11)
conditionally, and then (14), one has after taking expected values (with respect to
$U$),

$$
\begin{aligned}
P(C) &\le 4E\,P\Big(S_n \ge \frac{n}{2x} + \frac{nU}{x} \,\Big|\, U\Big) \le 4E\exp\Big\{n\chi^*\Big(\frac{1}{2x} + \frac{U}{x}\Big)\Big\} \\
&= 4x\int_0^1 \exp\Big\{-\frac{n}{2}\big[(1 + y)\log(1 + y) + (1 - y)\log(1 - y)\big]\Big\}\,dy = \gamma_1(n)\,x. \qquad (16)
\end{aligned}
$$
Note that the (concave) function

$$\chi^*(y) = -(1/2)\big[(1 + y)\log(1 + y) + (1 - y)\log(1 - y)\big]$$

and its first derivative are zero at $y = 0$. The second derivative at 0 is $-1$. As a
result one may bound by a Taylor’s approximation $-(1/2)y^2$ to obtain by a simple
change of variable and the fact that

$$\int_0^\infty e^{-z^2/2}\,dz = \sqrt{2\pi}/2,$$

$$\gamma_1(n) = 4\int_0^1 \exp\Big\{-\frac{n}{2}\big[(1 + y)\log(1 + y) + (1 - y)\log(1 - y)\big]\Big\}\,dy \le 2\sqrt{2\pi}\,n^{-1/2} \approx 5.01\,n^{-1/2}. \qquad (17)$$


Observe from this that the crossing probabilities are theoretically predicted to be 
smaller than the universal bound (7) for 45 Degree strings consisting of n strands. 
Figure 5 depicts the crossing frequencies $p(x)$ for a simulation of N = 1000
tosses with n = 100. A sample realization of a toss was plotted in FIGURE 4b. 
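The integral in (17) is easy to evaluate numerically and compare with the Gaussian bound. The sketch below uses a midpoint rule; the names `chi_star` and `gamma_45`, and the number of integration steps, are assumptions made for illustration.

```python
import math


def chi_star(y: float) -> float:
    """Formula (14) on 0 <= y <= 1, with the convention 0 * log(0) = 0."""

    def xlogx(t: float) -> float:
        return t * math.log(t) if t > 0.0 else 0.0

    return -0.5 * (xlogx(1.0 + y) + xlogx(1.0 - y))


def gamma_45(n: int, steps: int = 100_000) -> float:
    """Midpoint-rule value of 4 * integral over [0, 1] of exp(n * chi_star(y)) dy."""
    h = 1.0 / steps
    return 4.0 * h * sum(math.exp(n * chi_star((i + 0.5) * h)) for i in range(steps))


for n in (25, 100, 400):
    print(n, gamma_45(n), 2.0 * math.sqrt(2.0 * math.pi) / math.sqrt(n))
```

For each $n$ the computed value stays below the bound $2\sqrt{2\pi}\,n^{-1/2}$ and approaches it as $n$ grows.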
Finally, observe that the argument used to arrive at the lower bound (9) can be
simply adapted to this case to give

$$\cdots \le P(C) \le \gamma_1(n)\,x \sim \frac{2\sqrt{2\pi}}{\sqrt{n}}\,x. \qquad (18)$$

Figure 5. 5000 Toss Simulation Crossing Frequencies for 100 Strand 45 Deg Noodle.

To extend these estimates to the case of uniform angles on $[0, 2\pi)$ first requires
a general inequality of the form (15). Such an extension is possible for symmetric
random variables (Lévy’s inequality), ([5], p. 14; [2], Exc. #4, p. 70). The
corresponding bound in place of (16) for this case is, following the first two
inequalities of (16),

$$P(C) \le 4E\exp\Big\{n\chi^*\Big(\frac{1}{2x} + \frac{U}{x}\Big)\Big\} = 4x\int_0^{1/x} e^{n\chi^*(y)}\,dy = 4x\int_0^1 e^{n\chi^*(y)}\,dy, \qquad (19)$$


where now for $|y| > 1$, $\chi^*(y) = -\infty$, but for $|y| < 1$

$$\chi^*(y) = \inf_s\Big\{\log\frac{1}{\pi}\int_{-1}^{1}\frac{e^{st}}{\sqrt{1 - t^2}}\,dt - sy\Big\}. \qquad (20)$$

The crossing frequencies $p(x)$ obtained from a simulation of 5000 tosses with
$n = 100$ strands were provided earlier in Figure 3b. To obtain the asymptotic
value of $\gamma_2(n)$ one may proceed as in the above case to compute the values of
$\chi^*(y)$ and its first two derivatives at $y = 0$. This is most easily accomplished by the
following Legendre transform duality equations: Let

$$\chi(s) = \log\frac{1}{\pi}\int_{-1}^{1}\frac{e^{st}}{\sqrt{1 - t^2}}\,dt = \log\frac{1}{\pi}\int_{-\pi/2}^{\pi/2} e^{s\sin\theta}\,d\theta. \qquad (21)$$

Then

$$\frac{d\chi(s)}{ds} = y, \qquad \frac{d\chi^*(y)}{dy} = -s, \qquad \chi^*(y) = \chi(s(y)) - y\,s(y). \qquad (22)$$

One may check from this that $\chi^*(y)$ and its first derivative are 0 at $y = 0$ and the
second derivative is $-1/2$. It follows as above that

$$\gamma_2(n) = 4\int_0^1 e^{n\chi^*(y)}\,dy \le 4\sqrt{\pi}\,n^{-1/2} \approx 7.09\,n^{-1/2}. \qquad (23)$$


A lower bound was given at (9). In this case the crossing probability prediction falls 
below the bound (7). 

What we have presented here is the stuff with which probability bounds and 
estimates can be made. The computer simulations provide an order of magnitude 
feel for crossings, but no more than this for n large. The reader desiring a more
precise recipe for “n-noodle pi” may want to give more careful consideration to
the small n cases, beginning with n = 2. We only hope to have whetted the appetite
for more probability and noodles.

PICTURE PUZZLE
(from the collection of Paul Halmos) 


What can these men talk about? 
(see page 570) 

“Abstractness, sometimes hurled as a reproach 
at mathematics, is its chief glory and its surest 
title to practical usefulness. It is also the 
source of such beauty as may spring from 
mathematics.” 


—E.T. Bell 


Answer to Picture Puzzle 
(p. 559) 


What except algebra: they are Irving Kaplansky, 
Dan Zelinsky, and Gerhard Kalisch. The 
picture was taken in 1969 in Honolulu. 

NOTES 


Edited by: John Duncan 


On a Curious Property of Counting 
Sequences 


Victor Bronstein and Aviezri S. Fraenkel 


This note is dedicated to the memory of Professor Joseph Gillis who passed away 
on Nov. 18, 1993, in his 82nd year. 


1. INTRODUCTION. A counting sequence $\mathcal{S}$ is a sequence of sequences $\{S_i\}_{i=0}^{\infty}$ 
of positive integers. The sequence $S_{i+1}$ is obtained from $S_i$ by counting the 
number $m_k$ of times an integer $k$ occurs in $S_i$ and writing down in $S_{i+1}$ the pairs 
$m_k, k$ in increasing order of $k$, for all $k$ for which $m_k > 0$. 


Example. Beginning with $S_0 = (1)$, and with $S_0 = (6,7)$, the first few elements of 
the two resulting sequences $\mathcal{S}$ are depicted in Table 1. 


TABLE 1. Initial elements of the counting sequences 
for $S_0 = (1)$ and $S_0 = (6, 7)$ 

  i    $S_0 = (1)$    $S_0 = (6, 7)$ 
  0    1              67 
  1    11             1617 
  2    21             211617 
  3    1112           31121617 
  4    3112           4112131617 
  5    211213         511213141617 
  6    312213         61121314151617 
  7    212223         71121314152617 
  8    114213         61221314151627 
  9    31121314       51321314152617 


As usual, $\mathbb{Z}^0$ and $\mathbb{Z}^+$ denote the set of nonnegative integers and the set of 
positive integers respectively. The length $|S_i|$ of $S_i \in \mathcal{S}$ is the number of elements 
of $S_i$ (counting multiplicities). Thus $|S_1| = 2$ for $S_1$ in the left column of Table 1, 
and $|S_1| = 4$ for $S_1$ in the right column. Any sequence of finite length is called 
finite. What is the asymptotic behavior of the sequences $S_i$? Do the lengths, and 
hence the elements, of $\{S_i\}$ grow without bound? 

A counting sequence $\mathcal{S}$ is called ultimately periodic if there exist 
integers $i_0 \ge 0$ and $p \ge 1$ such that $S_i = S_{i+p}$ for all $i \ge i_0$. The smallest such $p$ is called 
the period of $\mathcal{S}$, and the smallest such $i_0$ is called the preperiod of $\mathcal{S}$. For 
example, $S_0 = (2, 2)$ is ultimately periodic with $i_0 = 0$ and $p = 1$, i.e., it is periodic 
with period 1 from the beginning. Our purpose is to prove the following surprising 
fact. 

Theorem. Beginning with any finite initial sequence $S_0$, all the sequences $S_i \in \mathcal{S}$ 
have bounded length, and $\mathcal{S}$ is ultimately periodic. 


For example, continuing Table 1 further, we get Table 2, from which we see that 
periodicity begins with $i_0 = 12$ ($p = 1$) for $\mathcal{S}$ in the left column and with $i_0 = 10$ 
($p = 3$) for $\mathcal{S}$ in the right column. 


TABLE 2. A continuation of Table 1 

  i     $S_0 = (1)$               $S_0 = (6, 7)$ 
  10    41122314                  51222314251617   ($i_0 = 10$) 
  11    31221324                  41421314251617 
  12    21322314   ($i_0 = 12$)   51221334151617 
  13    21322314                  51222314251617 
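
The entries of Tables 1 and 2 can be regenerated mechanically from the definition in the Introduction. The following Python sketch is only an illustration; the function names `successor` and `preperiod_and_period` are hypothetical and not part of the note. It builds $S_{i+1}$ from $S_i$ and reports the preperiod $i_0$ and period $p$ by iterating until a term repeats.

```python
from collections import Counter

def successor(seq):
    """Return S_{i+1}: the pairs (m_k, k) in increasing order of k, flattened."""
    counts = Counter(seq)
    out = []
    for k in sorted(counts):
        out.extend((counts[k], k))
    return tuple(out)

def preperiod_and_period(s0, max_steps=10_000):
    """Iterate from S_0 until a term repeats; return (i_0, p)."""
    seen = {}                      # term -> index of its first occurrence
    s = tuple(s0)
    for i in range(max_steps):
        if s in seen:
            return seen[s], i - seen[s]
        seen[s] = i
        s = successor(s)
    raise RuntimeError("no repetition found within max_steps")

print(preperiod_and_period((1,)))    # left column of Tables 1 and 2
print(preperiod_and_period((6, 7)))  # right column of Tables 1 and 2
```

Because the successor map is deterministic, the first repeated term marks the start of the cycle, so the index of its first occurrence is the preperiod and the gap between the two occurrences is the period.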


2. THE PROOF. We begin with some notation and definitions. Let $S_i \in \mathcal{S}$ 
($i \ge 1$). Then $S_i$ consists of pairs of positive integers $m_j(i), f_j(i)$. The elements 
$m_j(i)$ are called multipliers and the elements $f_j(i)$ factors. Note that 
$f_1(i) < f_2(i) < f_3(i) < \cdots$. It is easy to see that $|S_{i+1}| \le 2|S_i|$, with equality if and only if 
all elements of $S_i$ are distinct. Thus $S_0$ finite implies $S_i$ finite for every $i \ge 1$. The 
sum of multipliers of $S_i$ is denoted by $M(S_i)$, i.e., $M(S_i) = \sum_{j \ge 1} m_j(i)$. If $S_i, S_j \in \mathcal{S}$ 
with $j > i$, then $S_j$ is a successor of $S_i$. The maximum element of $S_i$ is denoted by 
$\max(S_i)$, i.e., $\max(S_i) = \max_j (m_j(i), f_j(i))$. 

We now collect a few simple properties of $\mathcal{S}$. 

(i) $S_i$ ($i \ge 1$) has even length. ($S_i$ consists of pairs.) 

(ii) For $i \ge 1$, $M(S_i) = |S_{i-1}|$. (The sum of multipliers of $S_i$ counts exactly all 
the elements of $S_{i-1}$.) 

(iii) The sequence of lengths is nondecreasing: $|S_1| \le |S_2| \le \cdots$, and also 
$\max(S_0) \le \max(S_1) \le \cdots$. (If any integer $k$ appears in $S_i$, it must also appear in 
$S_{i+1}$ ($i \ge 0$). But we may have $|S_0| > |S_1|$.) 

(iv) If $S_{i+1}$ contains some integer not appearing in $S_i$, then it appears in $S_{i+1}$ as 
a multiplier, not as a factor. (The set of factors of $S_{i+1}$ equals the set of all 
distinct integers of $S_i$.) 

(v) For $i \ge 1$, the largest factor $f_n(i)$ of $S_i$ is equal to the maximum element of $S_{i-1}$, i.e., 
$f_n(i) = \max(S_{i-1})$. 

(vi) Let 

$$S_i = (m_1(i), f_1(i), m_2(i), f_2(i), \ldots, m_n(i), f_n(i)) \quad (i \ge 1).$$ (1) 

Then $f_n(i) \ge n$ (since $1 \le f_1(i) < f_2(i) < f_3(i) < \cdots$). If $f_n(i) = n$ (and hence 
$f_j(i) = j$ for all $j \le n$), then $S_i$ is called complete. 
Lemma 1. Let $S_i$ have the form (1) ($i \ge 1$). Then 

$$\max_{1 \le j \le n} m_j(i) \le |S_{i-1}| - \tfrac{1}{2}|S_i| + 1 \le f_n(i) + 1.$$ 

Proof: In view of (ii), $m_j(i)$ attains its largest value if $m_h(i) = 1$ for all $h \ne j$. Thus 
by (ii), 

$$m_j(i) \le |S_{i-1}| - (n - 1) = |S_{i-1}| - \tfrac{1}{2}|S_i| + 1$$ 
$$\le 2n - n + 1 = n + 1 \quad \text{(by (iii))}$$ 
$$\le f_n(i) + 1 \quad \text{(by (vi))}. \qquad ∎$$ 

Corollary. Let $S_i$ be given by (1). Then $\max(S_i) \le \max(S_{i-1}) + 1$ ($i \ge 2$). If equality 
holds, then $\max(S_i) = \max_{1 \le j \le n} m_j(i) = f_n(i) + 1$. 


Proof: If $\max_j m_j(i) < f_n(i) + 1$, then by (v), $\max(S_i) = f_n(i) = \max(S_{i-1})$. By 
Lemma 1, the only other possibility is $\max_j m_j(i) = f_n(i) + 1$. Then by Lemma 1 
and by (v), 

$$\max(S_i) = \max_j m_j(i) = f_n(i) + 1 = \max(S_{i-1}) + 1. \qquad ∎$$ 


Lemma 2. Let $S_i$ be given by (1) with $n > 2$. Then $|S_j| \le 2(f_n(i) + 1)$ for all $j \ge i$. 


Proof: Suppose $S_i$ has a successor $S_j$ of length $2l > 2(f_n(i) + 1)$. Then (vi) implies 
that $f_l(j) > f_n(i) + 1$, hence the maximum element of $S_j$ is also $> f_n(i) + 1$. By 
the Corollary, there exists a successor $S_k = (m_1(k), f_1(k), m_2(k), f_2(k), \ldots)$ of $S_i$, 
with smallest index $k > i$, whose largest element is $f_n(i) + 1$, say $|S_k| = 2t$. The 
minimality of $k$ and (iv) imply that $f_n(i) + 1$ is a multiplier of $S_k$, say $m_h(k) = 
f_n(i) + 1$, and $f_t(k) = f_n(i)$. Hence 

$$M(S_k) = f_n(i) + 1 + \sum_{j \ne h} m_j(k) \ge f_n(i) + 1 + (t - 1) = f_n(i) + t.$$ 

On the other hand, by (ii) and (vi), $M(S_k) = |S_{k-1}| \le 2t \le t + f_n(i)$. The last two 
chains of inequalities imply that there is equality throughout. Thus 

a. $t = f_n(i) = f_t(k)$ and thus $S_k$ is complete. 

b. For all $j \ne h$, $m_j(k) = 1$. 

It follows that 

$$S_{k+1} = (f_n(i), 1,\ 1, 2,\ 1, 3,\ \ldots,\ 1, f_n(i) - 1,\ 1, f_n(i),\ 1, f_n(i) + 1),$$ 
$$S_{k+2} = (f_n(i) + 1, 1,\ 1, 2,\ 1, 3,\ \ldots,\ 1, f_n(i) - 1,\ 2, f_n(i),\ 1, f_n(i) + 1),$$ 

with $|S_{k+1}| = |S_{k+2}| = 2(f_n(i) + 1)$. 

Let $S_q$ be a successor of $S_{k+2}$ with smallest index $q$, which has an element 
$f_n(i) + 2$. Then $q > k + 2$. By (iv), $f_n(i) + 2$ appears in $S_q$ as a multiplier, so $S_{q-1}$ 
must have $f_n(i) + 1$ identical multipliers. Since $q - 1 \ge k + 2$, we have $|S_{q-1}| = 
2(f_n(i) + 1)$. Hence all the multipliers of $S_{q-1}$ have the same value, say $r$. By (ii), 
$M(S_{q-1}) = (f_n(i) + 1)r = |S_{q-2}|$. Now $q - 2 \ge k + 1$, so $|S_{q-2}| = 2(f_n(i) + 1)$, 
hence $r = 2$, so 

$$S_{q-1} = (2, 1,\ 2, 2,\ 2, 3,\ \ldots,\ 2, f_n(i),\ 2, f_n(i) + 1).$$ 

It follows that $S_{q-2}$ has the numbers $1, 2, \ldots, f_n(i) + 1$ as multipliers. Thus by (ii), 

$$M(S_{q-2}) = (f_n(i) + 1)(f_n(i) + 2)/2 = |S_{q-3}| \le 2(f_n(i) + 1),$$ 

so $f_n(i) \le 2$. But $f_n(i) \ge n > 2$ by hypothesis. Thus there is no such successor $S_q$, 
and so $|S_j| \le 2(f_n(i) + 1)$ for all $j \ge i$. ∎ 


Proof of the Theorem: Lemma 2 implies that the sequences $S_i \in \mathcal{S}$ have bounded 
length. Since the number of sequences $S_i \in \mathcal{S}$ of length $|S_i| \le 2k$ is also 
bounded, say by $(k + 1)^{2k}$, the family is ultimately periodic. ∎ 
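
As a sanity check on Lemma 2, the bound $|S_j| \le 2(f_n(i) + 1)$ can be tested along the orbits shown in Tables 1 and 2. The sketch below is illustrative only: the helper names `orbit` and `lemma2_holds` are hypothetical, and the check covers a few starting sequences rather than proving anything.

```python
from collections import Counter

def successor(seq):
    """Return S_{i+1}: the pairs (m_k, k) in increasing order of k, flattened."""
    counts = Counter(seq)
    out = []
    for k in sorted(counts):
        out.extend((counts[k], k))
    return tuple(out)

def orbit(s0):
    """Return S_0, S_1, ... up to (not including) the first repeated term."""
    seen, terms, s = set(), [], tuple(s0)
    while s not in seen:
        seen.add(s)
        terms.append(s)
        s = successor(s)
    return terms

def lemma2_holds(s0):
    """Check |S_j| <= 2(f_n(i) + 1) for all j >= i, for every S_i (i >= 1) with n > 2 pairs."""
    terms = orbit(s0)
    for i in range(1, len(terms)):
        n = len(terms[i]) // 2
        if n > 2:
            bound = 2 * (terms[i][-1] + 1)  # the last entry of S_i is its largest factor f_n(i)
            if any(len(t) > bound for t in terms[i:]):
                return False
    return True

print(lemma2_holds((1,)), lemma2_holds((6, 7)))
```

Every later term of the sequence already occurs in the list returned by `orbit`, since the sequence cycles from the first repetition onward, so checking that list is enough.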


If $S_0$ is an infinite sequence, it is possible that some successor is not defined. 
Thus if $S_0 = (1,2,3,\ldots)$, then $S_1 = (1,1,\ 1,2,\ 1,3,\ldots)$ and $m_1(2)$ is not defined in 
$S_2$. But for $k = 1,2,\ldots$ we can let $m_k$ be the smallest positive integer which 
makes $S_\infty$ a fixed point. Then 


$S_\infty$ = (2,1, 3,2, 2,3, 1,4, 5,5, 5,6, 5,7, 5,8, 6,9, 6,10, 6,11, 
6,12, 7,13, 7,14, 7,15, 7,16, 8,17, 8,18, 8,19, 8,20, 
9,21, 9,22, 9,23, 9,24, 9,25, 10,26, ...). (2) 


We could replace the prefix p = (2,1, 3,2, 2,3, 1, 4) of (2) by p = (1), and then 
S,. would have the form (2). 


QUESTIONS. (i) Give formulae or meaningful bounds for the preperiod $i_0$. (As 
for the period $p$, it turns out that for the case of complete sequences, $p \le 2$ except 
that $p = 3$ for the example in the right column of Table 1. A similar result holds 
for sequences that are not complete.) 

(ii) Is there an infinite sequence $S_0$ for which every successor is defined, such 
that $S_{i+1}$ differs from $S_i$ in infinitely many elements for some $i$? 

(iii) Is there an infinite sequence $S_0$ such that every successor is defined and $\mathcal{S}$ 
is not ultimately periodic? 


Chaos Without Nonperiodicity 


Carsten Knudsen 


Recently the mathematical definition of chaos proposed by Devaney in his book [3] 
has been revisited by several authors [1, 2]. The definition of chaos proposed by 
Devaney is 


Devaney’s definition of chaos. Let f be a continuous transformation of a bounded 
metric space (X,d). If f is topologically transitive, the periodic points of f are dense 
in X, and f exhibits sensitive dependence on the initial conditions, then f is said to be 
chaotic. 


Sensitive dependence on the initial conditions is from a physical point of view 
the central element of chaos; in fact, Gulick in his book [4] has defined chaos 
synonymously with sensitive dependence on the initial conditions. Banks et al. [2] 
pointed out that sensitive dependence on the initial conditions is a redundant 
element in Devaney’s definition, because it follows from topological transitivity 
and denseness of the periodic points. Assaf and Gadbois [1] later showed by 
construction of counterexamples that neither topological transitivity nor denseness 
of the periodic points follows from the remaining two properties. 

The aim of this note is to prove that sensitive dependence on the initial 
conditions and topological transitivity are stable properties under closure, as well 
as under restriction to dense invariant subsets. The consequence of the results to 
be presented is that chaos according to Devaney [3] may exist on bounded but 
non-compact spaces without any nonperiodic orbits. 


Theorem 1. Let $f$ be a continuous transformation of a bounded metric space $(X, d)$. 
Let $f: Y \to Y$ where $\overline{Y} = X$. Then $f: X \to X$ is topologically transitive if and only if 
$f: Y \to Y$ is topologically transitive. 


Proof of Theorem 1: Assume that $f$ on $X$ is topologically transitive. Let $U$ and $V$ 
be arbitrary open sets in $X$. Then $k \in \mathbb{N}$ exists such that $f^k(U) \cap V \ne \emptyset$. Let $U' = 
U \cap Y$ and $V' = V \cap Y$. Since $U'$ is dense in $U$, then due to continuity of $f$, 
$f^k(U')$ is dense in $f^k(U)$, and finally invariance of $Y$ ensures that $f^k(U') \cap V' \ne \emptyset$. 

Assume that $f$ restricted to $Y$ is topologically transitive. Let $U$ and $V$ be 
arbitrary open sets in $X$, and let $U' = U \cap Y$ and $V' = V \cap Y$. Then $k \in \mathbb{N}$ exists such 
that $f^k(U') \cap V' \ne \emptyset$, and hence $f^k(U) \cap V \ne \emptyset$. □ 


Theorem 2. Let $f$ be a continuous transformation of a bounded metric space $(X, d)$. 
Let $f: Y \to Y$ where $\overline{Y} = X$. Then $f: X \to X$ exhibits sensitive dependence on the 
initial conditions if and only if $f: Y \to Y$ exhibits sensitive dependence on the initial 
conditions. 


Proof of Theorem 2: We first show that sensitive dependence on the initial 
conditions extends to the invariant subset $Y$. A sensitive dependence constant 
$\delta > 0$ exists such that in any neighborhood $N(x)$ of any $x \in Y$, $y \in X$ and $k \in \mathbb{N}$ 
exist such that $d(f^k(x), f^k(y)) > \delta$. Continuity of $f$ guarantees the existence of a 
sequence $(y_n)$ in $Y$ with the limit point $y$. In particular the sequence $(y_n)$ can be 
chosen such that all elements are contained in an open ball located in the interior 
of $N(x)$. Since $f$ is continuous, the sequence $(f^k(y_n))$ has the limit point $f^k(y)$. 
Then we can choose a $y'$ from $(y_n)$ arbitrarily close to $y$ such that 
$d(f^k(x), f^k(y')) > \delta/2$. 

Next we show that sensitive dependence on the initial conditions extends to the 
closure of $Y$. For $f: Y \to Y$ we have a sensitive dependence constant $\delta > 0$. Pick 
any $x \in X \setminus Y$, and let $N(x)$ be an arbitrary neighborhood of $x$. A point $x_1 \in 
N(x) \cap Y$ exists. $N(x)$ is a neighborhood of $x_1$, and since $f: Y \to Y$ exhibits 
sensitive dependence on the initial conditions, $x_2 \in N(x) \cap Y$ and $k \in \mathbb{N}$ exist 
such that $d(f^k(x_1), f^k(x_2)) > \delta$. The orbit of $x$ cannot follow the orbit of both $x_1$ 
and $x_2$, since they separate, and hence $d(f^k(x), f^k(x')) > \delta/4$ where $x' \in \{x_1, x_2\}$. □ 

Let us describe some consequences of the two theorems. Take a dynamical 
system that is chaotic according to Devaney’s definition. Consider the restriction of 
the dynamics to the set of periodic points, which is clearly invariant. Theorems 1 
and 2 imply that the restricted dynamical system is topologically transitive, and 
that the system exhibits sensitive dependence on the initial conditions. Together 
with the trivially fulfilled condition of denseness of periodic points, this implies 
that the system is chaotic. Due to the lack of nonperiodicity this is not the kind of 
system most people would consider labeling chaotic. It may, of course, be argued 
that since for any periodic point there is another periodic point close by with 
arbitrarily high period, this is a kind of chaos. Nevertheless, insisting 
on the distinction between a truly nonperiodic orbit, say, a dense orbit, and a 
periodic orbit of even arbitrarily high period, such a system should not be termed 
chaotic. As another example let us consider a system from which we have removed 
all periodic points as well as their preimages, for instance the logistic map $x \mapsto 
4x(1 - x)$ on the unit interval. According to Theorems 1 and 2 the restricted map 
is topologically transitive, and it exhibits sensitive dependence on the initial 
conditions. This system, on the other hand, is full of nonperiodic orbits and 
consequently deserves the label chaotic although the system has no periodic 
points. 
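
Sensitive dependence for the logistic map $x \mapsto 4x(1 - x)$ can be observed numerically. The sketch below is a floating-point illustration only, not part of the argument in the note; the function name `first_separation`, the starting points, and the threshold 0.25 are assumptions chosen for the example.

```python
def logistic(x):
    """One step of the logistic map x -> 4x(1 - x) on [0, 1]."""
    return 4.0 * x * (1.0 - x)

def first_separation(x, y, delta, max_steps=1000):
    """Return the first k with |f^k(x) - f^k(y)| > delta, or None if none is found."""
    for k in range(1, max_steps + 1):
        x, y = logistic(x), logistic(y)
        if abs(x - y) > delta:
            return k
    return None

# Two starting points that differ by 1e-9 (an arbitrary illustrative choice).
k = first_separation(0.2, 0.2 + 1e-9, delta=0.25)
print("orbits first differ by more than 0.25 at step", k)
```

Since $|f(x) - f(y)| = 4|x - y|\,|1 - x - y| \le 4|x - y|$ on the unit interval, a gap of $10^{-9}$ needs at least 14 steps to exceed 0.25, which gives a lower bound on the step the code reports.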

As a consequence of the results presented in this note we therefore propose the 
following definition of chaos which excludes chaos without nonperiodicity, but 
allows, for instance, the example of the restricted logistic map to be considered as 
chaotic: 


Proposed definition of chaos. Let $f$ be a continuous transformation of a bounded 
metric space $(X, d)$. If $f$ has a dense orbit in $X$ and $f$ exhibits sensitive dependence on 
the initial conditions, then $f$ is said to be chaotic. 


A Reverse Stolarsky’s Inequality 


Josip Pečarić 


K. B. Stolarsky [1] proved the following result: 
If $0 \le g(x) \le 1$ and $g$ is nonincreasing on $[0, 1]$, then, for all positive numbers $a$ 
and $b$, it follows that 

$$\int_0^1 g\left(x^{1/(a+b)}\right) dx \ge \int_0^1 g\left(x^{1/a}\right) dx \int_0^1 g\left(x^{1/b}\right) dx.$$ (1) 

Stolarsky also observed that by introducing the quotient $Q(g, p)$ as 

$$Q(g, p) = \int_0^1 g(x)\,x^{p-1}\,dx \Big/ \int_0^1 x^{p-1}\,dx,$$ 

we can make a change of variables and formulate (1) as 

$$Q(g, a + b) \ge Q(g, a)\,Q(g, b).$$ (2) 

A very simple proof of Stolarsky’s inequality is given in [2]. 
Moreover, if we only suppose that $g$ is a nonnegative nonincreasing function, 
using the substitution $g(x) \to g(x)/g(0)$, we get from (2) 

$$g(0)\,Q(g, a + b) \ge Q(g, a)\,Q(g, b).$$ (3) 

In this paper we shall prove a reverse inequality. 


Theorem. If $g$ is a nonnegative nondecreasing function on $[0,1]$, then for all positive 
numbers $a$ and $b$, the reverse inequality in (3) is valid. 


Proof: Note that 

$$Q(g, p) = p \int_0^1 g(x)\,x^{p-1}\,dx = p\left[g(x)\,\frac{x^p}{p}\right]_0^1 - \int_0^1 x^p\,dg(x) = g(1) - \int_0^1 x^p\,dg(x).$$ (4) 

Also, it is obvious that we have 

$$0 \le \int_0^1 x^p\,dg(x) \Big/ \int_0^1 dg(x) \le 1.$$ (5) 

The following result is valid: 

$$(P + Q)(P + QAB) \le (P + QA)(P + QB)$$ (6) 

whenever $A \le 1$, $B \le 1$ and $PQ \le 0$. In fact, the inequality (6) follows from the 
following equality 

$$(1 - A)(1 - B)PQ = (P + Q)(P + QAB) - (P + QA)(P + QB).$$ 

Without loss in generality we can suppose that $g(0) - g(1) = -1$. In view of (5) 
we can apply (6) with $P = g(1)$, $Q = g(0) - g(1)$ ($= -1$), $A = \int_0^1 x^a\,dg(x)$ and 
$B = \int_0^1 x^b\,dg(x)$, to find that 

$$g(0)\left[g(1) - \int_0^1 x^a\,dg(x) \int_0^1 x^b\,dg(x)\right] \le \left[g(1) - \int_0^1 x^a\,dg(x)\right]\left[g(1) - \int_0^1 x^b\,dg(x)\right] = Q(g, a)\,Q(g, b).$$ (7) 

A classical integral Čebyšev inequality (see, for example, [3, pp. 239-293]) gives: 

$$\int_0^1 x^a\,dg(x) \int_0^1 x^b\,dg(x) \le \int_0^1 x^{a+b}\,dg(x).$$ (8) 

By combining (7) and (8) we get 

$$Q(g, a)\,Q(g, b) \ge g(0)\left\{g(1) - \int_0^1 x^{a+b}\,dg(x)\right\} = g(0)\,Q(g, a + b),$$ 

and the proof is complete. 
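
The reverse inequality can be checked numerically for a concrete function. The sketch below is an illustration under assumptions of our own: the test function $g(x) = 1 + x^2$ and the helper names `Q` and `reverse_stolarsky_gap` are not from the paper, and the integral is approximated by the midpoint rule (so $p \ge 1$ is assumed, to avoid the singularity of $x^{p-1}$ at 0).

```python
def Q(g, p, n=20000):
    """Approximate Q(g, p) = p * integral_0^1 g(x) x^(p-1) dx by the midpoint rule."""
    h = 1.0 / n
    total = sum(g((i + 0.5) * h) * ((i + 0.5) * h) ** (p - 1) for i in range(n))
    return p * total * h

def reverse_stolarsky_gap(g, a, b):
    """Return Q(g,a) Q(g,b) - g(0) Q(g,a+b); the Theorem asserts this is >= 0."""
    return Q(g, a) * Q(g, b) - g(0.0) * Q(g, a + b)

def g(x):
    """A nonnegative, nondecreasing test function on [0, 1] (illustrative choice)."""
    return 1.0 + x * x

for a, b in [(1, 1), (2, 3), (1.5, 4)]:
    print(a, b, reverse_stolarsky_gap(g, a, b))
```

For this choice of $g$ one has $Q(g, p) = 1 + p/(p + 2)$ in closed form, which gives an independent check on the quadrature.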
A Note on Some Irrational Decimal 
Fractions 


A. McD. Mercer 

Let $1 \le a_1 < a_2 < \cdots$ be a strictly increasing sequence of positive integers and 
write Dec$\{a_k\}$ to mean the decimal fraction $0.(a_1)(a_2)\ldots$ . Recently the following 
result was proved in [1]: 

Theorem A. If 

$$\sum_{k=1}^{\infty} \frac{1}{a_k} = \infty,$$ 

then Dec$\{a_k\}$ is irrational. 

Taking $a_k$ to be the $k$th prime this theorem provides a new proof that the 
decimal 0.2357111317... is irrational (see also [2] and [3]). On the other hand, it 
does not throw any light on the nature of, say, the number Dec$\{k^2\}$. Our purpose 
in this note is to prove the following generalization. 

Theorem 1. If there is an integer $r \ge 0$ such that the series $\sum_{k=1}^{\infty} k^r / a_k$ diverges, 
then Dec$\{a_k\}$ is irrational. 


This theorem assures us, for example, that all of the numbers Dec$\{k^m\}$, $m = 
1, 2, \ldots$, are irrational. 

It should also be noted that the proof of Theorem 1, specialized to the case 
$r = 0$, differs from that of Theorem A and so provides a new proof of that result. 
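
To make the notation Dec$\{a_k\}$ concrete, the following Python sketch builds the leading digits of such a decimal by concatenation. It is only an illustration of the definition; the helper names `dec_prefix` and `primes` are hypothetical.

```python
def dec_prefix(terms):
    """Return the decimal string 0.(a_1)(a_2)... formed by concatenating the given terms."""
    return "0." + "".join(str(a) for a in terms)

def primes(count):
    """Return the first `count` primes by trial division (adequate for a small illustration)."""
    found = []
    candidate = 2
    while len(found) < count:
        if all(candidate % p for p in found):
            found.append(candidate)
        candidate += 1
    return found

print(dec_prefix(primes(8)))                        # leading digits of Dec{primes}
print(dec_prefix([k * k for k in range(1, 6)]))     # leading digits of Dec{k^2}
```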


Proof of Theorem 1: Let us suppose that $x = 0.(a_1)(a_2)\ldots$ is a rational number 
which, written in the usual decimal form, reads $x = 0.b_1b_2\ldots$ ($0 \le b_i \le 9$). Since 
$x$ is rational its decimal expansion will be periodic (at least after a first block of 
digits). Let the period be $p$. To fix ideas we shall assume that the periodic 
behaviour commences immediately after the decimal point. (If this were not the 
case we could simply move the decimal point sufficiently to the right and treat the 
fractional part of the new number.) 

Let $N_k$ be the number of digits in $a_k$. Clearly $N_k \le N_{k+1}$ for each $k$ and since 
the left-most digit of $a_k$ is at least 1 then 

$$a_k \ge 10^{N_k - 1} \quad (= c_k, \text{ say}).$$ (1) 

Let the values taken by the sequence $\{N_k\}_1^{\infty}$ be $V_1 < V_2 < \cdots$ and let the value $V_j$ 
be taken exactly $m_j$ times. We note that: 

(i) If $N_k = N_{k+1}$ then $c_{k+1} = c_k$ and if $N_k < N_{k+1}$ then $c_{k+1} \ge 10c_k$. (2) 

(ii) At most $p$ consecutive $N_k$’s can have the same value. To see this we observe 
that $N_k = N_{k+1} = \cdots = N_{k+p}$ ($= N$ say) would imply $a_k = a_{k+p}$ because $pN$ is a 
period of the decimal representation of $x$. Hence 

$$m_j \le p.$$ (3) 

(iii) If the jump $N_k < N_{k+1}$ occurs at the $k$-values $k = k_1, k_2, k_3, \ldots$ then 

$$k_j = m_1 + m_2 + \cdots + m_j.$$ (4) 

Let $r \ge 0$ be an integer. According to (1) and (4) the partial sums of the series 
$\sum_{k=1}^{\infty} (k^r / a_k)$ do not exceed 

$$\frac{m_1 (m_1)^r}{C_1} + \frac{m_2 (m_1 + m_2)^r}{C_2} + \frac{m_3 (m_1 + m_2 + m_3)^r}{C_3} + \cdots,$$ 

where $C_j = 10^{V_j - 1}$ is the common value of $c_k$ for those $k$ with $N_k = V_j$, 
and this series converges since by (2) and (3) it is dominated by the series 

$$\frac{p^{r+1}}{C_1} \sum_{k=1}^{\infty} \frac{k^r}{10^{k-1}}.$$ 

Hence $\sum_{k=1}^{\infty} (k^r / a_k)$ is convergent. This completes the proof of the theorem. 


UNSOLVED PROBLEMS 


Edited by: Richard Guy and Richard Nowakowski 


In this department the MONTHLY presents easily stated unsolved problems dealing 
with notions ordinarily encountered in undergraduate mathematics. Each problem 
should be accompanied by relevant references (if any are known to the author) and by 


a brief description of known partial or related results. Typescripts should be sent to 
Richard Guy, Department of Mathematics & Statistics, The University of Calgary, 
Alberta, Canada T2N 1N4. 


A Possible Permanent Formula 


David Callan 


The permanent of an $n$-square matrix $A = (a_{ij})$ is defined by per $A = 
\sum_{\sigma \in S_n} \prod_{i=1}^{n} a_{i\sigma(i)}$. The sum of the permanents of all $\binom{n}{r}^2$ $r$ by $r$ submatrices of $A$ is 
denoted by $\sigma_r(A)$. Equivalently, $\sigma_r(A)$ is the sum of all products of $r$ elements 
from $A$, no two in the same row or column. Thus if $J$ is the $n$ by $n$ matrix of 1’s, 
per$(xJ + A) = \sum_{r=0}^{n} r!\,\sigma_{n-r}(A)\,x^r$, providing a sort of generating function for the 
$\sigma_r(A)$ [we take $\sigma_0(A) = 1$]. 
In contrast to its cousin, the determinant, few simple explicit formulas are known for 
the permanents of specific matrices. One of these few is Scott’s formula: for 
$S = (1/(w^{2j-2} - w^{2k-1}))_{j,k=1}^{n}$ where $w = e^{i\pi/n}$, it is known [1, 2, 3] that per $S = 
(-1)^{(n-1)/2}\, n\,(1 \cdot 3 \cdot 5 \cdots (n - 2))^2 / 2^n$ if $n$ is odd, $= 0$ if $n$ is even. The proofs all 
use a classical result of Borchardt [4] (unfortunately of rather limited scope) that 
converts the problem to an evaluation of determinants, and this evaluation relies 
on the tractability of circulant matrices. (The $n$ by $n$ matrix $(a_{jk})$ is circulant if it is 
constant along each of its $n$ extended diagonals, that is, if $a_{jk}$ depends only on the 
value of $k - j$ modulo $n$.) 
Now let $D$ denote the diagonal matrix of $n$th roots of unity, diag$(1, w^2, 
w^4, \ldots, w^{2n-2})$, and consider the (circulant) matrix $C = i(J - 2DS)$, for which 
$c_{jk} = \cot((2(k - j) + 1)\pi/2n)$. The following remarkably simple formula for 
per$(xJ + C)$ has been verified by the author using Mathematica® for $n$ up to 7. 


Conjecture. For $C$ as above, 

$$\mathrm{per}(xJ + C) = n!\left(x^n + \frac{n}{2}\,x^{n-2} + \frac{n(n - 2)}{2 \cdot 4}\,x^{n-4} + \frac{n(n - 2)(n - 4)}{2 \cdot 4 \cdot 6}\,x^{n-6} + \cdots\right).$$ (1) 

(The last term is $\frac{n(n - 2) \cdots 3}{2 \cdot 4 \cdots (n - 1)}\,x$ if $n$ is odd, and is simply 1 if $n$ 
is even.) 


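Replacing $x$ by $-i$ in (1) and using the identity (Exercise: prove it!) 

$$1 + \sum_{j=1}^{\lfloor n/2 \rfloor} (-1)^j\, \frac{n(n - 2) \cdots (n - 2j + 2)}{2 \cdot 4 \cdots 2j} = (-1)^{(n-1)/2}\, \frac{(1 \cdot 3 \cdot 5 \cdots (n - 2))^2}{(n - 1)!} \quad (n \text{ odd})$$ 

yields Scott’s formula, which tends to make the conjecture credible. (In this 
identity both sides are zero if $n$ is even.) 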
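
The conjectured formula (1) is easy to test numerically for small $n$. The sketch below is a brute-force check, not the author's Mathematica computation: the function names are hypothetical, the permanent is expanded over all $n!$ permutations, and floating-point cotangents are used, so agreement is only up to rounding.

```python
import math
from itertools import permutations

def permanent(a):
    """Permanent by direct expansion over all permutations (fine for small n)."""
    n = len(a)
    return sum(math.prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))

def cot_matrix(n):
    """The matrix C with entries c_jk = cot((2(k - j) + 1) pi / 2n)."""
    return [[1.0 / math.tan((2 * (k - j) + 1) * math.pi / (2 * n)) for k in range(n)]
            for j in range(n)]

def conjectured(n, x):
    """Right side of (1): n! * sum_j coef_j x^(n-2j), coef_j = n(n-2)...(n-2j+2)/(2*4*...*2j)."""
    total, coef = 0.0, 1.0
    for j in range(n // 2 + 1):
        if j > 0:
            coef *= (n - 2 * j + 2) / (2 * j)
        total += coef * x ** (n - 2 * j)
    return math.factorial(n) * total

def per_xJ_plus_C(n, x):
    """Left side of (1): per(xJ + C), where J is the all-ones matrix."""
    c = cot_matrix(n)
    return permanent([[x + c[j][k] for k in range(n)] for j in range(n)])

for n in range(1, 6):
    print(n, per_xJ_plus_C(n, 0.7), conjectured(n, 0.7))
```

The evaluation point $x = 0.7$ is arbitrary; comparing at several values of $x$, or comparing the coefficients $r!\,\sigma_{n-r}(C)$ one at a time, would give a stronger check.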

We can at least establish the vanishing of $\sigma_r(C)$ that is implied by (1) for odd $r$. 
The first row of $C$ looks like $(c_1, c_2, \ldots, c_m, 0, -c_m, -c_{m-1}, \ldots, -c_1)$ [$n$ odd] or 
$(c_1, c_2, \ldots, c_m, -c_m, -c_{m-1}, \ldots, -c_1)$ [$n$ even]. A cyclic permutation of $C$’s 
columns does not affect $\sigma_r(C)$ or $C$’s circulant-ness, so the following proposition 
covers both cases. 


Proposition 1. Suppose $A$ is an $n$-square circulant matrix whose first row is 
$(0, a_1, a_2, \ldots, a_m, -a_m, -a_{m-1}, \ldots, -a_1)$ [$n$ odd] or $(a_1, a_2, \ldots, a_m, 
-a_m, -a_{m-1}, \ldots, -a_1)$ [$n$ even]. 

Then $\sigma_r(A) = 0$ for odd $r$. 


Proof: Consider the set $L$ of $n^2$ positions (or places) in the matrix $A$. Suppose we 
can exhibit a bijection $\psi: L \to L$ with the following two properties: 

(i) If places $l_1, l_2$ in $L$ are not on the same line of $A$ [{lines} = {rows} ∪ 
{columns}], then neither are $\psi l_1, \psi l_2$. 

(ii) If $a$ is the entry in place $l$, then $-a$ is the entry in place $\psi l$. 

Then we are done because $\psi$ induces a mapping on the set of products that go 
into $\sigma_r(A)$ [by (i)] that is actually a permutation [since $\psi$ is] and sign-changing if $r$ 
is odd [by (ii)]. So everything cancels out. If $n$ is odd, the operation of transposition 
clearly fits the bill (and happens to be an involution). If $n$ is even, the operation to 
use is: transpose first, then drop down one place (the first row being considered 
the successor of the last row). In symbols, $\psi(j, k) = (k + 1, j)$ [arithmetic modulo 
$n$]. The easy verification that $\psi$ has the requisite properties is left to the reader. 

For Scott’s matrix $S$ itself, an analogous formula is fairly easily established: 
per$(xJ + S) = n!\,x^n + $ per $S$. This is equivalent to: $\sigma_r(S) = 0$ for $0 < r < n$, which 
follows on taking $\theta = w^{-2}$ and $B = \frac{1}{2}(J + iC)$ in the following proposition. 


Proposition 2. Let $B$ be a circulant $n$ by $n$ matrix, $\theta$ a primitive $n$th root of unity, 
and $D = $ diag$(1, \theta, \theta^2, \ldots, \theta^{n-1})$. 
Then $\sigma_r(DB) = 0$ for $0 < r < n$. 


Proof: Let $Q_{r,n}$ denote the set of increasing sequences of $r$ integers, $\alpha(1) < \alpha(2) 
< \cdots < \alpha(r)$ contained in $[1, n]$, and for $\alpha, \beta \in Q_{r,n}$, let $A[\alpha|\beta]$ denote the $r$ by 
$r$ submatrix of $A$ whose rows (resp. columns) are indexed by $\alpha$ (resp. $\beta$). Thus 
$\sigma_r(A) = \sum_{\alpha, \beta \in Q_{r,n}}$ per $A[\alpha|\beta]$. Finally, let $T$ denote the shift operator on $Q_{r,n}$, 
that is, for $\alpha = (\alpha(j))_{j=1}^{r} \in Q_{r,n}$, to obtain $T\alpha$ add 1 to each term in $\alpha$ and, if 
necessary, reduce modulo $n$ and cyclically permute to ensure ascending order. For 
example, for $\alpha = (1, 2, 4) \in Q_{3,5}$, $T\alpha = (2, 3, 5)$ while for $\alpha = (1, 2, 4) \in Q_{3,4}$, $T\alpha 
= (1, 2, 3)$. 
Define two sequences in $Q_{r,n}$ to be equivalent if some number of shifts 
transforms one to the other. Then $Q_{r,n}$ is clearly partitioned into equivalence 
classes, each of size dividing $n$. Let $(\alpha_i)_{i \in I}$ be a set of representatives of these 
equivalence classes, and let $n_i$ denote the size of class $i$. Note that $rn_i$ is always 
a multiple of $n$ (to see this, add the entries of the identical lists (modulo $n$) $\alpha_i$ 
and $T^{n_i}\alpha_i = \{\alpha_i(j) + n_i\}_{j=1}^{r}$). Due to the circulant nature of $B$, it is easy to see 
that per $DB[T\alpha|T\beta] = \theta^r$ per $DB[\alpha|\beta]$ and so, since $T$ is a bijection, 
$\sum_{\beta \in Q_{r,n}}$ per $DB[T\alpha|\beta] = \sum_{\beta \in Q_{r,n}}$ per $DB[T\alpha|T\beta] = \theta^r \sum_{\beta \in Q_{r,n}}$ per $DB[\alpha|\beta]$. 
Hence $\sigma_r(DB) = \sum_{i \in I} \sum_{j=0}^{n_i - 1} \sum_{\beta \in Q_{r,n}}$ per $DB[T^j\alpha_i|\beta] = \sum_{i \in I} \left(\sum_{j=0}^{n_i - 1} \theta^{rj}\right) \sum_{\beta \in Q_{r,n}}$ 
per $DB[\alpha_i|\beta] = 0$, the parenthesized factor vanishing for $0 < r < n$. 

Added in Proof. The author announces a proof of this conjecture. Details will appear elsewhere. 


Teachers 
Dear Editor: 


re Hungerford’s Future Elementary 
Teachers (January 1994). 


Why is it never suggested that 
mathematics in the elementary schools, 
like music, should be taught by teachers 
who specialize in that technical discipline? 


It is highly unlikely we will be able to 
bring all elementary teachers up to the 
required level of competence. But it is 
possible we could find a few of these 
teachers with a real talent for mathematics 
and then train them. 


J. H.C. Creighton 
33 Garces Drive 
San Francisco, CA 94132 


PROBLEMS AND SOLUTIONS 


Edited by: 
Richard T. Bumby, Fred Kochman and Douglas B. West 


Proposed problems should be sent to the MONTHLY PROBLEMS address given on 
the inside front cover. Please include solutions, relevant references, etc. Three copies 
are requested. 


Solutions of published problems should arrive before November 30, 1994 at the 
MONTHLY PROBLEMS address given on the inside front cover. Solutions should be 
typed with double spacing, including the problem number and the solver’s name and 
mailing address. Two copies suffice. A self-addressed postcard or label should be 
included if an acknowledgment is desired. 


An asterisk (* ) after the number of a problem, or part of a problem, indicates that 
no solution is currently available. Partial solutions will be useful in such cases. 
Otherwise, the published solution is likely to be based on a solution which is complete 
and correct. Of course, an elegant partial solution or a method leading to a more 
general result is always useful and welcome. In addition, references to other 
appearances of MONTHLY problems or to solutions of these problems in the 
literature are also solicited. 


PROBLEMS 


10389. Proposed by Raphael M. Robinson, University of California, Berkeley, CA. 


Find all solutions of the equation 

$$a_1^{a_2^{\cdot^{\cdot^{\cdot^{a_m}}}}} = b_1^{b_2^{\cdot^{\cdot^{\cdot^{b_n}}}}}$$ 

where $m \ge 1$, $n \ge 1$, the $a_i$ and $b_j$ are integers with $2 \le a_i \le 4$ and $2 \le b_j \le 4$, 
and $a_1 \ne b_1$. 


10390. Proposed by Ognian Enchev, Boston University, Boston, MA. 


A standard deck of 52 playing cards is arranged at random in 4 rows and 13 
columns. Show that with finitely many transpositions of cards of the same value 
(e.g., 7♣ and 7♥, K♦ and K♠, and so on) all cards can be arranged in such a way 
that each column contains one club, one diamond, one heart and one spade. 


10391. Proposed by Emre Alkan (student), Bosphorus University, Istanbul, Turkey, 
and the editors. 

If $a_1, a_2, \ldots, a_n$ are real numbers with $a_1 \ge a_2 \ge \cdots \ge a_n$, and if $\varphi$ is a 
convex function defined on the closed interval $[a_n, a_1]$, then 

$$\sum_{k=1}^{n} \varphi(a_k)\,a_{k+1} \ge \sum_{k=1}^{n} \varphi(a_{k+1})\,a_k$$ 

with the convention that $a_{n+1} = a_1$. 
10392. Proposed by Murray S. Klamkin, University of Alberta, Edmonton, Alberta, 
Canada. 
Determine the extreme values of 

$$\frac{1}{1 + x + u} + \frac{1}{1 + y + v} + \frac{1}{1 + z + w}$$ 

where $xyz = a^3$, $uvw = b^3$, and $x, y, z, u, v, w > 0$. 


10393. Proposed by Jean Anglesio, Garches, France. 
Show that 

$$\int_0^{\infty} \frac{e^{-ax}(1 - e^{-x})^n}{x^r}\,dx = \frac{(-1)^r}{(r - 1)!} \sum_{k=0}^{n} (-1)^k \binom{n}{k} (a + k)^{r-1} \log(a + k)$$ 

where $a \ge 0$ and $1 \le r \le n$ (except for $a = 0$, $r = 1$). 


10394. Proposed by Ignacy I. Kotlarski, Oklahoma State University, Stillwater, OK. 


Let $N, Z_1, Z_2, \ldots$ be a sequence of independent random variables, where $N$ 
follows the geometric distribution with probabilities 

$$P(N = n) = p(1 - p)^{n-1}$$ 

for $n = 1, 2, \ldots$ with $0 < p < 1$, while the $Z_j$ are identically distributed complex 
random variables $Z_j = X_j + iY_j$ where $(X_j, Y_j)$ have density 

$$f(x, y) = \begin{cases} \dfrac{a}{2\pi}\,|z|^{a-2} & \text{for } |z| \le 1, \\ 0 & \text{for } |z| > 1, \end{cases}$$ 

where $z = x + iy$ and $a > 0$. Find the distribution of 
$W = Z_1 \cdot Z_2 \cdots Z_N$. 


10395. Proposed by F. G. Boese, Max-Planck-Institut für extraterrestrische Physik, 
München, Germany. 


Show that a bit more than 45.6% of all zeros of 
$$f(s) = 1 + 2^{-s} + 3^{-s}$$ 
lie in the open halfplane $\Re(s) < 0$. More precisely, show that these zeros lie in the 
vertical strip $-1 < \Re(s) < 0$. 


NOTES 


Notes: (10391) This includes the special case in which $a_n > 0$ and $\varphi(x) = x^c$ for 
integer $c > 1$, which may be studied by algebraic means. (10395) Although the 
series $\sum n^{-s}$ only converges for $\Re(s) > 1$, there are formulas relating the Riemann 
Zeta function in the critical strip $0 < \Re(s) < 1$ to the partial sums $\zeta_n(s) = 1 + 
2^{-s} + \cdots + n^{-s}$ (see E. C. Titchmarsh, The Theory of the Riemann Zeta-function, 
chapter IV). On the other hand, there is no reason to expect that the set of all 
zeros of the partial sums would be related to the set of zeros of $\zeta(s)$. This problem 
gives a strong quantitative difference in the first non-trivial case. 


SOLUTIONS 


Positive Solutions of a Matrix Equation 


6666[1991, 655]. Proposed by Marcel F. Neuts, University of Arizona, Tucson, AZ. 


Suppose $A$ and $B$ are $n$ by $n$ matrices with positive real entries, suppose $A$ is 
non-singular, and let $a$ denote the largest eigenvalue of $BA$. 
(a) Prove that there exists an $n$ by $n$ matrix $X$ with positive entries satisfying 

$$X = A + XBX$$ (*) 

if and only if $a \le 1/4$. 

(b) If $a < 1/4$, prove that 

(i) there is a solution $X_0$ of (*) with positive entries such that if $X_1$ is any other 
solution of (*) with positive entries, then all entries of $X_1 - X_0$ are non-negative; 

(ii) the maximum eigenvalue of $BX_0$ is $(1 - \sqrt{1 - 4a})/2$; 

(iii) if $X_1$ is any solution of (*) with positive entries other than the minimal 
solution $X_0$, then the maximum eigenvalue of $BX_1$ is $(1 + \sqrt{1 - 4a})/2$. 


Solution by R. Holzsager, The American University, Washington D.C. The following 
properties of matrices with positive entries are used: 


1. There is a positive eigenvalue, called dominant, which exceeds the absolute 
value of any other eigenvalue. 

2. There is a unique (up to scalar multiplication) eigenvector with positive 
entries; it corresponds to the dominant eigenvalue. 

3. If $r$ is the dominant eigenvalue of the positive matrix $M$, then the sequence 
$(r^{-n}M^n)$ converges. 

4. If $v$ is any positive vector, and $tv \le Mv$ (coordinate-wise) then $t \le r$; if 
$t = r$, then $v$ is an eigenvector. 

These facts are all part of Perron-Frobenius theory and can be found in F. R. 
Gantmacher, The Theory of Matrices, Vol. 2, Chelsea, 1960 (chapter XIII, section 
2, pp. 53-66), or similar books. 


Proof of (a). From $X = A + XBX$, $BX = BA + (BX)^2$. If $v$ is the positive eigenvector 
of the positive matrix $BX$, and $r$ is the dominant eigenvalue, then $BAv = 
BXv - (BX)^2 v = (r - r^2)v$, so $v$ is also a positive eigenvector of $BA$. By uniqueness, 
$a = r - r^2$. Since $r$ is real (in fact, positive), $a \le 1/4$. 

Conversely, if $a \le 1/4$, then the series 

$$\sum_{n \ge 0} \binom{2n}{n} \frac{a^n}{n + 1}$$ 

converges to $(1 - \sqrt{1 - 4a})/2a$. Since $a^{-n}(BA)^n$ is uniformly bounded, 

$$\sum_{n \ge 0} \binom{2n}{n} \frac{(BA)^n}{n + 1}$$ 

converges to some matrix $U$. Multiplying series gives $U = I + BAU^2 = I + UBAU$, 
so $AU = A + AUBAU$, and $X = AU$ is our desired solution. 


Proof of (b). i) Let $X_0$ be the solution $AU$ that we just found. If $X_1$ is another
solution with non-negative entries, then starting with $X_1 \ge 0$ (entry-wise), and
repeatedly substituting in $A + X_1BX_1$, we get an inductive proof that $X_1$ exceeds
every partial sum of the series $X_0 = A + ABA + 2ABABA + 5ABABABA + \cdots$.
Passing to the limit gives $X_1 \ge X_0$.

ii) $BX_0 = BAU = \sum_{n=0}^{\infty} \binom{2n}{n} (BA)^{n+1}/(n+1)$. If $v$ is the positive eigenvector of $BA$,
then it is also an eigenvector of $BX_0$, with eigenvalue

$$\sum_{n=0}^{\infty} \binom{2n}{n} \frac{\alpha^{n+1}}{n+1} = \frac{1 - \sqrt{1 - 4\alpha}}{2}.$$

By uniqueness again, this is the dominant eigenvalue of $BX_0$.

iii) If $X_1$ is a non-negative solution, and $w$ is the positive eigenvector of $BX_1$,
with eigenvalue $r$, then $w$ is also an eigenvector of $BA = BX_1 - (BX_1)^2$, with
eigenvalue $r - r^2$. By uniqueness, this must equal $\alpha$, so $r$ must equal
$(1 \pm \sqrt{1 - 4\alpha})/2$. If the minus sign holds, then $X_1 - X_0$ is a non-negative
matrix by (i), and annihilates the positive vector $v$; it is therefore 0. Thus, if $X_1$ is
distinct from $X_0$, the plus sign applies.
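
As a numerical illustration of part (b)(ii), the following sketch checks the eigenvalue formula on one 2 by 2 example. It does not sum the series used in the proof; instead it assumes the simpler fixed-point iteration $X \leftarrow A + XBX$ started at $X = 0$, whose iterates increase entry-wise and are bounded by every non-negative solution. The matrices and the helper names (`matmul`, `dominant_eigenvalue_2x2`, `minimal_solution`) are chosen here for illustration only.

```python
import math

def matmul(P, Q):
    """Product of two square matrices given as nested lists."""
    n = len(P)
    return [[sum(P[i][k] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def dominant_eigenvalue_2x2(M):
    """Largest eigenvalue of a 2 by 2 matrix with positive entries."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    return (tr + math.sqrt(tr * tr - 4 * det)) / 2

def minimal_solution(A, B, steps=200):
    """Iterate X <- A + X B X starting from X = 0 (illustrative scheme)."""
    n = len(A)
    X = [[0.0] * n for _ in range(n)]
    for _ in range(steps):
        XBX = matmul(matmul(X, B), X)
        X = [[A[i][j] + XBX[i][j] for j in range(n)] for i in range(n)]
    return X

# Example data (an assumption of this sketch, not taken from the problem).
A = [[0.1, 0.2], [0.3, 0.1]]
B = [[0.2, 0.1], [0.1, 0.3]]

alpha = dominant_eigenvalue_2x2(matmul(B, A))      # largest eigenvalue of BA
X0 = minimal_solution(A, B)
observed = dominant_eigenvalue_2x2(matmul(B, X0))  # largest eigenvalue of B X0
predicted = (1 - math.sqrt(1 - 4 * alpha)) / 2
print(alpha, observed, predicted)
```

For this choice of $A$ and $B$ the value of $\alpha$ is below 1/4, so the two printed eigenvalues should agree to within rounding error.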


Editorial comment. Some solvers used a normalized form of the problem in
which it was necessary to assume that $A$ was non-singular. The selected solution
can be seen not to require that hypothesis, although not all solvers taking this
approach noted that fact. Only Reinhard Wolf gave a method to prove the
existence of solutions with the larger eigenvalue. He notes that the set of matrices
$X$ with non-negative entries satisfying $BXv = \lambda v$ is a non-empty compact convex
set in the usual identification of $n$ by $n$ matrices with $\mathbb{R}^{n^2}$. When $v$ is the positive
eigenvector of $BA$ and $\lambda^2 - \lambda + \alpha = 0$, this set is taken into itself by
$X \mapsto A + XBX$. The Brouwer fixed point theorem then shows that the equation has a
solution. The equation shows that any solution with non-negative entries must have
all entries positive.


Solved also by D. Ž. Đoković (Canada), I. Kastanas, O. P. Lossers (The Netherlands), R. Wolf
(Austria), and the proposer. 
A Sequence of Squares


10211 [1992, 361]. Proposed by Herbert S. Wilf, University of Pennsylvania, Philadel- 
phia, PA. 


Choose integers $a, b, c, d$, and let $K = 2(b^2 + a^2d - abc)$. Show that every
member of the sequence defined by $y_0 = a^2$, $y_1 = b^2$ and

$$y_{n+1} = (c^2 - 2d)y_n - d^2 y_{n-1} + Kd^n \qquad (n \ge 1)$$

is the square of an integer.


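Composite solution I by Stephen M. Gagola, Jr., Kent State University, Kent, OH,
and O. P. Lossers, University of Technology, Eindhoven, The Netherlands. Define a
sequence of integers by $x_0 = a$, $x_1 = b$, and $x_{n+1} = cx_n - dx_{n-1}$ for $n \ge 1$. Then
set

$$M_n = \begin{pmatrix} x_n & x_{n-1} \\ x_{n+1} & x_n \end{pmatrix}.$$

Notice that

$$M_{n+1} = \begin{pmatrix} 0 & 1 \\ -d & c \end{pmatrix} M_n,$$

and hence $\det M_{n+1} = d \det M_n$. An easy induction then establishes that for all
$n \ge 1$,

$$x_n^2 - x_{n+1}x_{n-1} = \det M_n = d^{n-1} \det M_1 = Kd^{n-1}/2.$$

From the relation $(x_{n+1} + dx_{n-1})^2 = c^2 x_n^2$, we have

$$\begin{aligned}
x_{n+1}^2 &= c^2 x_n^2 - 2d\,x_{n+1}x_{n-1} - d^2 x_{n-1}^2 \\
&= c^2 x_n^2 - 2d(x_n^2 - Kd^{n-1}/2) - d^2 x_{n-1}^2 \\
&= (c^2 - 2d)x_n^2 - d^2 x_{n-1}^2 + Kd^n.
\end{aligned}$$

Since $x_0^2 = a^2 = y_0$ and $x_1^2 = b^2 = y_1$, it follows that $x_n^2 = y_n$ for all $n$.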
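
The identity $y_n = x_n^2$ is easy to spot-check with exact integer arithmetic. The sketch below builds both sequences from their recurrences and compares them term by term; the function name `squares_match` and the sample parameter values are illustrative choices, not part of the published solution.

```python
def squares_match(a, b, c, d, terms=15):
    """Check y_n == x_n**2 for n = 0..terms, using the two recurrences."""
    K = 2 * (b * b + a * a * d - a * b * c)
    x = [a, b]          # x_{n+1} = c*x_n - d*x_{n-1}
    y = [a * a, b * b]  # y_{n+1} = (c^2 - 2d)*y_n - d^2*y_{n-1} + K*d^n
    for n in range(1, terms):
        x.append(c * x[n] - d * x[n - 1])
        y.append((c * c - 2 * d) * y[n] - d * d * y[n - 1] + K * d ** n)
    return all(yn == xn * xn for xn, yn in zip(x, y))

# A few sample parameter choices, including negative values.
samples = [(1, 1, 4, 1), (2, -3, 5, 7), (0, 1, 1, -1), (3, 4, -2, -5)]
print(all(squares_match(*s) for s in samples))
```

Python integers are unbounded, so the comparison is exact however fast the terms grow.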


Composite solution II by Ilias Kastanas, California State University, Los Angeles,
CA, and R. Glenn Powers, Western Kentucky University, Bowling Green, KY. For
indeterminates $a$ and $b$ and fixed integers $c$ and $d$, consider the endomorphism $T$
of $\mathbb{Z}[a, b]$ generated by $T(a) = b$ and $T(b) = bc - ad$. Note that

$$T(K) = 2[(bc - ad)^2 + b^2 d - b(bc - ad)c] = 2(b^2 + a^2 d - abc)d = Kd.$$

Also $T(y_0) = b^2 = y_1$ and $T(y_1) = (bc - ad)^2 = y_2$. Assuming $T(y_j) = y_{j+1}$ for
all $j \le n$, we see that

$$\begin{aligned}
T(y_{n+1}) &= (c^2 - 2d)T(y_n) - d^2 T(y_{n-1}) + T(K)d^n \\
&= (c^2 - 2d)y_{n+1} - d^2 y_n + Kd^{n+1} \\
&= y_{n+2}.
\end{aligned}$$

It follows by induction that

$$y_n = T^n(y_0) = T^n(a^2) = (T^n(a))^2.$$


Editorial comment. Most solvers discovered the sequence of square roots given
in Solution I. David Zeitlin observed that this may also be established using
techniques explored in general in his paper, “Power identities for sequences
defined by $W_{n+2} = dW_{n+1} - cW_n$”, Fibonacci Quarterly, 3 (1965), 241-256.
Francisco Bellot and María Ascensión López pointed out that the particular case with
$a = b = d = 1$ and $c = 4$ was a problem from the 1987 Bulgarian Mathematical
Olympiad. A solution and generalization to that problem appeared in Crux
Mathematicorum, 16 (1990), 292-294.


Solved by 49 other readers and the proposer. 


Nearest Integer Zeta Functions 


10212 [1992, 361]. Proposed by Seung-Jin Bang, Seoul, Korea. 


Let $a(n)$ be the integer closest to $\sqrt[3]{n}$. Evaluate $\sum_{n=1}^{\infty} a(n)^{-4}$.


Composite solution by all solvers. The value is $\pi^2/2 + \pi^4/23040$. Indeed, for
$s > 3$, one has

$$\sum_{n=1}^{\infty} a(n)^{-s} = 3\zeta(s - 2) + 4^{-s}\zeta(s).$$

Here, $\zeta(s)$ denotes the Riemann zeta function. The values of $\zeta(2n)$ for positive
integers $n$ are easily computed rational multiples of $\pi^{2n}$.


Proof: The expression to be evaluated may be written as $\sum_{r=1}^{\infty} f(r)r^{-s}$ where $f(r)$
is the number of positive integers $n$ for which $\sqrt[3]{n}$ is closest to $r$. Thus $f(r)$ is the
number of positive integer solutions of

$$\left(r - \tfrac{1}{2}\right)^3 < n < \left(r + \tfrac{1}{2}\right)^3.$$

(The endpoints may be excluded since they cannot be integers.) This is an interval
of length $3r^2 + 1/4$, so it will contain either $3r^2$ or $3r^2 + 1$ integers (all positive if
$r > 0$). The number is $3r^2 + 1$ only if $(2r + 1)^3 \equiv 1 \pmod 8$, and this is true if and
only if $r \equiv 0 \pmod 4$. Thus

$$\sum_{n=1}^{\infty} a(n)^{-s} = \sum_{r=1}^{\infty} f(r)r^{-s} = \sum_{r=1}^{\infty} (3r^2)r^{-s} + \sum_{m=1}^{\infty} (4m)^{-s}
= 3\zeta(s - 2) + 4^{-s}\zeta(s).$$
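
The counting step can be confirmed by machine. The sketch below counts, in exact integer arithmetic, the integers $n$ with $(2r - 1)^3 < 8n < (2r + 1)^3$, compares the count with $3r^2$ plus one when $r$ is a multiple of 4, and then compares a partial sum for $s = 4$ with the stated closed form. The names `count_closest` and `partial_sum` and the cutoff 20000 are choices made for this illustration.

```python
import math

def count_closest(r):
    """Number of positive integers n whose cube root is closest to r."""
    # n qualifies when (2r-1)^3 < 8n < (2r+1)^3; the bounds are odd, so
    # neither endpoint is ever attained.
    lo = (2 * r - 1) ** 3 // 8 + 1
    hi = (2 * r + 1) ** 3 // 8
    return hi - lo + 1

def partial_sum(R, s=4):
    """Sum of f(r) * r**(-s) for r = 1..R."""
    return sum(count_closest(r) / r ** s for r in range(1, R + 1))

counts_ok = all(count_closest(r) == 3 * r * r + (1 if r % 4 == 0 else 0)
                for r in range(1, 1000))
closed_form = math.pi ** 2 / 2 + math.pi ** 4 / 23040
approx = partial_sum(20000)
print(counts_ok, approx, closed_form)
```

The neglected tail of the series is roughly $3/R$, so with $R = 20000$ the partial sum falls short of the closed form by about $1.5 \times 10^{-4}$.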


Editorial comment. Jonathan M. Borwein & Leo C. Hsu, Rick Mabry & Keith
Neu, Josef Roppert, Douglas B. Tyler, and B. M. M. de Weger considered the
more general sums $S_N(s) = \sum_{n=1}^{\infty} a_N(n)^{-s}$ where $a_N(n)$ is the integer closest to $\sqrt[N]{n}$.
The method employed above for $N = 3$ also gives $S_2(s) = 2\zeta(s - 1)$ and
$S_4(s) = 4\zeta(s - 3) + \zeta(s - 1)$. For larger $N$, Hurwitz zeta functions,
$\zeta(a, s) = \sum_{n=0}^{\infty} (n + a)^{-s}$, appear, and only Chu and the team of Borwein and Hsu persisted to give any
values of $S_N(n)$ with $N > 4$. The latter solution contains a proof that $S_N(n)$ is a
polynomial in $\pi$ whose coefficients are algebraic numbers whenever $n - N$ is an
odd integer. For example, with


170912 + 49928V2 246013 + 35366472 
O; = nn and Q,= ry 


one gets 


516 51° ar’ 1 , 1 \ 7° 

= — + — + | — — —,/— |—~ 

5(6) 6 36 945 Qs Va | qr 
4 6 7 


$7) =r +—+— + Qo— 
(7) = 7° + Te t+ F559 + Or 577- 


Solved by 65 readers (including those cited) and the proposer. One incorrect solution was received.


A Ring with No Nilpotents


10215 [1992, 362]. Proposed by Michael Barr, McGill University, Montreal, Quebec,
Canada.


Let $R$ be an associative ring (not necessarily commutative or possessing a unit
element) with no non-zero nilpotent elements. Suppose that $r$ and $s$ are two
elements of $R$ such that $r^d = s^d$ and $r^e = s^e$, where $d$ and $e$ are relatively prime
positive integers. Show that $r = s$.


Solution by Pat Stewart, Dalhousie University, Halifax, Nova Scotia, Canada.
Choose integers $a$ and $b$ such that $ad + be = 1$. Without loss of generality, we
may assume that $a > 0$ and $b < 0$. Since $r^d = s^d$, we have $r^{ad} = s^{ad}$ and hence
$r^{1-be} = s^{1-be}$. Also, since $r^e = s^e$, we have $r^{-be} = s^{-be}$. Thus there is a positive
integer $u$ such that $r^{1+u} = s^{1+u}$ and $r^u = s^u$.


Substituting for $s^u$, we see that $r^{1+u} = r^u s$, and hence $r^u(r - s) = 0$. Elements
$x, y$ of a ring with no non-zero nilpotent elements satisfy $xy = 0$ if and only if
$yx = 0$. Hence $r(r - s)r^{u-1} = 0$. Continuing in this fashion, one obtains by
induction that $(r(r - s))^u = 0$, and so $r(r - s) = 0$. Similarly, $s(r - s) = 0$. Hence
$(r - s)^2 = 0$, which implies $r - s = 0$, as desired.


Editorial comment. Several readers noted that the term reduced is used for rings
without nilpotent elements. D. D. Anderson sketched an argument to show that, in
a reduced ring, if $a_1 \cdots a_n = 0$ and $\sigma$ is any permutation of $\{1, \ldots, n\}$, then
$a_{\sigma(1)} \cdots a_{\sigma(n)} = 0$. Frank Schmidt noted that the problem follows from the
structure theorem for reduced rings: every reduced ring is a subdirect product of
domains. W. K. Nicholson made the same observation, including a reference to
the source of the theorem (Andrunakievič and Rjabuhin, Soviet Math. Doklady 9
(1968), 565-567, MR 37 #6320) and the proof by A. Klein (Canad. Math. Bull. 23
(1980), 495-496).


Solved also by D. D. Anderson, G. Behrendt (Germany), B. W. Brock, D. Caccia, D. Callan, R. J. 
Chapman (U. K.), T. C. Craven, E. Dobrowolski and N. Buck (Canada), N. J. Fine, E. A. Herman, M. 
Hongan (Japan), M. Juvan (Slovenia), S. Kanetkar, K. S. Kedlaya (student), J. F. Kennison, J. J. 
Kuzmanovich, C. Lanski, S. C. Locke, O. P. Lossers (The Netherlands), R. F. McCoart, Jr., A. Miller 
(France), W. K. Nicholson (Canada), F. Schmidt, R. Stong, E. T. Wong, University of Wyoming 
Problem Circle, and the proposer. 
A Nonlinear Integer-Valued Iteration


10247 [1992, 781]. Proposed by Cristian Turcu, London, U.K.


For a fixed real number $A$, define a sequence $\{X_n : n \ge 0\}$ by

$$X_0 = 0 \quad\text{and}\quad X_{n+1} = \frac{3X_n - \sqrt{5X_n^2 + 4A^2}}{2} \quad\text{for } n \ge 0.$$

(a) For which $A$ is the sequence $X_n$ convergent?

(b) For which $A$ are all $X_n \in \mathbb{Z}$?


Solution by Lev Wertheim (student), Novosibirsk State University, Novosibirsk,
Russia. (a) We have $X_1 = -|A|$, and clearly $X_{n+1} \le \tfrac{3}{2}X_n \le 0$. So $\{X_n\}$ converges
if and only if $A = 0$, in which case $X_n = 0$.

(b) From the definition of $\{X_n\}$ we can conclude

$$(2X_{n+1} - 3X_n)^2 = 5X_n^2 + 4A^2$$

or

$$X_{n+1}^2 - 3X_{n+1}X_n + X_n^2 = A^2.$$

Replacing $n$ with $n - 1$ in this last equation yields also
$X_{n-1}^2 - 3X_{n-1}X_n + X_n^2 = A^2$.

Thus $X_{n-1}$ and $X_{n+1}$ are both roots of the quadratic equation

$$t^2 - 3X_n t + X_n^2 - A^2 = 0.$$

If $A \ne 0$, then the inequality in the proof of (a) is strict, so $X_{n-1}$ and $X_{n+1}$ are
distinct. Hence they are all of the roots, so by Vieta's theorem

$$X_{n-1} + X_{n+1} = 3X_n. \qquad (1)$$

From this it is immediate that if $X_{n-1}, X_n \in \mathbb{Z}$ then $X_{n+1} \in \mathbb{Z}$. Thus $X_n \in \mathbb{Z}$ for
all $n$ iff $A \in \mathbb{Z}$.


Editorial comment. As observed by many solvers, we have

$$X_n = -F_{2n}|A|$$

where $(F_n) = (1, 1, 2, 3, 5, \ldots)$ is the usual Fibonacci sequence. This follows at
once from the above solution since (1) is obviously satisfied if $X_n = -F_{2n}|A|$, and
one checks that in fact

$$X_n = -F_{2n}|A|$$

for the first few terms.
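
The closed form can be spot-checked for integer $A$ with exact arithmetic. The sketch below iterates the original recursion using an integer square root and compares the result with $-F_{2n}|A|$, taking $F_0 = 0$ and $F_1 = 1$; the helper names `turcu_sequence` and `fib` are introduced here for illustration.

```python
from math import isqrt

def turcu_sequence(A, terms):
    """X_0..X_terms for integer A, using exact integer arithmetic."""
    xs = [0]
    for _ in range(terms):
        x = xs[-1]
        disc = 5 * x * x + 4 * A * A
        root = isqrt(disc)
        if root * root != disc:
            raise ValueError("5*X_n^2 + 4*A^2 is not a perfect square")
        xs.append((3 * x - root) // 2)
    return xs

def fib(k):
    """Fibonacci number F_k with F_0 = 0 and F_1 = 1."""
    a, b = 0, 1
    for _ in range(k):
        a, b = b, a + b
    return a

A = 3
xs = turcu_sequence(A, 10)
print(xs == [-fib(2 * n) * abs(A) for n in range(11)])
```

The perfect-square check inside `turcu_sequence` never fires for integer $A$, which is another way of seeing that every term stays in $\mathbb{Z}$.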
Readers massaged the original recursion into a variety of different forms, from
which Fibonacci behavior was then recognized and the integer property deduced.
More general sequences of this type are discussed in Murray S. Klamkin,
“Perfect squares of the form $(m^2 - 1)a_n^2 + t$”, Math. Mag. 42 (1969), 111-113.


Solved by 62 readers (including those cited) and the proposer. In addition, three readers solved only 
part (a). 


Collaborating editors: David F. Appleyard, Paul T. Bateman, Duane M. Broline, 
Barry W. Brunson, Frank S. Cater, Gulbank D. Chakerian, Underwood Dudley,
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. 
Griggs, Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, 
Daniel J. Kleitman, Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer, 
Stephen L. Portnoy, J. O. Shallit, John Henry Steelman, Kenneth B. Stolarsky, 
David E. Tepper, Douglas B. Tyler, Daniel Ullman, and William E. Watkins. 
REVIEWS


Edited by Darrell Haile 
Indiana University, Bloomington, IN 47405


Ideals, Varieties, and Algorithms. By David Cox, John Little, Donal O’Shea. 


Springer-Verlag, New York, 1992, vii + 513, $39.95. 


Reviewed by Moss Sweedler 


Pure mathematics has resisted computers longer than most branches of science. 
The advent of computers is finally, fundamentally transforming the practice of 
mathematics. Both pure and applied mathematics and mathematicians are experi- 
encing the upheaval. The changes include: 


*The line between pure and applied mathematics is in flux.
*Computers spur new areas of research and revive languishing areas.
*Computers increase productivity or make productivity possible by providing
 new means for:
    generating examples and testing conjectures;
    recording and disseminating information;
    conducting joint research among far removed researchers.
*Algorithm development motivates theory.
*Computers enhance teaching by providing:
    vastly improved graphical presentation;
    ability to present non-trivial examples in class;
    ability to routinely assign non-trivial homework problems;
    tools for students to analyze, experiment and play with the subject matter.


Areas of mathematics such as applied logic, algebraic geometry, combinatorics, 
commutative algebra, dynamical systems, and many others, owe their existence, 
rebirth or major new research initiatives to the influence of computers. Researchers
in these areas use computers as an extension of hand calculation to
compute examples which test conjectures and provide data to increase understand- 
ing and motivate theorems. Beyond computation, computers exert a profound 
influence on mathematical research. Researchers delve into the algorithms under- 
lying computation. They find motivation for pure mathematics from analyzing 
existing algorithms and developing new algorithms. Significant developments in the 
creation of algorithms require significant developments in theory. This is a fertile 
source of new mathematical theory. 

Ideals, Varieties, and Algorithms by David Cox, John Little and Donal O’Shea is 
among the first of a new breed of algebraic geometry books. It is oriented toward 
the coming era emphasizing computation within mathematics. Algebraic geometry 
books from previous computational eras do not take into account the theory, 
computational techniques and viewpoints developed in the interim. Ideals, Varieties,
and Algorithms is an introduction to the computational study of ideals,
varieties, modern algorithms and, importantly, the ways in which algorithms
motivate theory. Computation aside, Ideals, Varieties, and Algorithms is a fine
introduction to the study of algebraic geometry and commutative algebra. 

The most significant algorithmic developments for commutative algebra and 
algebraic geometry since the last era emphasizing computation within mathemat- 
ics, or perhaps of all time, are in the body of theory and techniques which have 
developed around the seminal constructive results of Bruno Buchberger. Buch- 
berger theory is an appropriate name for the area, in the same way that Galois 
theory refers to a body of theory and techniques developed around Galois’ seminal 
work. To the extent possible, Buchberger theory allows one to manipulate polyno- 
mials in several variables much as one manipulates polynomials in one variable. 
There are 7 keys to Buchberger theory. The first three generalize familiar ideas 
from polynomials in one variable: 


   Buchberger theory                          polynomials in one variable

1. suitable notion of leading term            highest degree term of a polynomial
2. reduction over a set of polynomials        synthetic division with multiple divisors
3. Groebner basis of an ideal                 principal generator of an ideal


Buchberger named Groebner bases after his thesis adviser Wolfgang Groebner.
Fourth is the S-pair, a polynomial resulting from two initial polynomials. S is for
syzygy. Fifth and sixth are Buchberger’s S-pair test and the resulting Buchberger
algorithm for constructing Groebner bases. Starting with a set S of polynomials,
one tests pairwise whether all the S-pairs reduce to zero over S. If so, S is a Groebner
basis for the ideal it generates. If not, form the union of S with the non-zero
reduction results and repeat using this augmented S. The process eventually
terminates with a Groebner basis for the ideal generated by the original S. Seventh
are multiplicative term orders, which play a fundamental role in both the basic
theory and the applications.
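
To make the S-pair test and the resulting loop concrete, here is a minimal sketch over the rationals. It rests on several assumptions made only for this illustration: a polynomial is a dictionary from exponent tuples to coefficients, the term order is lexicographic (Python's tuple comparison), and the loop handles one pair at a time from a work list rather than re-testing every pair on each pass. The names `lead`, `add_scaled`, `reduce_poly`, `s_pair` and `buchberger` are hypothetical helpers, not taken from the book under review.

```python
from fractions import Fraction
from itertools import combinations

def lead(p):
    """Leading (exponent tuple, coefficient) of p under lex order."""
    e = max(p)
    return e, p[e]

def add_scaled(p, q, c, shift):
    """Return p + c * x^shift * q, dropping terms whose coefficient is zero."""
    out = dict(p)
    for e, a in q.items():
        e2 = tuple(i + j for i, j in zip(e, shift))
        v = out.get(e2, 0) + c * a
        if v:
            out[e2] = v
        else:
            out.pop(e2, None)
    return out

def reduce_poly(p, basis):
    """Remainder of p after reduction over the polynomials in basis."""
    p, rem = dict(p), {}
    while p:
        e, c = lead(p)
        for g in basis:
            ge, gc = lead(g)
            if all(i <= j for i, j in zip(ge, e)):   # lead term of g divides
                shift = tuple(j - i for i, j in zip(ge, e))
                p = add_scaled(p, g, -c / gc, shift)
                break
        else:
            rem[e] = c   # no leading term divides: move the term to the remainder
            del p[e]
    return rem

def s_pair(f, g):
    """S-pair of f and g: cancel their leading terms against each other."""
    (fe, fc), (ge, gc) = lead(f), lead(g)
    l = tuple(max(i, j) for i, j in zip(fe, ge))
    fs = tuple(i - j for i, j in zip(l, fe))
    gs = tuple(i - j for i, j in zip(l, ge))
    return add_scaled(add_scaled({}, f, 1 / fc, fs), g, -1 / gc, gs)

def buchberger(polys):
    """Enlarge polys until every S-pair reduces to zero."""
    basis = [{e: Fraction(c) for e, c in p.items()} for p in polys]
    pairs = list(combinations(range(len(basis)), 2))
    while pairs:
        i, j = pairs.pop()
        r = reduce_poly(s_pair(basis[i], basis[j]), basis)
        if r:   # non-zero reduction result: adjoin it and queue the new pairs
            pairs += [(k, len(basis)) for k in range(len(basis))]
            basis.append(r)
    return basis

# Example in Q[x, y] with x > y: the ideal generated by x - y^2 and x*y - 1.
G = buchberger([{(1, 0): 1, (0, 2): -1}, {(1, 1): 1, (0, 0): -1}])
for g in G:
    print(g)
```

In this example the single S-pair of the two generators reduces to $1 - y^3$, which is adjoined; the remaining S-pairs then reduce to zero, so the loop stops with three polynomials. The result is a Groebner basis but is not minimized or normalized.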

Buchberger theory is as fundamental as, and more elementary than, Galois theory
and should be taught in undergraduate algebra courses and first year graduate 
algebra courses. Ideals, Varieties, and Algorithms is an excellent introduction to 
Buchberger theory and other computational methods. Not only are the techniques 
presented clearly, they are presented in the appropriate geometric context. 

In 1985 I learned about Buchberger’s work from Barry Trager at IBM’s Watson 
Research Center. I was stunned. Surely he must be mistaken or else these 
powerful and simple techniques, useful for both computation and proofs¹ and
available since the mid sixties, would routinely be included in algebra courses and 
algebra journals. Instead they were virtually unknown among academic algebraic 
geometers and commutative algebraists. I started writing a Buchberger book at the 
minimal level to present the algorithms; namely, just assuming familiarity with 
polynomials in several variables. Several chapters into the book I realized that it 
would be impossible to present many applications and that the entire treatment 
suffered from the lack of geometric context. Ideals, Varieties, and Algorithms is the 
book I wish I had written. 

Buchberger theory, other computational methods, and the related theory are 
becoming known to commutative algebraists and algebraic geometers because of 
their importance for computation and because they provide the basis for many new 


1 Originally Galois theory was primarily computational. Nowadays, proofs often make use of the
fact that a Galois group exists and no computation is involved. The same can be done with Buchberger
theory.
interesting problems. The computer algebra community realized the significance of
this work before the general academic pure mathematics community. The com- 
puter algebra community involves a full spectrum from engineers to theoreticians. 
The problem solvers using computer algebra, the implementors, the algorithm 
developers and the theory developers work side by side, motivating and benefiting 
from each other’s developments and problems. At best, the result is a robust, 
stimulating, productive scientific environment. At worst, overzealous theoreticians
(Puritans) and overzealous algorithm developers (Algos) divide into narrow-minded,
self-serving communities. Each thinks it is smarter and works harder than
the other. The Algos believe that algorithmic solutions to problems require better 
theorems with more difficult proofs than non-algorithmic solutions. The Puritans 
believe that the Algos work on problems which have already been solved. 

Purely existential results are fundamental to algorithm development.² For
implementation in computer algebra systems, one wishes to use efficient algo- 
rithms as measured by the number of computational steps and memory usage. 
Determining bounds on algorithms and finding efficient algorithms requires and 
motivates further theory. This is why algorithmic approaches to problems can be 
more difficult and involve more theory than non-algorithmic approaches. 

Suppose a new area of mathematics has just opened. Here is a list of new 
problems and their difficulties. 


Problem:                                   A    B    C    D    E    F   ...
Difficulty of non-algorithmic approach:    5   10   15   20   25   30   ...
Difficulty of algorithmic approach:        6   12   18   24   30   36   ...


At first the Puritans and Algos work on problem A. An existential solution is 
obtained before an algorithmic solution. Soon the Puritans are working on Prob- 
lem F, while the Algos are working on problem E. This is why the Puritans feel the 
Algos are working on problems which have already been solved. The Algos have 
contempt for problem F as being of only theoretical interest and irrelevant to the 
real world. The first Algos developing algorithmic approaches to F face this 
contempt from their Algo peers. Once F becomes established on an algorithmic 
basis, the Algos act as if they always believed it had real world relevance. 

Given the antipathy between the two communities, it is not surprising that 
Buchberger’s contribution is poorly understood and poorly acknowledged. The 
Puritans confuse the significance of the contributions of Buchberger and Hironaka. 
Buchberger’s algorithm for constructing Groebner bases is the primary engine in 
modern commutative algebra and algebraic geometry. Groebner bases have an 
abstract characterization which is independent of any algorithm to create them. 


2 By the Hilbert basis theorem, an infinite generating set contains a finite generating set for ideals in
polynomial rings over a field. Suppose we have a technique which produces a stream of polynomials
$(f_1, f_2, \ldots)$ which generates an ideal $I$; i.e., for every $h \in I$ there are a finite number of polynomials
$\{g_i\}$ where $\sum f_i g_i = h$. Furthermore, suppose we have a concrete test of whether a given finite set $S$
generates $I$. Applying this test repeatedly to the sets $S_n = \{f_1, f_2, \ldots, f_n\}$ we are assured by the Hilbert
basis theorem that we reach a generating set for $I$. This ensures we may terminate computation after a
finite number of steps but gives no idea how many steps will be required.
This abstract characterization is the polynomial analog of standard bases for power
series introduced by Hironaka, shortly before Buchberger’s work. Consequently, 
Puritans ask: 

Does Buchberger deserve credit for developing these bases for polynomial 
rings? 

Should the bases in polynomial rings be called Groebner or standard? 

Such questions miss the point and obscure the credit Buchberger deserves. 
Hironaka introduced no algorithm to create standard bases. The abstract notion of 
the basis without the S-pair test and resulting algorithm is like the glider rather 
than the airplane or the hot air without the balloon. While gliders rise to heights 
and have practical uses, compare the flight industry based on airplanes without 
engines with the flight industry based on airplanes with engines. 

The first printing of the first edition of Ideals, Varieties, and Algorithms does
not give Buchberger enough credit. It is easy to come away thinking: “Buchberger, 
he’s the guy who invented standard bases after Hironaka did.” It would have been 
refreshing and long overdue to come away thinking: ‘Buchberger, he’s the guy 
whose fundamental work made a market for this book, provided an engine for a lot 
of computer algebra systems and is changing the face of commutative algebra and 
algebraic geometry.” Cox, Little and O’Shea should say twenty Haile Bruno’s and 
remedy this. Only twenty because they have fashioned a beautiful introduction to 
Buchberger theory. 

For a more extensive catalog of algorithms in Buchberger theory, I recommend 
Groebner Bases by Becker and Weispfenning in the Springer GTM series. Ideals,
Varieties, and Algorithms and Groebner Bases are excellent companions. 

The authors and publisher are to be commended for holding down the price of 
Ideals, Varieties, and Algorithms. The availability of top quality, inexpensive books 
helps to advance the field. By no coincidence, Ruediger Gebauer at Springer 
worked in this area. The authors and publisher were magnanimous in letting the 
manuscript circulate quite freely up until publication. I imagine this policy helps 
the sale of good books. 

Ideals, Varieties, and Algorithms appears in the Undergraduate Texts in Mathe- 
matics series. The mathematical prerequisites of the book as described in the 
preface are: “the students should have had a course in linear algebra and a course
where they learned how to do proofs.” I understand that the authors have 
successfully used the book as a textbook for a lower level undergraduate course. 
Nevertheless, I consider the book to be a wonderful text, not just a source book or 
reference book, for an upper level undergraduate or lower level graduate course. 
The exposition is very clear, there are many helpful pictures, and there are a great 
many instructive exercises, some quite challenging, providing a rich fund of 
homework problems. Applications made possible by computers are part of the 
impetus for the study and development of modern commutative algebra and 
algebraic geometry. The book presents applications in chapters on Robotics, 
Invariant Theory of Finite Groups and Automatic Geometric Theorem Proving. 
Automatic Geometric Theorem Proving is not an artificial intelligence attempt to 
duplicate the human theorem proving process. Instead, using analytic geometry, 
the theorem to be proved is reduced to a system of equations, possibly with some 
exceptional conditions or inequalities. One then uses algebraic means to verify the 
resulting algebraic system. 

A number of books about computational commutative algebra and algebraic 
geometry are on the verge of publication. Many are strong on algebra and 
algorithms and weak on the underlying geometry. The heart without the soul.
Ideals, Varieties, and Algorithms offers the heart and soul of modern commutative 
algebra and algebraic geometry. 


The review is based on the following. First contact was when the manuscript began circulating 
several years ago. Next contact was to review it for a publisher other than Springer. Recently, I 
conducted a reading course from the book. While I have tried to write impartially, it should be noted 
that I am friends with Bruno Buchberger, David Cox and Ruediger Gebauer. 


Department of Mathematics 
Cornell University 
Ithaca, NY 14853-7901 


Calculating for Cubics 
Letter to the Editor: 


Take a cubic equation $X^3 + pX + q = 0$ over a field $F$ (of characteristic
$\ne 2, 3$), and assume that its discriminant $D = -4p^3 - 27q^2$ is a
square $D = d^2$ in $F$. The roots $x_1, x_2, x_3$ then are either in $F$ or in a
cyclic Galois extension of degree 3. Either trivially or by Galois theory,
there is then a quadratic $Q(Y) = aY^2 + bY + c$ over $F$ with $Q(x_1) = x_2$,
$Q(x_2) = x_3$, $Q(x_3) = x_1$. Recently H. B. Griffiths and A. E. Hirst
(MONTHLY 101, 151-161) derived $a, b, c$ in terms of $p, q$ as a consequence
of other results, saying that “the algebraic theory does not
explicitly tell us how to calculate [them].” Rising to the challenge, I shall
sketch a straightforward computation.

Let $\zeta = (-1 + \sqrt{-3})/2$ be the cube root of 1. Set $s = -27q/2 + 3\sqrt{-3}\,d/2$,
and let $u^3 = s$. Cardano’s formulas give one root as
$3x_1 = u - 3p/u = u - 3pu^2/s$; similarly, $3x_2 = \zeta u - 3p\zeta^2 u^2/s$ and
$3x_3 = \zeta^2 u - 3p\zeta u^2/s$. Observe that replacing $u$ by another cube root $\zeta u$ or
$\zeta^2 u$ permutes the indices of the roots cyclically. Now the equation
$Q(x_1) = ax_1^2 + bx_1 + c = x_2$ becomes

$$a(u^2/9 - 2p/3 + up^2/s) + b(-u^2 p/s + u/3) + c = -p\zeta^2 u^2/s + \zeta u/3.$$

If we just try to make the coefficients of $1, u, u^2$ agree, we get a system of
equations for $a, b, c$:

$$\begin{aligned}
a(-2p/3) + c &= 0, \\
a(p^2/s) + b/3 &= \zeta/3, \\
a(1/9) - b(p/s) &= -\zeta^2 p/s.
\end{aligned}$$

It is straightforward to solve this, and we get

$$\begin{aligned}
a &= 9\sqrt{-3}\,ps/(s^2 + 27p^3) = 3p/d, \\
b &= \zeta - 9p^3/sd = -(9q + d)/2d, \\
c &= (2p/3)a = 2p^2/d.
\end{aligned}$$

As $u$ does not occur explicitly in these formulas, we can replace it by the
other cube roots and find that these $a, b, c$ also give $Q(x_2) = x_3$ and
$Q(x_3) = x_1$.
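
A quick numerical check of these formulas, offered as a sketch rather than as part of the letter: take the cubic $X^3 - 3X + 1$, whose discriminant is $81 = 9^2$ and whose roots are $2\cos(2\pi k/9)$ for $k = 1, 2, 4$. The example cubic and the function name `cyclic_quadratic` are assumptions made for this illustration.

```python
import math

def cyclic_quadratic(p, q, d):
    """Coefficients (a, b, c) of Q(Y) = a*Y**2 + b*Y + c from the formulas above."""
    return 3 * p / d, -(9 * q + d) / (2 * d), 2 * p * p / d

p, q = -3.0, 1.0
D = -4 * p ** 3 - 27 * q ** 2   # discriminant of X^3 + pX + q
d = math.sqrt(D)                # one of the two square roots of D
a, b, c = cyclic_quadratic(p, q, d)

# Roots of X^3 - 3X + 1, from the identity 2cos(3t) = (2cos t)^3 - 3(2cos t).
roots = [2 * math.cos(2 * math.pi * k / 9) for k in (1, 2, 4)]
images = [a * x * x + b * x + c for x in roots]
print((a, b, c))
print(roots)
print(images)
```

Each image should coincide with a different root, so $Q$ permutes the three roots in a 3-cycle; choosing the other square root $-d$ gives the quadratic for the opposite cycle.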

William C. Waterhouse 

Department of Mathematics 

Penn State 

University Park, PA 16802 
TELEGRAPHIC REVIEWS


Edited by Arnold Ostebee and Paul Zorn 
with the assistance of the Mathematics Departments of 
Carleton, Macalester, and St. Olaf Colleges 


Telegraphic Reviews are designed to alert readers in a timely manner to new books 
and computer software appropriate to mathematics teaching and research. Special 
codes classify reviews by subject area and appropriate use: 


T: Textbook                  P: Professional Reading
C: Computer Software         L: Undergraduate Library
S: Supplementary Reading     13-18: Grade Level
                             1-4: Semester
**: Special Emphasis
??: Questionable


Readers are advised that price information is subject to change. Selected books and 
software packages receive a second, more extensive review in the Monthly. 


Books and software submitted for review should be sent to Book Reviews Editor,
American Mathematical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield,
MN 55057-1098.


General, T(13-14: 1), S, L*. Concepts & 
Images: Visual Mathematics. Arthur L. Loeb. 
Design Sci. Collection. Birkhäuser, 1993, xi +
228 pp, $49.50. [ISBN 0-8176-3620-X] Non- 
technical introduction to mathematics of visual 
figures: symmetry, tessellation, glides, reflec- 
tion, translation, quasi-symmetry, geometric 
construction, etc. In “workbook” style; many 
drawing and tracing exercises. Informal and 
inviting—a double antidote for visual illiteracy 
and math anxiety. PZ 


General, S*(13-18), L**. Spirals: From 
Theodorus to Chaos. Philip J. Davis. AK Pe- 
ters, 1993, ix + 237 pp, $29.95. [ISBN 1-56881- 
010-5] Spirals in general and the Theodorus 
spiral in particular, from many, many points of 
view: historical, theoretical, computational, an- 
alytic, dynamic. Extensive notes and historical 
supplements. Generous, learned, entertaining; 
a tour de force in the liberal tradition. Based on 
author’s 1990 Hedrick Lectures. PZ 


Reference, S, C, P. The Maple Handbook.
Darren Redfern. Springer-Verlag, 1993,
497 pp, $29 (P). [ISBN 0-387-94054-5] Doc- 
uments all commands available in Maple V Re- 
lease 2. Concisely organized; 12 subject areas, 
each with a short introduction. DP


Mathematics Appreciation, S(15-16), P, L*.
The Broken Dice and Other Mathematical Tales 
of Chance. Ivar Ekeland. Transl: Carol Volk. 
Univ of Chicago Pr, 1993, 183 pp, $19.95. 
[ISBN 0-226-19991-6] A refreshingly liter- 
ary look at the mathematics of luck and fate, 
told partly through Norse myths. BC 


Recreational Mathematics, S(13). Mathematical
Cavalcade. Brian Bolt. Cambridge Univ
Pr, 1992, vi + 118 pp, $17.95 (P). [ISBN 0- 
521-42617-0] 131 mathematical recreations 
(puzzles, curiosities, games, activities) on arith- 
metic, geometry, graph theory, logical thinking. 
No prerequisites (except high school algebra for 
some problems). LCL 


History, P, L. The Search for E.T. Bell, Also 
Known as John Taine. Constance Reid. Spectrum
Ser. MAA, 1993, x + 372 pp, $35. [ISBN
0-88385-508-9] Engaging biography of one 
of the more colorful “men of mathematics” of 
the 20th century. BC 


History, P**, L*. A Century of Mathematics: 
Through the Eyes of the Monthly. Ed: John 
H. Ewing. MAA, 1994, xi + 323 pp, $39.50. 
[ISBN 0-88385-459-7] A wide-ranging and 
delightful collection of articles, excerpts, pho- 
tos, notes, news clips, etc., from the Monthly’s 
first 100 years (1894-1994). Mathematical ex- 
position at its best, covering research, educa- 
tion, recreation, competitions, sociology, and 
more. A unique survey of modern American 
mathematics as discipline and as profession. 


Logic, S(13-14), L**. Introduction to Mathematical
Philosophy. Bertrand Russell. Dover,
1993, viii + 208 pp, $6.95 (P). [ISBN 0-486-27724-0]
An unaltered republication of the
1919 Second Edition of this classic. Topics
include numbers, orders, relations, infinite cardinal
numbers, limits and continuity, propositional
functions, descriptions, and classes. DP


Foundations, T*(16-18: 1, 2), S*, P, L. The
Joy of Sets: Fundamentals of Contemporary Set
Theory, Second Edition. Keith Devlin. Undergrad.
Texts in Math. Springer-Verlag, 1993, x +
192 pp, $29.95. [ISBN 0-387-94094-4] Following
a brisk introduction to “naive” set theory,
develops Zermelo-Fraenkel theory in detail:
Z-F axioms, cardinal and ordinal numbers,
axiomatic set theory, constructibility, etc. Final
chapter outlines non-well-founded set theory,
an alternative to Z-F. First half has exercises,
problems; clear, reader-friendly exposition
throughout. (Revised edition of Fundamentals
of Contemporary Set Theory, TR,
March 1980.) PZ


Group Theory, T*(17: 1). Topics in Com- 
binatorial Group Theory. Gilbert Baumslag. 
Lect. in Math. Birkhäuser, 1993, vii + 164 pp,
$29.50 (P). [ISBN 0-8176-2921-1] Notes
from a 1987-88 course at ETH Zürich. Top-
ics include history, the Burnside problem, free
groups and the Reidemeister-Schreier method, 
free products and HNN extensions, groups act- 
ing on trees. DP 


Algebra, T(15-16: 1, 2). Abstract Algebra: 
Theory and Applications. Thomas W. Judson. 
PWS, 1994, xiii + 427 pp. [ISBN 0-534-93684-
9] Groups, rings, fields, in that order. Sylow
theorems, lattice and Boolean algebras, Galois 
theory, applications (e.g., cryptography, alge- 
braic coding theory), and historical notes. LC 


Complex Analysis, T(18: 1, 2), S, P. Bound- 
ary Behaviour of Conformal Maps. Ch. Pom- 
merenke. Grund. der math. Wissenschaften, 
B. 299. Springer-Verlag, 1992, ix + 300 pp, 
$69. [ISBN 0-387-54751-7] Wide-ranging 
survey of conformal mapping theory, starting 
with basics; stresses relation between geometry 
of simply-connected domains and analysis of 
their Riemann mapping functions. Each chapter 
begins with short, helpful overview. Readable 
exposition; many brief exercise sets; sizable 
bibliography. PZ 

Differential Equations, T(14: 1, 2). Ele-
mentary Differential Equations with Boundary
Value Problems, Third Edition. C.H. Edwards, 
Jr., David E. Penney. Prentice Hall, 1993, xv + 
774 pp. [ISBN 0-13-253410-X] Flexible text: 
systems can be taught with or without assum- 
ing linear algebra; power series are reviewed 
before being applied. Impulses, Dirac delta 
functions are optional topics. Covers numer-
ical techniques with BASIC code. Plentiful ex-
ercises, many with solutions. (Second Edition,
TR, December 1989.) SM 


Differential Equations, P. Global Behavior 
of Nonlinear Difference Equations of Higher 
Order with Applications. V.L. Kocic, G. Ladas. 
Math. & Its Applic., V. 256. Kluwer Academic, 
1993, xi + 228 pp, $102. [ISBN 0-7923-2286-
X] Basics, global stability, rational recursive
structures, applications, periodic cycles, open 
problems. Comprehensive bibliography. DH 


Partial Differential Equations, P. Asymp- 
totic Behaviour of Solutions of Evolutionary 
Equations. M.I. Vishik. Cambridge Univ Pr, 
1992, 155 pp, $39.95; $18.95 (P). [ISBN 
0-521-42023-7; 0-521-42237-X] Global ap- 
proximate solutions of evolutionary equations 
are constructed from locally asymptotic solu- 
tions by perturbing the equations. Equations 
considered include Navier-Stokes, reaction- 
diffusion, and other systems of parabolic and 
hyperbolic type. 40-page appendix on non- 
autonomous systems. SP 


Dynamical Systems, P. Stochastic and 
Chaotic Oscillations. Yu. I. Neimark, P.S.
Landa. Math. & Its Applic., V. 77. Kluwer Aca-
demic, 1992, xii + 500 pp. [ISBN 0-7923-1530-
8] Presents various dissipative dynamical sys-
tems (from engineering, mechanics, chemistry, 
and biology) that exhibit chaotic behavior. SP 


Dynamical Systems, T(14: 1). Mathematics 
of Models: Continuous and Discrete Dynamical 
Systems. H. Brian Griffiths, Adrian Oldknow. 
Math. & Its Applic. Ellis Horwood, 1993, xiii 
+ 435 pp, $42. [ISBN 0-13-563800-3] Two 
parts: (1) introduction to classical models; geo- 
metric tools; more complex models (including 
catastrophe models, discrete nonlinear dynam-
ics); (2) theoretical support. DH


Dynamical Systems, P. Dynamical Systems 
VIII: Singularity Theory II, Applications. Ed: 
V.I. Arnol’d. Ency. of Math. Sci., V. 39. 
Springer-Verlag, 1993, 235 pp, $89. [ISBN 
0-387-53376-1] Self-contained treatment of 
classification of functions and mappings, Leg- 
endre and Lagrangian singularities, Maxwell 
sets, gradient dynamical systems, singularities 
of the boundaries of domains of function spaces, 
ramified integrals, monodromy, deformation of 
real singularities, local lacunas, etc. SP 


Dynamical Systems, P, L. The General Prob- 
lem of the Stability of Motion. A.M. Lyapunov. 
Transl: & Ed: A.T. Fuller. Taylor & Francis, 
1992, ix + 270 pp. [ISBN 0-7484-0062-1]
English translation of Lyapunov’s 1892 mem- 
oir. Develops notion of stability of equilibria, a 
central theme in dynamical systems. SP 


Dynamical Systems, T(15-17: 2), L. Dynam-
ical Systems: Differential Equations, Maps and
Chaotic Behaviour. D.K. Arrowsmith, C.M. 
Place. Chapman & Hall, 1992, x + 330 pp. 
[ISBN 0-412-39070-1] For second DE course 
with qualitative emphasis. Treatment driven by 
examples. Many exercises—some require com- 
puter. Broad coverage. SP


Numerical Analysis, T(15: 1), S, C. An Intro-
duction to Numerical Linear Algebra. Charles
G. Cullen. PWS, 1994, xiii + 314 pp, disk 
included. [ISBN 0-534-93690-3] Numerical 
methods for solving linear systems directly and 
by iteration, finding eigenvalues and eigenvec- 
tors, linear least squares approximations. DH 


Algebraic Topology, T(17-18: 2, 3). Topology 
and Geometry. Glen E. Bredon. Grad. Texts 
in Math., V. 139. Springer-Verlag, 1993, xiv + 
557 pp, $69. [ISBN 0-387-97926-3] Mainly 
devoted to algebraic topology with “glimpses 
into the beautiful and important realm of smooth 
manifolds.” Covers the fundamental group, ho- 
mology, cohomology, homotopy theory. DP 


Operations Research, T(16-17), L. Mathe- 
matics for Operations Research. W.H. Mar- 
low. Dover, 1993, xv + 483 pp, $12.95 (P). 
[ISBN 0-486-67723-0] Reprint of 1978 Wi- 
ley edition (TR, December 1978). Topics in- 
clude linear algebraic and differential systems, 
optimization in n dimensions. Terse exposition;
assumes good calculus background. Many ex- 
ercises and solutions, both theoretical and com- 
putational. SM 


Probability, T(15-16: 1). The Essentials of 
Probability. Richard Durrett. Duxbury Pr, 
1994, viii + 269 pp. [ISBN 0-534-19230-0] 
Concise, classical treatments of theory precede 
numerous examples and exercises. Final chap- 
ter briefly introduces statistical inference. RSK 


Probability, T(14-15: 1). Probability. Jim 
Pitman. Texts in Stat. Springer-Verlag, 1993, 
xi + 559 pp, $49. [ISBN 0-387-97974-3] 
Calculus-based but somewhat non-traditional 
introduction, stressing intuitive understanding 
and problem solving, not theory (e.g., no men-
tion of moment generating functions or proof
of the central limit theorem). Many advanced 
topics appear, particularly in exercises. RSK 


Elementary Statistics, T*(13-15: 1, 2), L. 
Statistics for Engineering Problem Solving. 
Stephen B. Vardeman. PWS, 1994, xii + 811 pp.
[ISBN 0-534-92871-4] Unusual introduction 
to subject. Stresses concepts needed by practic- 
ing engineers, from data collection and descrip- 
tive statistics to fractional factorial designs and 
mixture studies. Amply illustrated with real ex- 
amples, MINITAB output. A final project inte- 
grates many of the methods covered. Extensive 
problem sets, many with real data. RSK 

Elementary Statistics, T(13: 1). Statistics in 
Practice. Ernest A. Blaisdell. Saunders Col-
lege, 1993, xxvi + 755 pp, $52. [ISBN 0-03-
032229-4] Well-written text on standard top-
ics, with early introduction to regression analy-
sis, frequent exposure to P-values. MINITAB
use is nicely interwoven. DH 


Mathematical Statistics, T*(16-17: 2), L. 
Statistical Theory, Fourth Edition. Bernard W. 
Lindgren. Chapman & Hall, 1993, xii + 633 pp, 
$57.95. [ISBN 0-412-04181-2] Revision and 
reorganization of 1976 Third Edition (TR, Oc- 
tober 1976). Statistical decision theory now de- 
ferred until the end; Bayesian inference comes 
earlier. Forty percent more problems. Still a 
solid presentation of classical theory. RSK 


Statistical Methods, T(16-17: 1), P. Design of 
Experiments: A No-Name Approach. Thomas 
J. Lorenzen, Virgil L. Anderson. Stat.: Text- 
books & Mono., V. 139. Marcel Dekker, 1993, 
xi + 414 pp, $59.75. [ISBN 0-8247-9077-4] 
Novel approach to experimental design. Avoids 
classical terminology in favor of four funda- 
mental concepts: factorial designs, nested de-
signs, restrictions on randomization (including
randomized complete block and split plot de- 
signs), and fractional designs. Theoretical ma- 
terial and exercises are coded, can be skipped. 
Many coded, worked examples and key prob- 
lems with solutions. RSK 


Statistics, P*. Bivariate Discrete Distribu- 
tions. Subrahmaniam Kocherlakota, Kath- 
leen Kocherlakota. Stat.: Textbooks & Mono., 
V. 132. Marcel Dekker, 1992, xvi + 361 pp, 
$125. [ISBN 0-8247-8702-1] Thorough in-
troduction to the most prevalent bivariate dis-
crete distributions, with latest techniques for 
computer simulation. Extensive bibliography, 
including references to multivariate discrete 
distributions. RSK 


Programming, C, P. C-XSC: A C++ Class Li- 
brary for Extended Scientific Computing. R. 
Klatte, et al. Transl: G.F. Corliss, et al.
Springer-Verlag, 1993, xv + 269 pp, $49.95 (P). 
[ISBN 0-387-56328-8] C-XSC (C-eXtended 
Scientific Computing) is a class library designed
for very high numerical precision. Other fea- 
tures include data types for interval arithmetic 
and easy dynamic allocation of matrices. MPR 


Computer Systems, P**, L. UNIX for Pro- 
grammers and Users: A Complete Guide. Gra- 
ham Glass. Prentice Hall, 1993, xxii + 633 pp, 
(P). [ISBN 0-13-480880-0] Excellent intro-
duction and reference. From basic utility com-
mands (ls, chmod, pwd, etc.), through complete 
discussions of the Bourne, Korn, and C shells. 
How to write scripts, how to use the Internet, 
and more. A must-have for regular UNIX users. 
With exercises, examples. MPR 


Computer Graphics, T(16-17: 2), L. Math-
ematics for Computer Graphics. S.G. Hoggar.
Tracts in Theoretical Comp. Sci., V. 14. Cam-
bridge Univ Pr, 1992, xviii + 472 pp, $39.95.
[ISBN 0-521-37574-6] Mathematics underly-
ing computer graphics: plane isometries, di-
hedral group, plane patterns, 3-space, matri-
ces, isometries, other transformations, fractals,
topology, iteration of complex functions. Scat- 
tered exercises; many illustrations. LC 


Applications (Biological Science), P. Popu- 
lation Dynamics and the Tribolium Model: Ge- 
netics and Demography. Robert F. Costantino, 
Robert A. Desharnais. Mono. on Theoret. & 
Appl. Genetics, V. 13. Springer-Verlag, 1991, 
xii + 258 pp, $89. [ISBN 0-387-97581-0] 
Interesting monograph integrates bifurcations, 
limit cycles, strange attractors, and stationary 
probability distributions with biological obser- 
vations on flour beetles. Treats age structure 
dynamics, oscillations, natural selection, dise- 
quilibrium, cannibalism, coexistence. SP 


Applications (Biological Science), P, L. The- 
oretical Mechanics of Biological Neural Net- 
works. Ronald J. MacGregor. Neural Net- 
works: Found. to Applic. Academic Pr, 1993, 
xii + 377 pp, $69.95. [ISBN 0-12-464255-1] 
Aim is a foundation for neural network theory 
akin to Newton’s foundation for classical me- 
chanics. SM 


Applications (Biological Science), P. Fluid 
Dynamics in Biology. Eds: A.Y. Cheer, C.P. van 
Dam. Contemp. Math., V. 141. AMS, 1993, 
xii + 586 pp, $73 (P). [ISBN 0-8218-5148-9] 
24 papers from a 1991 conference, including 
a very readable overview by James Lighthill, 
an article on computational biofluid dynamics 
by Peskin and McQueen, and one titled “Hairy 
little legs: Feeding, smelling, and swimming at 
low Reynolds numbers.” BC 


Applications (Communication Theory), P, L. 
Wavelets: Algorithms & Applications. Yves 
Meyer. Transl: Robert D. Ryan. SIAM, 1993, 
xi + 133 pp, $19.50 (P). [ISBN 0-89871-309- 
9] Introduces wavelets, outlines history and a 
few applications: signal processing, computer 
vision, turbulence, study of galaxies, etc. JO 


Applications (Engineering), P. Artificial In- 
telligence in Engineering Design, Volume I: De- 
sign Representation and Models of Routine De- 
sign. Eds: Christopher Tong, Duvvuru Sriram. 
Academic Pr, 1992, xv + 473 pp. [ISBN 0-12- 
660561-0] In two parts: knowledge represen- 
tation, its effects on design and constraint sat- 
isfaction; frameworks for design problem solv- 
ing, examples of designers. RJA 

Applications (Engineering), L. Software 
Systems for Structural Optimization. Eds: 
H.R.E.M. Hörnlein, K. Schittkowski. ISNM,
V. 110. Birkhäuser, 1993, viii + 283 pp, $89.50.
[ISBN 0-8176-2836-3] Finite element meth-
ods are used in structural design to test configu-
rations for stresses, displacements, natural fre- 
quencies, etc. 11 software packages described 
here go one step further, producing feasible de- 
signs that optimize an objective function (e.g., 
weight or stiffness). SM 


Applications (Engineering), S(17). Methods 
of Engineering Mathematics. Edward Haug, 
Kyung K. Choi. Prentice-Hall, 1993, xx + 
585 pp. [ISBN 0-13-579061-1] Encyclope- 
dic compilation of post-calculus mathematics 
needed for graduate work in engineering: lin- 
ear algebra, analysis, mechanics, ODE’s and 
PDE’s, numerical methods. SK 


Applications (Physics), T(17-18: 1, 2), S, 
P. The Link Invariants of the Chern-Simons 
Field Theory: New Developments in Topologi- 
cal Quantum Field Theory. Enore Guadagnini. 
Expos. in Math., V. 10. Walter de Gruyter, 1993, 
xiv + 312 pp, DM 148. [ISBN 3-11-014028-4] 
Self-contained text for introduction to topolog- 
ical quantum field theory. A detailed exposi- 
tion of a non-trivial gauge theory that is exactly 
solvable in any 3-manifold. MU 


Applications (Physics), S(15-16), L. And Yet
It Moves: Strange Systems and Subtle Ques- 
tions in Physics. Mark P. Silverman. Cam- 
bridge Univ Pr, 1993, xvii + 266 pp, $49.95; 
$24.95 (P). [ISBN 0-521-39173-3; 0-521- 
44631-7] Drawn from a rich professional ca-
reer, salient examples of incongruities predicted
by quantum theory are discussed, set in personal 
and historical context. Text is largely descrip- 
tive, but examples are buttressed by pertinent 
equations and charts. Excellent supplementary 
reading for a course in quantum mechanics. MU 


Applications (Quality Control), P. Reliabil- 
ity Improvement with Design of Experiments. 
Lloyd W. Condra. Quality & Reliability, V. 41. 
Marcel Dekker, 1993, x + 370 pp, $65. [ISBN 
0-8247-8888-5] Practical guide for “reason- 
ably intelligent and inquisitive technical person- 
nel who wish to set up and operate product relia- 
bility programs,” applying experimental design 
(primarily Taguchi methods) to improve relia- 
bility. Treats design of experiments, reliability 
methods, accelerated testing. With real exam- 
ples from author’s experience. RSK 


Reviewers 


RJA: Richard J. Allen, St. Olaf; LC: Laura Chihara, 
St. Olaf; BC: Barry Cipra, St. Olaf; DH: Deanna 
Haunsperger, St. Olaf; SK: Steve Kennedy, St. Olaf; 
RSK: Richard S. Kleber, St. Olaf; LCL: Loren C. Lar- 
son, St. Olaf; SM: Steve McKelvey, St. Olaf; JO: Jeff 
Ondich, Carleton; SP: Samuel Patterson, Carleton; DP: 
David Peifer, St. Olaf; MPR: Matthew P. Richey, St. Olaf; 
MU: Milton Ulmer, Carleton; PZ: Paul Zorn, St. Olaf. 


[June—July 


Bringing You Solutions. 


Catalan’s Conjecture 
Are 8 and 9 the Only 
Consecutive Powers? 


Paulo Ribenboim 


Fully accessible and self-contained, 
this book provides a 
thorough historical 
study of the efforts of
mathematicians to solve 
Catalan’s problem. The 
book features a compre- 
hensive bibliography 
and an appendix on 
Catalan’s equation and 
powerful numbers, and 
includes many beautiful 
results of classical num- 
ber theory not found in 
any other book. 


February 1994, 364 pp., $64.95 
ISBN: 0-12-587170-8


Optimization 
Techniques in Statistics 
Jagdish S. Rustagi 


A Volume in the STATISTICAL MODELING AND 
DECISION SCIENCE Series 


Statistics help guide us to optimal deci- 
sions under uncertainty. A large variety 
of statistical problems are essentially
solutions to certain optimization problems.


The mathematical techniques of optimiza- 
tion are fundamental to statistical theory 
and practice. In this book, Jagdish 
Rustagi provides full-spectrum coverage 
of these methods, ranging from classical 
optimization and Lagrange multipliers, 
to numerical techniques using gradients 
or direct search, to linear, nonlinear, and 
dynamic programming using the Kuhn- 
Tucker conditions or the Pontryagin 
maximal principle. 


CONTENTS (Chapter Headings): 
Synopsis. Classical Optimization 
Techniques. Optimization and 
Inequalities. Numerical Methods of 
Optimization. Linear Programming 
Techniques. Nonlinear Programming 
Methods. Dynamic Programming 
Methods. Variational Methods. 
Stochastic Approximation Procedures. 
Optimization in Simulation. Optimization 
in Function Spaces. Chapter Exercises 
and Applications. Bibliography. Index. 


April 1994, c. 352 pp., $69.95 
ISBN: 0-12-604555-0 


The History of Mathematics
Volume III
Eberhard Knobloch
and David E. Rowe


This volume contains nine essays that span three different
approaches to the history of mathematics. It addresses the
historiographical and philosophical issues involved in
determining the meaning of mathematical history, and
traces the convoluted development of the ideas of
differential geometry and analysis. The text also discusses
the structure and interaction of mathematical communities
through studies of these communities in the U.S. and in China.


May 1994, c. 450 pp., $55.00 (tentative)
ISBN: 0-12-599663-2


Finite Elements for Analysis and Design
J.E. Akin


A Volume in the COMPUTATIONAL MATHEMATICS
AND APPLICATIONS Series


This book provides a thoroughly revised
and up-to-date account of the finite
element method (FEM) and its numerous
applications, with added emphasis
on basic theory. Numerous worked
examples are included to illustrate the material.


Paperback: $39.95/ISBN: 0-12-047654-1
Casebound: $79.00/ISBN: 0-12-047653-3


March 1994, 560 pp.


Includes a disk with FORTRAN code for
the programs cited in the text.


From 
Academic Press Journals 


Historia Mathematica 
Editor 


David E. Rowe 
Universität Mainz, Germany


Managing Editor 


Karen Parshall 
University of Virginia, Charlottesville


Historia Mathematica is concerned 
with the history of all aspects of the 
mathematical sciences in all parts of 
the world and all historical periods. The 
journal publishes biographies of mathe- 
maticians and historians, studies of 
organizations and institutions, essays 
on historiography, and articles on the 
interactions among all facets of mathe- 
matical activity and other aspects of 
culture and society. 


Published under the Auspices of the International
Commission on the History of Mathematics
of the Division of the History of
Science of the International
Union of the History and
Philosophy of Science

Volume 21 (1994), 4 issues
ISSN 0315-0860

In the U.S.A. and Canada: $127.00
All other countries: $156.00

Journal subscriptions are for the calendar year 
and are payable in advance. 

Free sample copies are available on 
request. For more information, 
please contact: 

Academic Press, Inc.
Journal Promotion Department
525 B Street, Suite 1900
San Diego, California 92101-4495, U.S.A.
1-800-894-3434 

All prices are in U.S. dollars and are subject 
to change without notice. 

Canadian customers: Please add 7% Goods 
and Services Tax to your order. 


Order from your local bookseller or directly from 


ACADEMIC PRESS, INC. A Division of Harcourt Brace & Company
Order Fulfillment Department DM17915, 6277 Sea Harbor Drive, Orlando, FL 32887 


Call Toll Free 1-800-321-5068 or Fax 1-800-336-7377 


Prices subject to change without notice. ©1994 by Academic Press, Inc. All Rights Reserved. KR/LMW/MEH—02064


In Eves’ Circles 


Joby Milo Anthony, Editor 


Howard Eves celebrated his eightieth birth- 
day in 1991. To honor that occasion, the 
University of Central Florida sponsored a con-
ference that focused on the lifelong interests
of this prominent American mathematician,
namely, the history of mathematics, the teach-
ing of mathematics, and geometry. Eves is
well-known for his contributions to all three 
areas. 


The conference was unique. Conference 
participants included pre-college mathemat- 
ics teachers, community college and univer- 
sity teachers, and research mathematicians. 
Papers were delivered in sessions devoted to 
the classroom teacher, to the history of math- 
ematics, and to pedagogical and research 
interests in geometry. Many lectures com- 
bined these subjects. This book presents 
some of those lectures. Anyone involved with 
teaching or producing mathematics can find 
something in this volume that will be interest- 
ing to them. 


Some of these papers are specifically for the 
classroom teacher. They discuss a use of 
technology, or the organization of a class for 
some specific purpose. Other articles will 
provide teachers with examples of 
mathematical problems or historical episodes 
that can be used as part of a mathematics 
class. Still other papers deal with the
philosophy of mathematics education. There
are articles in this volume that present new 
insights into some of the history of 
mathematics, and there are other articles that 
deal with some new results in geometry. This 
is really the legacy of Howard Eves. He has 
been both a mentor for teachers at every 
level, and a colleague of research 
mathematicians. And so some part of this 
volume will be appropriate for anyone 
interested in mathematics. 


Also included in this volume is a penetrating 
interview with Eves. 


220 pp., 1994, Paperbound 
ISBN 0-88385-088-5

List: $24.00 

Catalog Number NTE-34 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-(800) 331-1622 Fax (202) 265-2384 


Total $
Payment: [ ] Check  [ ] VISA  [ ] MASTERCARD
Credit Card No.
Signature
Exp. Date

Call 1-800-331-1622
to order your bulk


Research Issues in Undergraduate
Mathematics Learning
Preliminary Analyses and Reports


James J. Kaput and 
Ed Dubinsky, Editors 


Research in undergraduate mathematics edu- 
cation is important for all college and university 
mathematicians. If our students are to be more 
successful in understanding mathematics, then 
college faculty need to understand how math- 
ematics is learned. This knowledge can guide us 
in curriculum reform and in improving our own 
teaching. It can help us make mathematics 
accessible to all students and it can increase the 
number of graduate students in mathematics. 

This volume of research in undergraduate 
mathematics education informs us about the 
nature of student learning in some of the most 
important topics in the undergraduate curricu- 
lum: sets, functions, calculus, statistics, abstract 
algebra and problem solving. Paying careful 
attention to the trouble students have in learn- 
ing mathematics will help us to work with stu- 
dents so they can deal with those difficulties. 

A survey of the literature begins the volume. 
Becker and Pence have brought together an 
unusually complete list of references on research 
in collegiate mathematics. Their comments will 
guide those attempting to begin or to continue 
a program of research in student learning. 

The sad fact that even good calculus students 
stumble over nonroutine problems is the theme 
of Selden, Selden, and Mason. Their conclu- 
sions point to significant shortcomings in the 
curriculum. This study of student difficulties is
continued by Ferrini-Mundy and Graham who
investigate a single student’s interactions with 
the fundamental concepts of the calculus. 
Baxter studies a group of students to learn how 
they acquire the concept of set, while Cuoco 
does the same for the concept of function. 

Cooperative learning does help the student. 
That is the conclusion of Bonsangue, who inves- 
tigates how two carefully matched classes of 
students in a statistics course perform on exams. 
How students learn to write proofs in group 
theory is the subject considered by Hart. 
Rosamond breaks new ground by comparing 
how emotions vary in their effect on the prob- 
lem solving ability of novices and experts. 

All college faculty should read this book to 
find how they can help their students learn 
mathematics. 


150 pp., Paperbound, 1994 
ISBN 0-88385-090-7 

List: $24.00 

Catalog Number NTE-33 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-(800) 331-1622 Fax (202) 265-2384 


Total $
Payment: [ ] Check  [ ] VISA  [ ] MASTERCARD
Credit Card No.
Signature


Industrial Mathematics
A Course in Solving Real-World Problems 
Avner FRIEDMAN and Walter LITTMAN 


“This is REAL. It has a different spirit. It gives students the distinct feeling
that they could go into industry and actually work on problems like those in
this book. The standard teaching of ‘here is the mathematics, use it to solve
this problem’ has been replaced with ‘here is a problem, use mathematics to
solve it.’ This book refreshes the interest of students in mathematics and
motivates them to learn more of it. It helps them understand the nature and
the importance of mathematics in real world applications.”

— Oscar Bruno, Assistant Professor of Mathematics, Georgia Institute
of Technology.


Contents 

Introduction; Preface to the Student; Chapter 1: Crystal Precipitation; Chapter 2: Air
Quality Modeling; Chapter 3: Electron Beam Lithography; Chapter 4: Development of
Color Film Negative; Chapter 5: How Does a Catalytic Converter Function?; Chapter 6:
The Photocopy Machine; Chapter 7: The Photocopy Machine (Continued); Index.


Available July 1994 / Softcover / Approximately 152 pages / ISBN 0-89871-324-2 
List Price $22.50 / SIAM Member Price $18.00 / Order Code OT42
Special Classroom Adoption Price $15.00 (minimum purchase of five copies) 


Wavelets: Algorithms & Applications 


Yves MEYER 
translated and revised by Robert D. RYAN 


“... I believe it is accessible to any scientifically minded reader with
rudimentary knowledge of Fourier analysis; furthermore, the seasoned
mathematician will find discussions of many interesting nonmathematical 
topics. ... I recommend the book as a delightful introduction to wavelets.” 
— Ingrid Daubechies, AT&T Bell Laboratories, Science, Vol. 262, 
December 3, 1993. 
1993 / xi + 133 pages / Softcover / ISBN 0-89871-309-9 
List Price $19.50 / SIAM Member Price $15.60 / Order Code OT38


TO ORDER


Use your credit card (AMEX, MasterCard and VISA): Call toll free in USA: 800-447-SIAM 
Outside USA call: 215-382-9800 / Fax: 215-386-7999 / E-mail: service@siam.org 

Or send check or money order to: SIAM, Dept. BKMA94, PO Box 7260, Philadelphia, PA 
19101-7260 


Payments may be made by wire transfer to SIAM's bank: PNC Bank, 3535 Market Street, 
Philadelphia, PA 19104, ABA Routing #031000053: Account Name: Society for Industrial and 
Applied Mathematics; Account Number 509-704-5. 


Shipping and Handling: USA: Add $2.75 for the first book and $.50 for each additional book.
Canada: Add $4.50 for the first book and $1.50 for each additional book. Outside USA/
Canada: Add $4.50 per book. All overseas delivery is via airmail.


Handbook of Writing for the
Mathematical Sciences


Nicholas J. HIGHAM 


Contents 

Preface; Chapter 1: General
Principles; Chapter 2: Writer's
Tools and Recommended Reading;
Chapter 3: Mathematical Writing;
Chapter 4: English Usage; Chapter 5:
When English is a Foreign Language;
Chapter 6: Writing a Paper; Chapter
7: Revising a Draft; Chapter 8:
Publishing a Paper; Chapter 9: Writing
a Talk; Chapter 10: Computer Aids for
Writing and Research; Appendix A:
The Greek Alphabet; Appendix B:
Summary of TeX and LaTeX Symbols;
Appendix C: GNU Emacs — The Sixty+
Most Useful Commands; Appendix D:
Mathematical Organizations;
Appendix E: Winners of Prizes for
Expository Writing; Glossary;
Bibliography; Index.

1993 / xii + 241 pages / Softcover 

ISBN 0-89871-314-5 / List Price $21.50 

SIAM Member Price $17.20 

SIAM Student Member Price $12.00 

Order Code OT39


Quantity discounts available for classroom use. 


Mathematics Applied to 
Deterministic Problems 
in the Natural Sciences 
C.C. LIN and L.A. SEGEL 


Addresses the construction, analysis, 
and interpretation of mathematical
models that shed light on significant 
problems in the physical sciences. 


1988 / xxi + 609 pages / Softcover 
ISBN 0-89871-229-7 / List Price $28.00 
SIAM Member Price $22.40 

Order Code CL01 


For complete tables of contents or
additional information about SIAM
publications, access gopher.siam.org or
contact SIAM Customer Service Department.


Science and Industry
Advance with Mathematics 


Selected by CHOICE Magazine as one of 


the outstanding academic books of 1991. 


More Mathematical 
Morsels 


Ross Honsberger 


Honsberger shows how powerful problem-solving is
using elementary level mathematical techniques. His
book contains a potpourri of intriguing and atypical
problems and their solutions for the highly motivated 
student of mathematics...A wonderful source of 
interesting problems for highly gifted high school 
students, undergraduate mathematics students, and 
their instructors. —CHOICE 


Ross Honsberger is the best-selling author of 
seven books in the Dolciani Mathematical Expo- 
sition series, each of which presents problems 
from algebra, arithmetic, number theory, prob- 
ability, and geometry, and provides ingenious 
solutions and/or intriguing results. His most 
recent addition to the series is More Mathemati- 
cal Morsels which is a continuation of his best- 
selling Mathematical Morsels volume published 
in 1979. 


All of the problems are accessible to anyone with 
a knowledge of freshman mathematics. 


Here is Morsel 1: No matter which 55 positive 
integers may be selected from {1, 2, ..., 100},
prove that you must choose some two that differ
by 9, some two that differ by 10, some two that
differ by 12, and some two that differ by 13, but 
that you need not have any two that differ by 11. 


315 pp., Paperbound, 1990 
ISBN 0-88385-314-0 

List: $24.00 MAA Member: $18.00 
Catalog Number DOL-10 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-(800) 331-1622 Fax (202) 265-2384 


Foreign Orders Please add $3.00 per item ordered to 
cover postage and handling fees. The order will be 
sent via surface mail. If you want your order sent by air,
we will be happy to send you a pro forma invoice for
your order. 


Total $
Payment: [ ] Check  [ ] VISA  [ ] MASTERCARD


No matter how you express it, it still means
DERIVE is half price.


The DERIVE, A Mathema-
tical Assistant program lets
you express yourself symbol- 
ically, numerically and 
graphically, from algebra 
through calculus, with vec- 
tors and matrices too—all 
displayed with accepted 
math notation, or 2D and 3D 
plotting. DERIVE is also easy 
to use and easy to read, 
thanks to a friendly, menu- 
driven interface and split or 


DERIVE is a 
registered trademark of 
Soft Warehouse, Inc. 


overlay windows that can 
display both algebra and 
plotting simultaneously. 
Better still, DERIVE has been 
praised for the accuracy and 
exactness of its solutions. 
But, best of all the suggested 
retail price is now only $125. 
Which means DERIVE is now 
half price, no matter how you 
express it.


Soft Warehouse
Honolulu, Hawaii


System requirements 


DERIVE: MS-DOS 2.1 or 
later, 512K RAM, and one 3½"
disk drive. Suggested retail 
price now $125 (Half off!). 


DERIVE ROM card: 

Hewlett Packard 95LX & 
100LX Palmtop, or other PC 
compatible ROM card 
computer. Suggested retail 
price now $125! 


DERIVE XM 

(eXtended Memory): 

386 or 486 PC compatible 
with at least 2MB of extended
memory. Suggested list
price now $250! 


Soft Warehouse, Inc. • 3660 Waialae Ave.
Ste 304 • Honolulu, HI, USA 96816-3236
Ph. (808) 734-5801 • Fax (808) 735-1105


Winner of the MAA 
Book Prize 


The Mathematics of Games 


and Gambling 


Edward Packel 


The whole book is written with great urbanity and 
clarity... It is hard to see how it could have been
better or more readable. 

—The Mathematical Gazette 


You can’t lose with this MAA Book Prize 
winner if you want to see how mathematics 
can be used to analyze games of chance and 
skill. Roulette, craps, blackjack, backgam- 
mon, poker, bridge, state lotteries and horse 
races are considered here in a way that 
reveals their mathematical aspects. The tools 
used include probability, expectation, and 
game theory. No prerequisites are needed 
beyond high school algebra. This book is an
excellent supplement to a course on prob- 
ability. 

No book can guarantee good luck, but this 
book will show you what determines the best 
bet in a game of chance or the optimal 
strategy in a strategic game. Besides being a
good supplement in a course on probability 
and good bedside reading, this book's treatment
of lotteries should save the reader
some money.


141 pp., Paperbound, 1981 
ISBN 0-88385-628-X 

List: $16.00 MAA Member: $13.00 
Catalog Number NML-28 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-(800) 331-1622 Fax (202) 265-2384 


Exploring Mathematics


With Your Computer 


Arthur Engel 


A must for academic libraries supporting an under- 
graduate major in mathematics. Public libraries with 
strong science collections should have this book as a 
resource for traditional mathematics topics and as a 
recreation for the mathematically inclined. Capable 
high school students would also benefit. 

—CHOICE 


Today’s personal computer gives its owner tre- 
mendous power which can be used for experi- 
mental investigations and simulations of un- 
precedented scope, leading to mini-research. 
This book is a first step into this exciting field. 


This is a mathematics book, not a programming
book, although it explains Pascal to beginners. 
It is aimed at high school students and under- 
graduates with a strong interest in mathematics 
and teachers looking for fresh ideas. It is full of 
diverse mathematical ideas requiring little back- 
ground. It includes a large number of challeng- 
ing problems, many of which illustrate how 
numerical computation leads to conjectures 
which can then be proved by mathematical 
reasoning. 


You will find 65 interesting and substantial 
mathematical topics in this book, and over 360 
problems. Each topic is illustrated with examples 
and corresponding programs. The major goal 
of the book is to use the computer to collect data 
and formulate conjectures suggested by the data. 
It is assumed that readers have a PC at their
disposal. 


264 pp., Paperbound, 1993 

ISBN 0-88385-636-0 

List: $38.00 MAA Member: $26.50 
Catalog Number NML-35 


A 3.5" IBM-compatible disk containing the Pas- 
cal programs described in the book is packaged 
with this volume. 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, DC 20036 

1-800-331-1622 Fax (202) 265-2384 


Foreign Orders Please add $3.00 per item ordered to 
cover postage and handling fees. The order will be 
sent via surface mail. If you want your order sent by air,
we will be happy to send you a proforma invoice for 
your order. 


Student Research Projects


in Calculus 


Marcus Cohen, Edward D. Gaughan, Arthur Knoebel, 


Douglas S. Kurtz, and David Pengelley 


It is yet another workable way to rejuvenate college
calculus. The monograph is highly recommended to 
those interested in experimenting with mathematics
curricula. —AAAS, Science Books & Films 


I found the book very readable and thought provoking. 
Whether your students are in engineering, pure or 
applied science, or even liberal arts, this book may 
change the way you teach—and the way they learn— 
calculus for the better! —The Mathematics Teacher 


An important contribution to the growing list of new 
curricular materials becoming available to assist in the
teaching of calculus. —CHOICE 


Changing the way students learn calculus was 
the goal of the authors of this excellent guide- 
book. In the Spring of 1988, they began work on 
a student project approach to calculus. 


You can use their methods in teaching your own 
calculus courses. Over 100 projects are pre- 
sented, all of them ready to assign to your stu- 
dents in single and multivariable calculus. The 
projects were designed with one goal in mind: to 
get students to think for themselves. Each project 
is a multistep, take home problem, allowing 
students to work both individually and in groups. 


Each project has accompanying notes to the 
instructor reporting students’ experiences. The 
notes contain information on prerequisites, list 
the main topics the project explores, and sug- 
gest helpful hints. The authors have also pro- 
vided several introductory chapters to help in- 
structors use projects successfully in their classes 
and begin to create their own. 


232 pp., Paperbound, 1992 
ISBN 0-88385-503-8

List: $25.50 MAA Member: $18.00 
Catalog Number SRPC 


Discover Scientific Word


Scientific Word, a Windows™-based front-end to LaTeX, is
easy to learn and easy to use. It
is a full document processor, not just an equation editor.
You enter text and mathematics on a continuous screen
without the distraction of popping in and out of equation
boxes. With Scientific Word you use familiar mathematical
notation to enter your mathematics - you need no
special codes. Scientific Word adheres to internationally
accepted mathematical formatting standards, so you are
free to deal with the content of your document rather than
its appearance. Your document is saved as an ASCII
LaTeX file and printed output is produced via TeX,
the mathematical typesetting standard.


Call toll free 1-800-874-2383 to order your copy today. 
Ask about our 30-day money-back guarantee and our 
educational discount. 


1190 FOSTER ROAD 
LAS CRUCES, NM 88001 


TEL: (505) 522-4600 
FAX: (505) 522-0116 


800-874-2383 


CALL TODAY FOR MORE INFORMATION! TCI SOFTWARE RESEARCH sales@tcisoft.com


SCIENTIFIC WORD IS A REGISTERED TRADEMARK OF TCI SOFTWARE RESEARCH. 
TEX IS A TRADEMARK OF THE AMERICAN MATHEMATICAL SOCIETY.
WINDOWS IS A TRADEMARK OF MICROSOFT. 


The American 
Mathematical Monthly 


Volume 101 Number 6 / JUNE-JULY 1994 
(ISSN 0002-9890) 


Contents 


ARTICLES 


Juggling Drops and Descents / JOE BUHLER, DAVID EISENBUD, 
RON GRAHAM, and COLIN WRIGHT 507 


Teaching Integration by Substitution / DAVID GALE 520 


Workable Gears, Archimedian Solids and Planar Bipartite Graphs / 
GARY GORDON 527


On the Kummer Solutions of the Hypergeometric Equation / 
REESE T. PROSSER 535


Reflections on a Mira / JOHN W. EMERT, KAY I. MEEKS, 
and ROGER B. NELSON 544 


Buffon Noodles / ED WAYMIRE 550


FEATURES 


COMMENTS 506
PICTURE PUZZLE 559 


NOTES 
On a Curious Property of Counting Sequences / VICTOR BRONSTEIN 
and AVIEZRI S. FRAENKEL 560 
Chaos Without Nonperiodicity / CARSTEN KNUDSEN 563 
A Reverse Stolarsky’s Inequality / JOSIP PECARIC 565 
A Note on Some Irrational Decimal Fractions / 
A. MCD. MERCER 567


THE AUTHORS 569 


UNSOLVED PROBLEMS 
A Possible Permanent Formula / DAVID CALLAN 571 


PROBLEMS AND SOLUTIONS 574 


REVIEWS 
Ideals, Varieties, and Algorithms. By David Cox, John Little, 
and Donal O’Shea / MOSS SWEEDLER 582


TELEGRAPHIC REVIEWS 587


THE MATHEMATICAL ASSOCIATION OF AMERICA 
1529 Eighteenth Street, N.W.




AN OFFICIAL PUBLICATION OF THE MATHEMATICAL ASSOCIATION OF AMERICA 


NOTICE TO AUTHORS 


The Monthly publishes articles, notes, and other fea- 
tures about mathematics and the profession. The 
readership of the Monthly is intended to include ev- 
erybody who is mathematically inclined, including of 
course professional mathematicians and students of 
mathematics at all collegiate levels. While no single 
article or feature is likely to appeal to everyone, mate- 
rial should interest and be accessible to a large num- 
ber of readers. This is the most important criterion for 
acceptance. 


Articles may be expositions of old results or presenta- 
tions of new ones. They may concern all of mathe- 
matics or one small area, a broad development or a 
single application, historical reminiscences or one 
important event. While some articles may contain the 
author’s new research, the novelty of material and 
generality of the results is far less important than the 
clarity of exposition and general interest. Discussing 
one illuminating case of a well known result is far 
better than providing all the details of an obscure but 
new proposition. Articles in the Monthly are sup- 
posed to inform and to entertain; they are meant to 
be read rather than archived. 


Notes are short and possibly informal articles. A note 
may concern a clever new proof of an old theorem, a 
novel way to present tired material, or a lively discus- 
sion of a philosophical (but still mathematical) issue. 
Also, any topic is suitable, so long as it is related to 
mathematics. Because a note is short, the first few 
sentences are the most important part: They should 
explain the purpose and invite the reader in. Pho- 
tographs or diagrams often will attract the reader’s 
attention. 


All articles and notes should be sent to the editor: 


JOHN EWING 

Department of Mathematics 
Indiana University 
Bloomington, IN 47405 


Please send 3 copies, typewritten on only one side of 
the paper. Illustrations should be carefully drawn on 
separate sheets of paper in black ink; the original 
should be without lettering and two copies should 
have appropriate captions and lettering indicated. 


Proposed problems or solutions should be sent to: 


RICHARD BUMBY, 
P.O. Box 10971 
New Brunswick, NJ 08906-0971. 


Please send 2 copies of all material, typewritten if 
possible. 


Letters to the Editor, both for publication and for 
private reading, should be sent to the Editor at the 
address given above. Comments, including criti- 
cisms, are welcome, as are all suggestions for mak- 
ing the Monthly a lively, entertaining, and informative 
journal. 


EDITOR: 
JOHN H. EWING 


ASSOCIATE EDITORS: 


PETER BORWEIN
RICHARD BUMBY
DENNIS DETURCK
UNDERWOOD DUDLEY
JOHN DUNCAN
JOAN FERRINI-MUNDY
JOSEPH GALLIAN
STEVEN GALOVICH
RICHARD GUY
DARRELL HAILE
PAUL HALMOS
JOAN HUTCHINSON
FRED KOCHMAN
CATHERINE MCGEOCH
RICHARD NOWAKOWSKI
ARNOLD OSTEBEE
LEE RUBEL
ABE SHENITZER
LYNN STEEN
STAN WAGON
DOUGLAS WEST
HERBERT WILF
SANDY ZABELL
PAUL ZORN


EDITORIAL ASSISTANT: 
MISTY CUMMINGS 


STAFF ARTIST: 
MIKE CAGLE 


Reprint permission: 
MARCIA P. SWARD, Executive Director 


Advertising Correspondence: 
Ms. ELAINE PEDREIRA, Advertising Manager 


Subscription correspondence, change of address, 
and other inquiries: 
Membership / Subscriptions Department 


All at the address: 


The Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, DC 20036. 


Microfilm Editions: University Microfilms International, 
Serial Bid coordinator, 300 North Zeeb Road, Ann 
Arbor, MI 48106. 


The AMERICAN MATHEMATICAL MONTHLY (ISSN 
0002-9890) is published monthly except bimonthly 
June-July and August-September by the Mathemati- 
cal Association of America at 1529 Eighteenth Street, 
N.W., Washington, DC 20036 and Montpelier, VT. 
Copyrighted by the Mathematical Association of 
America (Incorporated), 1994, including rights to this 
journal issue as a whole and, except where otherwise 
noted, rights to each individual contribution. General 
permission is granted to Institutional Members of the 
MAA for noncommercial reproduction in limited quan- 
tities of individual articles (in whole or in part) pro- 
vided a complete reference is made to the source. 
Second class postage paid at Washington, DC, and 
additional mailing offices. Postmaster: Send address 
changes to the American Mathematical Monthly, 
Membership / Subscription Department, MAA, 1529 
Eighteenth Street, N.W., Washington, DC, 20036- 
1385. 


The American 
Mathematical Monthly 


Volume 101 Number 7 / AUGUST-SEPTEMBER 1994
(ISSN 0002-9890) 


Contents 


ARTICLES 


A Tale of Two CD’s / DAN KENNEDY 603 
Three Problems in Search of a Measure / JONATHAN L. KING 609 


The n-Queens Problem / IGOR RIVIN, ILAN VARDI, and 
PAUL ZIMMERMANN 629 


What’s the Difference Between Cantor Sets? / ROGER L. KRAFT 640 


Morphisms, Squarefree Strings, and the Tower of Hanoi Puzzle / 
JEAN-PAUL ALLOUCHE, DAN ASTOORIAN, JIM RANDALL, and 
JEFFREY SHALLIT 651 


FEATURES 


COMMENTS 602 


NOTES 
Sierpinski’s Theorem Is Deducible from Euler and Dirichlet / 
A. A. AGEEV 659
On Nonnegativity of Symmetric Polynomials / F. MATUS 661 
New Tricks for Old Trees: Maps and the Pigeonhole Principle / 
N. GRAHAM, R. C. ENTRINGER, and L. A. SZEKELY 664


THE COMPUTER SCIENCE SAMPLER 
Do You Know the Way to Vertex A? / JEFF ONDICH 668


THE EVOLUTION OF... 
On the Calculus of Variations and Its Major Influences 
on the Mathematics of the First Half of Our Century. Part I. /
ERWIN KREYSZIG 674


THE AUTHORS 679 
PROBLEMS AND SOLUTIONS 681 


REVIEWS 
How to Teach Mathematics. By Steven G. Krantz /
MEYER JERISON 692 


TELEGRAPHIC REVIEWS 695 


COMMENTS 


Teaching reform. My shelf is full of books and pamphlets produced by committees 
over the past 10 years. The titles are clever and catchy — about counting and 
assessing and measuring up. The presentations are slick (and expensive). The themes 
are uniformly gloomy, predicting disastrous consequences if we don't take action 
quickly to set things right. These pamphlets preach to politicians, not to 
mathematicians. They preach to the converted. Mostly, that's what committees do. 

I am therefore delighted to find two wonderful books that directly or 
indirectly come from committees. The most recent is 

You're the Professor, What Next?
edited by Bettye Anne Case and published by the MAA. The other is 
How to Teach Mathematics: A Personal Perspective 
written by Steve Krantz and published by the AMS. 

WHAT NEXT looks like a book written by a committee. It has a bit of 
everything — committee reports, survey results, case studies, solicited articles, and 
lists of resources. Nearly half the volume consists of appendices filled with reprinted 
short papers on career, social, and educational issues. Many of these may look 
familiar; they still make fascinating reading for mathematicians both young and old 
at every kind of institution. The theme of the book is simple: Young mathematicians 
ought to receive some organized professional training in graduate school. We ought 
to teach mathematicians how to be mathematicians. The book gives advice but no 
simple recipe for doing this (except to present ideas for professional seminars). Its 
vagueness is its strongest virtue. It is a book of ideas, not proclamations. 

How to Teach was written as the result of a committee on which the author
served. (A review by Meyer Jerison appears in this issue of the MONTHLY.) It's
organized into three sections, Guiding Principles (sample: Respect, Attitude, Math 
Anxiety, Computers), Practical Matters (sample: Voice, Eye Contact, Homework, 
Office Hours, Grading), and Sticky Wickets (sample: Non-Native English Speakers, 
Cheating, Discipline, Sexism). It's full of common sense advice about teaching in the 
real world — the kind of advice a senior colleague might communicate to a new 
instructor on the way to lunch or in the lounge over coffee. "Common sense", of 
course, does not mean "obvious" nor even "correct". But it's advice based on 
experience, not abstractions, and even when one disagrees with the author it's hard 
not to sympathize with his goals. 

These books contain strong opinions and plenty of advice — but they do not 
preach disaster. Young mathematicians will profit from reading both from cover to 
cover. Older folks will profit even more because both books comment on everyday 
experiences we all share, and make us think harder about our profession. 

Will you agree with everything in these books? Almost surely not. Will you 
be annoyed by some of the advice? Quite likely. Will they make you think about 
being a mathematician? That's guaranteed. 

These books were written for mathematicians, not for politicians. 


John Ewing 


A Tale of Two CD’s


Dan Kennedy 


These are indeed exciting times in the world of Mathematics. Stirred by the 
NCTM Standards, the winds of change are blowing through every level of the K-12 
curriculum. At the same time, in colleges and universities the finest minds of our 
profession are turning their attention to the forging of a Calculus for a New 
Century. Earlier this year, we even saw the unexpected verification of Fermat’s 
Last Theorem, something which surely none of us thought we would see in this 
lifetime. There is so much to talk about when mathematicians get together these 
days, yet, because we are teachers, we have so little time for talking. That is why I 
have chosen to write this article about none of these things. 

Instead, I would like to write about records. 

I have collected records since I was in grade school. By the time I was in 
graduate school, I owned more than two thousand 45’s, three hundred albums, and 
miscellaneous 78’s and EP’s. I shared my hobby with friends, including some 
memorable years as music director and station manager of the college radio station 
at Holy Cross. I was one of those guys who could, upon hearing a golden oldie on 
the radio, quote the title, artist, and year of the song, and quite often the label and 
songwriter as well. My interest in the music of the moment naturally declined 
about the time disco music became popular, but by that time I had accumulated 
enough vinyl classics to keep myself and my party guests entertained forever. For 
example, one of my favorite ways to pass an evening with friends is to stage a 
“nostalgia playoff” between two guests, playing alternately the hit songs from their
respective high school graduation years, until the quality of one year’s hits is 
clearly unable to keep up with the quality of the other’s. (In case you are 
interested, no guest has ever gone up against a graduate of 1957 without conceding 
after the 15th round or so.) 

As a student of the recording industry, I would always sit up and take notice 
when some new product emerged which the prophets predicted would change the 
way people listened to music. The first big pretender to the vinyl throne was the 
8-track tape. “The 8-track tape,” they predicted, “will redefine the recording 
industry.” It required no threading onto a spool, it did not scratch or shatter, it did 
not collect dust, it required no needle, it produced high fidelity sound for multiple 
speakers, and you could play it in your car! I was momentarily impressed, but I 
continued to buy records, and so did apparently a lot of other people. Today if you 
want to buy an 8-track tape you have to go to an antique show. 

Then came the cassette tape. “The cassette tape,” they predicted, “will redefine
the recording industry.” It was smaller than the clumsy 8-track, but it had all its 
same advantages, including that of being playable in your automobile. Moreover, 
you could actually stick a few into your glove compartment. You could also buy a 
“portable tape player” which would play your cassettes on an arbitrary street
corner at an arbitrary volume level. I myself continued to buy records, although I 
did eventually buy a cassette recorder so that I could tape my records at home for
playing later in my car. Although record stores eventually began selling albums on 
cassette, it was usually from a shelf toward the back of the store. Vinyl was still 
king. 

Next came the laser disc. “The laser disc,” they predicted, “will redefine the
recording industry.” The laser disc had the music encoded digitally, promising 
virtually perfect fidelity forever. It also would not scratch, smudge, or collect dust, 
and in place of the old diamond needle, which everyone always suspected would be 
fatal to plastic records eventually, there was a neat, powerful laser beam to lift the 
music off of the disc as cleanly as Scottie might beam up Captain Kirk. This was 
dazzling technology indeed; unfortunately, you could not play a laser disc in your 
car. For one thing, each disc was the size of a medium pizza; for another thing, you 
had to sell your car in order to buy a laser disc player, which cost several thousand
dollars. Needless to say, Hugh Hefner bought one for every room of his mansion,
while the rest of us just kept buying records. 

Then along came the compact disc. “The compact disc,” they predicted, “will
redefine the recording industry.” And in an incredibly short period of time, it did. 

Walk into a record store today and it probably won’t even be called a record 
store. The bins that once stored the vinyl now store row upon row of CD’s. Oh, the 
biggest stores will still carry a few records, but you often have to walk past the 
cassette section in order to find them. If you want a real adventure, try replacing 
your old phonograph needle! The recording industry has been completely taken 
over by the compact disc. 

Why did the CD succeed where the other technologies had failed? Simply put, it 
was such a perfect idea that nobody who loved music could resist it. The sound was 
virtually perfect; the discs themselves were rugged; the players were affordable; 
and these things were, as their name implied, compact—you could fit dozens of 
them into a shoebox and carry them to the home of your friend. There, you would 
almost assuredly find another CD player. Everyone could share in the miracle. 

Of course, for a while I held out. After all, I had this enormous investment in
records, not to mention the means for playing them. But I would go to the homes of
my friends and hear the crystalline strains of CD music, and I would be jealous 
of that incredible sound. Finally, I realized that it was not the records that I liked; 
it was the music, and the music could be heard better on CD’s. I bought myself a 
player and began collecting compact discs. Most of my first compact disc purchases 
were actually albums that I already owned on vinyl, but I bought them so that I 
could rediscover them on a new level. Now I hardly ever play my records unless I 
am hosting a graduation year playoff. I still own them, but they are doing 
something that records do unfortunately well: They are gathering dust. 

I have lived long enough to see the very essence of my lifelong hobby redefined. 

But records are only my hobby; my profession is teaching mathematics. I 
suppose I became interested in mathematics at about the same time I started 
collecting records. I was in seventh grade when the Russians launched Sputnik, 
thereby kicking off some interesting times for my mathematics teachers. My high 
school courses were taught out of paperback textbooks authored by the School 
Mathematics Study Group, code SMSG, whose approach, they predicted, would 
redefine the way we taught and learned mathematics. 

That was the New Math, and it lasted long enough to develop a reputation bad 
enough to spawn the Back to Basics movement. “The Back to Basics Movement,”
they predicted, “will redefine the way we teach and learn mathematics.” 
Then suddenly we all got distracted by computers. Computers were doing well
in the stock market, and were pretty much running most companies and the 
government, so there was a strong feeling that we should all find out how they 
work. What most people discovered was that they worked by mathematics, which 
was enough for most people, because it meant that mathematics teachers could 
henceforth be held responsible for explaining the remaining details to their 
children. Computers were installed in many schools, because, they predicted, 
“computers will redefine the way we teach and learn mathematics.” 

I was part of a three-man committee in 1978 that persuaded our Board of 
Trustees to sink 100 thousand dollars into a computer system that featured a Data 
General Nova 830 with 32K RAM, seven CRT terminals, and a teletype printer. 
Five years later we were back before them, hats in hand, pleading for another 100 
thousand dollars to upgrade to a Hewlett-Packard 8000/30, plus ten more termi- 
nals and a sensible printer. Five years later we all but abandoned the HP and built 
a computer lab with shiny new Apple II’s, at a cost of another 100 thousand 
dollars. Now we have an entire multimedia lab, fully stocked with Macintoshes, 
laser printers, scanners, CD-ROMs, and networking hardware, while our gleeful 
supplier has another 100 thousand of our school dollars. The amazing thing about 
this buying frenzy is that each time we pleaded our case with the Trustees, we 
assured them that computers would redefine the way we taught and learned mathe- 
matics. Incredibly, they fell for it every time. 

But the sad truth of the matter was that we were still teaching and learning 
mathematics the way we had been doing it for decades. For all its marvelous 
capabilities, the computer was not changing the way that we taught and learned 
mathematics, and it was costing our school approximately 100 thousand dollars 
every five years to prove to ourselves that this was so. 

Meanwhile, pocket scientific calculators had quietly appeared on the scene and 
had been welcomed in most mathematics classes. They did change a few things, 
but at such a mechanical level that people hardly noticed. Trig tables and log 
tables died a hasty and largely unmourned death, and now everyone had an equal 
chance at finding the purchase price of 4 CD’s at $16.95 apiece, after a 10%
discount and a 7.5% sales tax have been figured in. To most teachers, this was the 
sort of inconsequential drudgery that machines were supposed to do. Indeed, such 
was the inflexibility of the mathematics curriculum that these machines were 
welcomed precisely because they made so little difference in what we taught and 
learned. Significantly, the only place where they were at all controversial was at the 
elementary level, where people were nervous about children losing the ability to 
multiply and divide on paper. In any event, pocket calculators did not redefine the 
way we taught and learned mathematics. 
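
The calculator exercise mentioned above can be written out as a minimal sketch. It assumes the 10% discount is applied to the subtotal first and the 7.5% sales tax is then charged on the discounted amount; the function name `purchase_price` and the half-up rounding to the nearest cent are likewise assumptions made for illustration.

```python
from decimal import Decimal, ROUND_HALF_UP


def purchase_price(unit_price, quantity, discount_rate, tax_rate):
    """Price after a discount, then sales tax, rounded to the nearest cent."""
    subtotal = Decimal(unit_price) * quantity
    # Assumption: the discount comes off before the tax is computed.
    discounted = subtotal * (1 - Decimal(discount_rate))
    total = discounted * (1 + Decimal(tax_rate))
    return total.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


total = purchase_price("16.95", 4, "0.10", "0.075")
print(total)  # 65.60
```

Step by step: 4 × 16.95 = 67.80, less 10% is 61.02, and adding 7.5% tax gives 65.5965, or $65.60.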

By now you have probably figured out my little parable. SMSG and Back to 
Basics may have shifted things a little bit, but only in terms of decades, while the 
inflexibility of the mathematics curriculum must be measured on a geological scale. 
New points of emphasis come and go periodically, denting the curriculum monolith 
with the same approximate impact as that of the 8-track tape on the recording 
industry. 

Scientific calculators were nice, and they even got into the classrooms, but they 
were the cassettes of our profession. Sure, everyone has one, but cassettes never 
replaced the records, which were still the way that serious people played their 
music, and scientific calculators never replaced factoring, which was what students 
really needed to know if they wanted to succeed in serious mathematics. 
Computers took us to the laser disc stage, and failed to redefine the way we
taught and learned mathematics for the same reasons that laser discs failed to 
redefine the recording industry. It had nothing to do with the capabilities of the 
technology, and everything to do with the mood of the market place. No machine 
can inspire a revolution if Hugh Hefner is the only one who owns one. 

Of course, there is another chapter in this story. We are still writing that 
chapter as I write these words, but I firmly believe that you and I have lived long 
enough to see the compact disc of our profession: an instrument which is so 
perfectly suited to what we do, that it is in the process of redefining the way we 
teach and learn mathematics. We call this wonderful machine a graphing calcula- 
tor, but it is in fact a computer, with ironically the same computing power in 
kilobytes as that first computer system we bought at my school fifteen years and 
400 thousand dollars ago. With this machine my students can do far more than 
compute; they can conjecture, they can model, and they can make connections—the 
very things that I want to teach them to do. Moreover, they own this technology; 
they do not have to go over to a rich friend’s home or to a special room at school 
to use it. Nor do I even have to tell them how or when to use it. Like the compact 
disc, it has become a part of their lives. 

Oh sure, for a while I tried to treat this technology the way I treated the 
cassettes in the record store. I bought one and used it, but I never thought it would 
redefine my profession. After all, I had twenty years of my life invested in the math 
curriculum monolith, and I had become pretty successful at teaching the tradi- 
tional courses in some embarrassingly traditional ways. But I was open to change, 
and I had read the Standards, so I began to chip away at my preconceptions of 
what and how I had to teach. The first thing I did was to let them use their 
graphing calculators all the time. The next thing I did was to start every class with 
a problem, which the students would talk out until a solution emerged that they 
could explain to each other. What I discovered, of course, was how useless my 
crisp set of lecture notes had been all these years. The students were discovering 
the results without me, and then showing each other how to solve the problems. That 
left me free to walk around and answer questions. Every so often I still tie things 
together or generalize, but for the most part I let the course evolve through what 
the students are doing, and I provide the direction by the problems I select. I’m 
still not sure what the heck I’m doing, but I do know this: There is more 
mathematics going on in my classroom these days than there ever has been before. 
Now there are more people doing it. 

Was the graphing calculator responsible for transforming my entire approach to 
teaching? Well, yes and no. What the graphing calculator did was get me to 
question what I had been doing for twenty years. It also got me focused on how I 
would get the students using it, which in turn got me focused on student learning 
rather than my own teaching. Eventually, after sacrificing the first few sacred cows, 
I acquired a taste for sacred beef and the rest was easy. And believe me, that same 
thing is going on in mathematics classrooms all across the country, in a movement 
that is growing exponentially. 

When I was young, all of my radical friends were in reform school. Today, all of 
my radical friends are in school reform. It is a crazy, free-wheeling time, not unlike 
the political scene in eastern Europe, and it’s really pretty exciting—once you 
overcome the initial sensation of being totally lost in a Brave New World. Let’s 
face it, though: For years we mathematicians were in a rather unique position in 
the world of academia, being smugly certain that we could all teach the exact 
same curriculum in the exact same way because we knew exactly what was best for 
our students. The other subject area committees in the AP program fought among 
themselves all the time, while my colleagues and I on the Calculus committee 
nodded sagely in unison, reviewing yet another problem about a region being 
rotated about the x-axis. It was a happy, homogeneous, unrealistic world, totally 
incompatible with the spirit of creative discovery that had characterized the 
evolution of mathematics since the dawn of cognition. 

Did you ever wonder what Newton would say if he could come back today and 
watch a traditional calculus class in action—if you could call it “action”? Do you 
suppose he would be flattered to see that, 300 years after his death, we were all 
teaching his same results to our students? I don’t want to put words into the old 
Lion’s mouth, but knowing that Newton once wrote to Robert Hooke, “If I have 
seen further it is by standing on the shoulders of giants,” I dare say that he might 
pick up a graphing calculator, stare at it for several minutes with amazement, then 
say something like this: “The creators of this magic are giants. Is there nobody 
here who would stand on their shoulders?” 

It is almost axiomatic in the academic world that the most creative minds belong 
to the mathematicians. Sadly, in the world of mathematics education that renowned 
creativity has been stifled for too long. To counter the trend, I tried a few years 
ago to release my own creative Muse in an evening of unbridled mathematical 
activity. I sat down with a fresh pad of paper and a pitcher of martinis, and before 
the evening was over and the martinis were gone I had produced 75 proofs of 
never-before-seen theorems. (Actually, I underachieved. It was an 80-proof gin.) I 
also created the perfect calculus problem, and here it is: 


THE ALL-PURPOSE CALCULUS PROBLEM 
©1993 Dan Kennedy 


A particle starts at rest and moves with velocity $v(t) = \int_1^t e^{-x^2}\,dx$ along a 10-foot ladder, which leans 
against a trough with a triangular cross-section two feet wide and one foot high. 

Sand is flowing out of the trough at a constant rate of two cubic feet per hour, forming a conical pile 
in the middle of a sandbox which has been formed by cutting a square of side x from each corner of an 
8” by 15” piece of cardboard and folding up the sides. 

An observer watches the particle from a lighthouse one mile off shore, peering through a window 
shaped like a rectangle surmounted by a semicircle. 

(a) How fast is the tip of the shadow moving? 


(b) Find the volume of the solid generated when the trough is rotated about the y-axis. 
(c) Justify your answer. 


(d) Using the information found in parts (a), (b), and (c), sketch the curve on the pair of coordinate 
axes given. 


Okay, maybe I went too light on the vermouth. But here’s the sad part: Every 
teacher reading this article knew those problems by heart, because we’ve been 
teaching those same basic problems for years. That, indeed, is what calculus 
teachers have been doing for decades and decades. We have been teaching the 
math literate of tomorrow with the problems of yesterday, while explaining to them 
all the while that they will need this mathematics in the future. My friends, this is 
the stuff of which the Emperor’s New Clothes are made! While we have been 
spinning golden oldies on the phonograph, perhaps more accurately the victrola, 
the world of Mathematical Reality has gone CD. 

Today, thankfully, all of that is changing. To cite just one close-to-home 
example of that, the College Board has given final approval to the AP Committee’s 
recommendation that the AP examinations be made graphing-calculator-active 
beginning in 1995. We will also go to a new test format, splitting the multiple 
choice section into two 45-minute parts, one with 25 questions to be taken with no 
calculator, the other with 15 questions, some requiring a graphing calculator. The 
free response section will remain the same, 6 questions in 90 minutes, and will be 
designed with graphing calculators in mind. The extent to which calculators will be 
required will be slight at first, but may increase over time as emphases in the 
curriculum change. 

For the next year or so, the Committee will be buried under test development 
details resulting from our decision. After that we will emerge to begin a careful 
evaluation of the calculus curriculum, from a to z. Or, if you like, from α to ω. I 
recommend, however, that every teacher in the AP program get started without us, 
just as you started without us with graphing calculators. Don’t worry about whether 
you are doing the same thing in your classroom as I am doing in mine; we have all 
worried about that for too long. Just worry about whether your students are 
learning calculus and enjoying it. If that’s the only thing that our classrooms have 
in common, then our students are still much better off than when we had 
everything else in common at the expense of that. 

In the next few years, the vibrations from all these education reform movements 
will reach a crescendo, and from it all will emerge a new paradigm for teaching 
and learning. I have seen enough to realize that it will not be confined only to 
mathematics; nonetheless, it is apparent to any observer that mathematics is 
leading the way. If Newton does come back to visit us, I hope he waits a few years 
so that all of this will have had a chance to develop. Then, when he asks to see the 
Calculus for the Twenty-first Century, we can show him something which Newton 
would truly appreciate: an entire Renaissance Curriculum. 

And I’ll bet we get to show it to him on a CD, hooked up to our graphing 
calculator. 


The Baylor School 

Box 1337 

Chattanooga, TN 37401 
DKennedy@UTCVM.Edu 


An analyst, surname of Nero 
Is my mathematical hero. 
Says he, “When in doubt, 

I always start out, 

‘Given ε > 0...’” 


James R. Martino 
Department of Mathematics 
The Johns Hopkins University 
Baltimore, MD 21218 
Three Problems in Search of a Measure 


Jonathan L. King 


§P. PREFACE. What do the following three problems have in common? 


Poncelet’s Theorem. The lefthand figure below shows a pair of ellipses C and E 
which happen to have a circuminscribed polygon; a polygon which is simultaneously 
inscribed in the outer ellipse and circumscribed about the inner ellipse. 


(a) (b) 


Figure P.1. On the left are two non-degenerate ellipses, with a quadrilateral (solid line) inscribed in C 
and circumscribed about E. The dotted-line shows another circuminscribed quadrilateral, which can be 
thought of as the first quadrilateral “rotated” about E. The right hand figure shows a pair of ellipses 
with a (self-intersecting) circuminscribed pentagon. 


Jean-Victor Poncelet’s famous CLOSURE THEOREM, published in his Traité des 
propriétés projectives of 1822, asserts that if there exists one circuminscribed n-gon 
then any point on the boundary of C is the vertex of some circuminscribed n-gon. 
Indeed, if we allow the n-gon to continuously change its shape, a circuminscribed 
n-gon can be continuously rotated around E. 

How might this theorem be generalized to the case where the CE pair has no 
circuminscribed polygon? 


Tarski’s Plank Problem. Consider a circular table with diameter 9 feet. At your 
disposal are many planks, each 1 foot wide and longer than 9 feet. What is the 
minimum number of planks required to cover the surface of the table? Nine 
parallel planks certainly cover the tabletop. But can you take fewer and criss-cross 
them to cover the tabletop? If countably many planks of different widths are 
permitted, is there a cover using less total width than the parallel one? 
Figure P.2. Thin planks partially covering a round tabletop, in criss-cross fashion. The planks can be 
thought of as being infinitely long. 


Gelfand’s Question. In the table below, row n has the leftmost (high-order) digit 
of the numbers $2^n, 3^n, \ldots, 9^n$, when written in base ten. The “7” in the third row, 
column $9^n$, is the 7 from $729 = 9^3$. 


TABLE P.3 The leftmost digits of powers. 
  n    2^n  3^n  4^n  5^n  6^n  7^n  8^n  9^n

  1     2    3    4    5    6    7    8    9
  2     4    9    1    2    3    4    6    8
  3     8    2    6    1    2    3    5    7
  4     1    8    2    6    1    2    4    6
  5     3    2    1    3    7    1    3    5
  6     6    7    4    1    4    1    2    5
  7     1    2    1    7    2    8    2    4
  8     2    6    6    3    1    5    1    4
  9     5    1    2    1    1    4    1    3


Will a “9” ever occur in the 2-column? Will the row “23456789” appear again? If 
so, will the set of n such that row$_n$ = “23456789” have a frequency, and if it 
does, will it be rational or irrational? Will a row of all-the-same-digit occur? Will 
the decimal expansion for an 8-digit prime ever appear? 
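
Readers who wish to experiment with these questions can regenerate rows of the table by machine. The Python sketch below is not part of the original article; the names `leading_digit` and `table_row` are illustrative choices. It uses exact integer arithmetic, so no rounding can disturb the leading digit.

```python
def leading_digit(k: int) -> int:
    """Return the leftmost base-ten digit of a positive integer k."""
    return int(str(k)[0])


def table_row(n: int) -> list[int]:
    """Row n of the table: leading digits of 2^n, 3^n, ..., 9^n."""
    return [leading_digit(m ** n) for m in range(2, 10)]


if __name__ == "__main__":
    # Print the first few rows, one row per line, preceded by n.
    for n in range(1, 10):
        print(n, *table_row(n))
```

Searching a long range of n for a repeated row “23456789”, or for a “9” in the 2-column, takes only a loop over `table_row`.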


Philosophy. At first glance the three problems seem to have little in common; 
Poncelet’s theorem is a question about conic sections, the Plank problem about 
geometric set-inclusion, and Gelfand’s question is number theoretic. Yet it turns 
out that two of these three are secretly isomorphic. 

More significantly, they have a less precise but deeper commonality in that each 
of the three problems has, sitting somewhere inside it, a natural measure—a 
measure which, on some collection of sets, is preserved under some family of 
motions. This basic tool of an invariant measure is the theme of this article. It will 
turn out that the associated invariant measures make the hidden isomorphism 
conspicuous. 


Anatomy. The next three sections construct in turn natural invariant measures for 
Poncelet’s Theorem, the Plank Problem and Gelfand’s Question, and can be read 
essentially independently. All three measures are finite measures which can be 
interpreted as lengths, areas or volumes. Consequently, the reader need only have 
an intuitive feel for the size of a set; no theorems of formal measure theory¹ are 
used. The hope is that the article can be read by motivated undergraduates who 
are comfortable with integration and elementary topology. 

The APPENDIX contains a brief history of each problem or a pointer to such. It 
alludes to the connection with ergodic theory, and ends with an open problem. 


Idiosyncrasy. Use “a := b” to mean “a is defined to be b”. We use the usual 
symbol, “$B^c$”, for the complement of a set B; let “$A \setminus B$” denote the set-difference 
$A \cap B^c$. Symbol $\bigsqcup_n B_n$ indicates that the sets $\{B_n\}_n$ in the union happen to 
be disjoint. 

If $\lambda$ is a measure such as “length” on a space Y, then a “$\lambda$-nullset” $B \subset Y$ has 
zero length, $\lambda(B) = 0$. An example is a set B consisting of finitely many points. 
The pair $(Y, \lambda)$ is called a measure space. A map $\varphi\colon X \to Y$ between two measure 
spaces $(X, \mu)$ and $(Y, \lambda)$ is measure-preserving if 


$$\mu(\varphi^{-1}B) = \lambda(B), \quad \text{for all subsets } B \text{ of } Y.$$


A measure-preserving map $R\colon X \to X$ from a space to itself is called a transformation, 
which we write in full as $(R\colon X, \mu)$. Finally, for a point $z \in X$ let $R^n(z)$ 
denote the n-fold composition $R(R(\cdots R(z)\cdots))$. The sequence of points 
$z, R(z), R^2(z), \ldots$ is the orbit of z under R. 


§1 PONCELET’S THEOREM. From any point outside of E, a “righthand tangent 
to E” can be consistently chosen. In particular, E gives rise to a “righthand 
homeomorphism,” R, mapping C to itself. 


Figure 1.1. Two ellipses, with E properly inside of C. An observer standing at z peering inward at E 
sees a “righthand” and a “lefthand” tangent to E. Call the righthand tangent map $R\colon C \to C$. 
NOTATION: From a point z outside E, let $|z, E|_L$ and $|z, E|_R$ denote the distance from z to the lefthand 
and righthand tangent points on E. In the case E is a circle, i.e. $|z, E|_L = |z, E|_R$, agree to write $|z, E|$ for 
the common value. 


¹Originally, this article contained a fourth problem, invented by David Feldman, concerning billiards 
in the cusp between the curves y = 1/x and y = −1/x. Because its solution does use elementary 
measure theory (and on an infinite measure space) as well as the idea of recurrence in a dynamical 
system, the discussion of that problem will appear separately, in [KING]. 
Viewed this way, Poncelet’s theorem becomes a statement about the dynamics of 
the map R and explains why it is called a CLOSURE THEOREM; if the orbit of one 
point z “closes up” in n steps, that is $R^n(z) = z$, then the orbit of any point closes 
up and also in n steps. 

In closing up, if the orbit of z “winds around E” p times (p = 1 and n =4 in 
Figure P.1(a); in P.1(b), p = 2 and n = 5) then every point’s orbit would have to 
wind p times before closing up, since R preserves order along C. This suggests 
that —after a suitable change of coordinates—R is simply a rigid rotation of a 
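circle by rational rotation number p/n. Let K := [0,1) be the half-open interval 
topologized as a circle, and let $\oplus$ and $\ominus$ denote addition and subtraction modulo 
1. For a rotation number $\alpha \in \mathbb{R}$, let 


$$\rho_\alpha\colon K \to K\colon x \mapsto x \oplus \alpha$$


be the corresponding rigid rotation. A “change of coordinates” would then be a 
homeomorphism $\varphi\colon C \to K$ with a commutative diagram 


$$\begin{array}{ccc} C & \xrightarrow{\;R\;} & C \\ {\scriptstyle\varphi}\downarrow & & \downarrow{\scriptstyle\varphi} \\ K & \xrightarrow{\;\rho_\alpha\;} & K \end{array} \qquad (1.2)$$


Such a $\varphi$ satisfying $\varphi \circ R = \rho_\alpha \circ \varphi$ is called a topological conjugacy from R to $\rho_\alpha$. 
Poncelet’s theorem would thus follow from 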


Lemma 1.3. For ellipses C, E and mapping R as in Figure 1.1, there is a rotation 
number $\alpha \in [0,1)$ and topological conjugacy $\varphi$ carrying R to $\rho_\alpha$. 

Indeed, this lemma asserts something even if the R-orbit of z fails to close 
up: the case when $\alpha$ is irrational. 


The appearance of a measure. Arclength measure $\lambda$ is invariant under the 
rotation $\rho_\alpha$, that is, 


$$\lambda(\rho_\alpha^{-1}(B)) = \lambda(B)$$


for any set B included in K. We normalize $\lambda$ to a probability measure, $\lambda(K) = 1$. A 
conjugacy $\varphi$ would lift $\lambda$ to an R-invariant measure $\mu$ on C, 


$$\mu(A) := \lambda(\varphi(A)), \qquad (1.4)$$


which we will call good: finite, non-atomic (any individual point has zero $\mu$-length) 
and giving positive length to open intervals. 
It is the converse which will help us out: 


Any R-invariant good $\mu$ gives rise to a topological conjugacy. 


In order to define this conjugacy, for points $z, y \in C$ let [z,y) denote the 
half-open interval on C going counterclockwise from z to y. Next, normalize $\mu$ so 
that $\mu(C) = \lambda(K) = 1$; now, for any three points z, y, x on C 


$$\mu([z,y)) \oplus \mu([y,x)) = \mu([z,x)).$$
Fix any particular point $z_0 \in C$. Define $\varphi$ and $\alpha$ by 


$$\varphi(y) := \mu([z_0, y)) \quad\text{and}\quad \alpha := \mu([z_0, Rz_0)). \qquad (1.5)$$


Thus $\varphi$ sends $z_0$ to 0 in K and is well-defined because of the normalization. 
Non-atomicity implies continuity of $\varphi$ and the “positive length” condition ensures 
that $\varphi$ is invertible. Invariance yields that for any x, 


$$\begin{aligned} \mu([x, Rx)) &= \mu([Rz_0, Rx)) \ominus \mu([Rz_0, x)) \\ &= \mu([z_0, x)) \ominus \mu([Rz_0, x)) \\ &= \mu([z_0, Rz_0)) = \alpha. \end{aligned}$$


In other words, the rotation number $\alpha$ did not truly depend on the arbitrary point 
$z_0$, but only on the R-invariant measure $\mu$. With this, it is easy to verify that $\varphi$ 
carries R to $\rho_\alpha$, as in (1.2). 

Our previous lemma can be restated now like this. 


Lemma 1.3′. For ellipses C and E, with R as in Figure 1.1: There exists an 
R-invariant good measure $\mu$. 


When C and E are concentric circles, arclength measure along C is R-invariant. 
However, the case where they are non-concentric circles seems not as apparent, 
and the general elliptical case is less evident still. Happily, we can at least assume 
that the outer ellipse C is a circle—since any linear map of the plane carries 
ellipses to ellipses and tangent chords to tangent chords and so carries Figure 1.1 
to another just like it. So before even starting to construct $\mu$, one can linearly 
compress the figure along the major axis of C to arrange that now C is a circle. 


Rolling the chord. The righthand map, R, generally stretches or shrinks sub-intervals 
$I \subset C$, thus making the standard arclength non-invariant. A direct approach to 
making an R-invariant “length” $\mu$ is to compensate for stretch/shrink by integrating 
against arclength an appropriately chosen “height function” h whose height 
varies so as to cancel out the distortion introduced by R. Then the “$\mu$-length” of a 
set A would be 


$$\mu(A) = \mu_h(A) := \int_A h(z)\,dz, \qquad (1.6)$$

where “dz” denotes arclength measure on C. 


If h is a continuous function from C to the positive reals, then automatically $\mu$ will 
be non-atomic and give positive length to open intervals. The issue becomes: What 
property does $h(\cdot)$ need to fulfill for the measure $\mu = \mu_h$ to be R-invariant near a 
point z? 

In Figure 1.7, the ratio ((z-arc)/$\Delta z$) → 1 as the angle $\theta \searrow 0$, and similarly 
((y-arc)/$\Delta y$) → 1. Consequently, the “infinitesimal ratio” of the y-arc to the 
z-arc is 


$$\lim_{\theta \searrow 0} \frac{\text{y-arc}}{\text{z-arc}} = \lim_{\theta \searrow 0} \frac{\Delta y}{\Delta z} = \frac{t_y}{t_z},$$

where $t_z$ and $t_y$ are the distances from z and y to the common point of tangency; 
this last equality comes from the equality $\angle Pyz = \angle Pzy$ of the base angles of 
isosceles triangle yPz at the right of Figure 1.7. 


Figure 1.7. Circle C surrounds ellipse E. Let z and y be points at opposite ends of a chord tangent to 
E, and let $t_z$ and $t_y$ denote their distance to the common point of tangency. Roll the chord through a 
small angle $\theta$. Its “z end” sweeps out an arc on C as does its “y end”; on the corresponding tangent 
lines, distances $\Delta z$ and $\Delta y$ are swept out. 


Infinitesimally, the $\mu$-length of the “z-arc” is simply its arclength times h(z). So 
for $\mu$ to be R-invariant at z, the function h must satisfy that, infinitesimally, 
the product h(z) Length(z-arc) equals h(y) Length(y-arc). In consequence, the 
invariance condition required of $h(\cdot)$ is 


$$h(z) \cdot |z, E|_R = h(y) \cdot |y, E|_L, \quad \text{where } y = Rz. \qquad (1.8a)$$


This uses the previous displayed-equation as well as $t_z = |z, E|_R$ and $t_y = |y, E|_L$. 


Proof of Lemma 1.3′: If our inside ellipse E happens to be a circle, then the 
lefthand and righthand tangential-distances equal a common function $z \mapsto |z, E|$. 
Then 


$$h(z) := 1/|z, E|$$


satisfies (1.8a), and the corresponding $\mu_h$ is the desired R-invariant measure. 
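
The circular case can be checked numerically. The sketch below is an illustration added here, not the author's computation. It assumes C is the unit circle and E is a circle of radius r centered at (d, 0) with d + r < 1; the function names, the midpoint-rule integration and the sample values d = 0.3, r = 0.4 are all hypothetical choices. If the measure with height function h(z) = 1/|z, E| is R-invariant, then the normalized length of the arc [z, Rz) should come out the same for every starting point z, whereas the ordinary arclength of [z, Rz) should vary with z.

```python
import math


def tangent_length(theta, d, r):
    """Distance |z, E| from z = (cos theta, sin theta) to its tangent points on E."""
    x, y = math.cos(theta), math.sin(theta)
    return math.sqrt((x - d) ** 2 + y ** 2 - r ** 2)


def right_map(theta, d, r):
    """Angle of R(z): follow the righthand tangent from z until it meets C again."""
    x, y = math.cos(theta), math.sin(theta)
    vx, vy = d - x, -y                        # vector from z to the center of E
    phi = math.asin(r / math.hypot(vx, vy))   # angle between that vector and a tangent
    ang = math.atan2(vy, vx) - phi            # turn clockwise: the observer's right
    ux, uy = math.cos(ang), math.sin(ang)
    t = -2.0 * (x * ux + y * uy)              # nonzero root of |z + t*u| = 1
    return math.atan2(y + t * uy, x + t * ux)


def mu_arc(start, length, d, r, steps=4000):
    """Midpoint-rule value of the integral of h = 1/|z, E| over an arc of C."""
    step = length / steps
    return sum(step / tangent_length(start + (k + 0.5) * step, d, r)
               for k in range(steps))


def rotation_number(theta, d, r):
    """Normalized mu-length of the counterclockwise arc [z, Rz)."""
    arc = (right_map(theta, d, r) - theta) % (2 * math.pi)
    return mu_arc(theta, arc, d, r) / mu_arc(0.0, 2 * math.pi, d, r)


if __name__ == "__main__":
    for k in range(6):
        theta = k * math.pi / 3
        print(round(theta, 3), rotation_number(theta, d=0.3, r=0.4))
```

The common value printed for all starting angles is, up to quadrature error, the rotation number $\alpha$ of (1.5) for this particular pair of circles.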

To handle the non-circular case, choose some linear map $\mathcal{M}$ which transforms 
E into a circle, as shown in Figure 1.9. Since a linear map preserves the ratio of 
lengths of parallel line-segments, we have that 


$$\frac{|y, E|_L}{|z, E|_R} = \frac{|\mathcal{M}y, \mathcal{M}E|_L}{|\mathcal{M}z, \mathcal{M}E|_R}.$$


But since $\mathcal{M}E$ is a circle, the subscripts on the second ratio are unnecessary and it 
can be written $|\mathcal{M}y, \mathcal{M}E| / |\mathcal{M}z, \mathcal{M}E|$. Just as before, then, 


$$h(z) := \frac{1}{|\mathcal{M}z, \mathcal{M}E|}, \quad \text{for all } z \in C, \qquad (1.8b)$$


satisfies (1.8a) and makes the measure $\mu_h$, which we will call Poncelet-measure, 
invariant under R. 


Figure 1.9. Ellipse E is carried by linear map $\mathcal{M}$ to a circle $\mathcal{M}(E)$. For artistic convenience, the linear 
map illustrated simply compresses the major axis (vertical, in the diagram) of E until equality with E’s 
minor axis. Each point of chord ytz is carried vertically downward to line-segment $\mathcal{M}(y)\mathcal{M}(t)\mathcal{M}(z)$; 
here t is the point where the yz-chord is tangent to E. 


Exercise: Poncelet-measure for a special case. Suppose C and E are confocal 
ellipses, with foci at $(0, \pm F)$ and with semi-minor axis lengths of 1 and r 
respectively, i.e. 

$$C\colon\ x^2 + \frac{y^2}{F^2 + 1} = 1 \quad\text{and}\quad E\colon\ \frac{x^2}{r^2} + \frac{y^2}{F^2 + r^2} = 1,$$

with 0 < r < 1. Then Poncelet-measure $\mu_h$ will be the integral, against arclength 
measure along ellipse C, of the function 


$$h(x, y) = \text{const} \cdot \frac{1}{\sqrt{(1 + x^2F^2)(r^2 + x^2F^2)}} \qquad (1.10)$$

where the constant is the square root of $(F^2 + r^2)/(1 - r^2)$. □ 


Remark. What can be said when CE has no circuminscribed polygon? —that is, 
when R is conjugate to an irrational rotation. One starts drawing tangent-chords 
from some point z, as in Figure 1.1, but the polygon-in-progress never closes up. 
What will be true is that the vertices of this unfulfilled polygon will densely fill 
out C: 


Under an irrational rotation $\rho = \rho_\alpha$, the orbit of 0 is dense. (1.11) 


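To establish this, partition the circle K into M subintervals 


$$[0, \tfrac{1}{M}),\ [\tfrac{1}{M}, \tfrac{2}{M}),\ \ldots,\ [\tfrac{M-1}{M}, 1).$$


Take N ≥ 1 smallest such that $\rho^N(0)$ falls within distance 1/M of 0 on the circle. (This certainly will 
happen for some N ≤ M since, by the Pigeonhole Principle, some two of the 
M + 1 points $\{0, \rho(0), \rho^2(0), \ldots, \rho^M(0)\}$ fall into the same subinterval; the distance is 
nonzero because $\alpha$ is irrational.) Thus the 
$\rho^N$-orbit of 0 is (1/M)-dense. So its $\rho$-orbit is $\varepsilon$-dense, for all $\varepsilon$. □ 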
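
The pigeonhole argument is easy to watch in action. The following Python sketch is an added illustration, with hypothetical function names and with $\alpha = \sqrt{2} - 1$ chosen arbitrarily as a sample irrational rotation number (floating-point arithmetic only approximates it). It finds the first return of the orbit of 0 to within 1/M of 0, then confirms that a correspondingly long initial stretch of the orbit leaves no gap of size 1/M or more.

```python
import math


def circle_distance(x):
    """Distance from x (mod 1) to 0 on the circle K = [0, 1)."""
    x %= 1.0
    return min(x, 1.0 - x)


def first_close_return(alpha, M):
    """Smallest N in 1..M with N*alpha (mod 1) within 1/M of 0, else None."""
    for N in range(1, M + 1):
        if circle_distance(N * alpha) < 1.0 / M:
            return N
    return None


def largest_gap(alpha, count):
    """Largest gap on the circle left by the first `count` orbit points of 0."""
    pts = sorted((k * alpha) % 1.0 for k in range(count))
    gaps = [b - a for a, b in zip(pts, pts[1:])]
    gaps.append(1.0 - pts[-1] + pts[0])      # wrap-around gap
    return max(gaps)


if __name__ == "__main__":
    alpha = math.sqrt(2) - 1                 # sample irrational rotation number
    M = 100
    N = first_close_return(alpha, M)
    delta = circle_distance(N * alpha)       # step size of the rho^N-orbit
    count = N * math.ceil(1 / delta) + 1     # enough points to go once around
    print(N, largest_gap(alpha, count) < 1 / M)
```

Increasing M shrinks the largest gap, which is the numerical face of the statement that the orbit is $\varepsilon$-dense for every $\varepsilon$.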
By the way, it is natural to wonder whether the Poncelet rotation-number $\alpha$ is 
essentially unique; is it independent of linear map $\mathcal{M}$? That it is, follows from this 
challenge: 


If rotations $\rho_\alpha$ and $\rho_\beta$ are topologically conjugate, where $0 < \alpha, \beta < \tfrac12$, then $\alpha = \beta$. 
Another natural question, which is discussed in the appendix, is to wonder 


Is Poncelet-measure unique? How does 
$\mu_h$ depend on our choice of linear map $\mathcal{M}$? (1.12) 


§2 THE PLANK PROBLEM. Experimentation with strips of paper as “planks” 
tends to suggest that a disk D cannot be efficiently covered unless the strips are 
parallel, which leads one to this 


Plank Conjecture. Suppose $(w_n)_{n=1}^{\infty}$ are the widths of a countable family of planks 
which cover disk D. Then 

$$\sum_{n=1}^{\infty} w_n \ge \operatorname{Width}(D).$$

If $\sum_{n=1}^{\infty} w_n$ actually equals Width(D), the diameter of the disk, then the planks must 
be parallel to one another. 


Measuring area differently. We can take D to be the closed radius-1 disk centered 
at the origin and ask that a plank P be the closed region between two parallel 
lines, both of which contact D. The width of P is the perpendicular distance 
between these parallel edges. 

In order to cover the disk by small total width, one is tempted to use planks 
which pass near the center of D, since such planks P cover more than their fair 
share of the area of the disk; more than Width(P)/Width(D). Yet if many planks 
pass near the center, they must waste some of their area by overlapping one 
another. Conversely, while it is easy to arrange that planks passing near the 
boundary of D be disjoint, such planks cover less than their fair share. 

An answer to the Plank Problem is not obvious because the area covered by a 
plank P is not invariant under moving P over the disk. An analogy with Poncelet’s 
theorem is useful. The only “obvious” case where a circuminscribed polygon can 
be rotated about E, is when the two ellipses are concentric circles. And this is 
exactly the case where normal arclength on C is evidently invariant under the 
transformation R. What saved the day, in the case of general ellipses, was the 
discovery of a different way of measuring length on C, the measure $\mu_h$ of (1.6), 
which indeed is invariant under motion by R. 

Returning to our planks, the analogy suggests looking for a new way to measure 
area on D so that this “new area” —at least for planks— is invariant under rigid 
motions. We might expect to construct this measure, $\nu$, as we did with Poncelet-measure, 
by integration of some nice positive height function $h\colon D \to \mathbb{R}_+$ against 
area. If we build $\nu$ in this way, then any subset of the disk with zero $\nu$-area would 
have to have zero area, as well. Also, the desired invariance of $\nu$ under rigid 
motions would mean there is a positive constant $\hbar$ so that 


$$\nu(P) = \hbar \cdot \operatorname{Width}(P), \quad \text{for any plank } P. \qquad (2.1)$$


One clarification is in order. Since $\nu$ is to be a measure on D, we need to interpret 
a plank P as a subset of the disk and agree to regard “P” as the set $P \cap D$. With 
this convention, D itself is a plank. 
Assuming now that someone with outstanding geometric intuition has provided 
us with such a $\nu$, let’s use it to verify the conjecture. 


Provisional Proof of the Plank Conjecture. If planks $\{P_n\}_{n=1}^{\infty}$ cover D then 


$$\operatorname{Width}(D) = \frac{1}{\hbar}\,\nu(D) = \frac{1}{\hbar}\,\nu\Bigl(\bigcup_n P_n\Bigr) \le \frac{1}{\hbar} \sum_n \nu(P_n) = \sum_n \operatorname{Width}(P_n). \qquad (2.2)$$

This shows that no cover can use less total width than a parallel cover. 

To show that only parallel covers use minimal width, suppose now we have a 
cover realizing equality in (2.2), that is, $\sum_n \nu(P_n) = \nu(\bigcup_n P_n)$. Consequently, the 
intersection $P_j \cap P_k$ of any pair of planks must have no $\nu$-area and thus no area. 
This means that in a minimal cover any two planks have disjoint interiors; no two 
planks overlap. So we need but prove that 


Any two planks P and R of a non-overlapping cover are parallel. 


To see this, consider any plank Q of the cover which intersects segment $L_1$ in the 
figure below. 


Figure 2.3. Given two planks P and R, choose distinct parallel line-segments $L_1 = p_1r_1$ and $L_2 = p_2r_2$; 
the $L_i$ are “open” segments, not containing their endpoints. Endpoints $p_i$ and $r_i$ are on neighboring 
edges of P and R, respectively. Since $L_1$ and $L_2$ are disjoint from P and R, they must be covered by 
the other planks. 


We can have chosen the point $p_1$ to be in the interior of D; so Q cannot have $L_1$ 
as an edge. Therefore, since Q overlaps neither P nor R it must run between their 
neighboring edges; hence Q crosses both $L_1$ and $L_2$. Since the $L_i$ are parallel, 
intervals $Q \cap L_1$ and $Q \cap L_2$ have the same length. Consequently, 


$$\sum_{Q \in \mathcal{Q}_1} \operatorname{Length}(Q \cap L_1) = \sum_{Q \in \mathcal{Q}_1} \operatorname{Length}(Q \cap L_2),$$

where $\mathcal{Q}_1$ consists of all planks in the cover which overlap $L_1$. But these planks do 
not overlap each other. Consequently the closed intervals $\{Q \cap L_2\}_{Q \in \mathcal{Q}_1}$ are 
non-overlapping, and we may conclude that Length($L_1$) ≤ Length($L_2$). Reversing 
the roles of $L_1$ and $L_2$ shows that they have equal length. And this means that P 
and R were parallel all along. □ 


1994] THREE PROBLEMS 617 


Both halves of the proof, the inequality and the parallelness, hinge upon our 
not-yet-known-to-exist plank-measure $\nu$. The rotational symmetry of condition 
(2.1) suggests looking at rotationally symmetric geometric figures as a possible 
source of plank-measure. It turns out that the geometric insight which will provide 
plank-measure was made by Archimedes in his treatise On the Sphere and the 
Cylinder. 


Archimedes’ area-preserving map of the Earth. The figure below shows how to 
project the globe onto a cylinder of paper in order to get a map of the Earth on 
which countries of equal spherical-area are represented by map-regions of equal 
(planar) area. 


Figure 2.4. A sphere S and a (finite) cylinder C are tangent along their common equator. The height of 
C equals the sphere’s diameter. For a point $p \in S$, let $\bar p \in C$ be its radial-projection: A ray starting 
from and perpendicular to the axis-of-symmetry of C passes through p and intersects the cylinder at $\bar p$. 


Archimedes proved that for any region B on the sphere, 

$$\operatorname{Area}_S(B) = \operatorname{Area}_C(\bar B).$$


Since both the sphere and the cylinder are rotationally symmetric, it suffices to 
establish this when B is a band. Figure 2.4 shows, drawn on a sphere of radius 1, a 
band B with lower edge at latitude $\theta_0$, and upper edge infinitesimally higher at 
latitude $\theta_0 + d\theta$. Since the radius of the bottom edge of B is $r = \cos(\theta_0)$, 


$$\text{Infinitesimal } \operatorname{Area}_S(B) = (2\pi r) \cdot d\theta = 2\pi \cos(\theta_0)\, d\theta.$$


The projected band $\bar B$ has radius 1 and infinitesimal width $d\bar\theta = d\theta \cdot \cos(\theta_0)$, by 
similar triangles. Consequently 


$$\text{Infinitesimal } \operatorname{Area}_C(\bar B) = (2\pi \cdot 1) \cdot d\bar\theta = 2\pi\, d\theta \cos(\theta_0).$$

The two infinitesimal areas are thus seen to be equal. □ 


Constructing plank measure. Area-measure of a sphere whose equatorial-plane is 
D, when projected down to D, has the desired plank property. Define $\nu$ by 


$$\nu(A) := \operatorname{Area}_S(A_S).$$

Figure 2.5. A sphere S whose equatorial-plane is our disk D. For any subset $A \subset D$, let $A_S$ be its 
orthogonal projection on the sphere. Thus the projection of a subdisk concentric with D would be a pair 
of “polar icecaps”. 


The image of a plank $P \subset D$ is a band $P_S$ on the sphere. The sphere can then be 
oriented so that the band projects on the cylinder to a cylindrical-band $\bar P_S$ whose 
width necessarily equals the width of the original plank P. Thus 

$$\nu(P) = \operatorname{Area}_S(P_S) = \operatorname{Area}_C(\bar P_S) = 2\pi \cdot \operatorname{Width}(P),$$


which demonstrates that every plank’s measure is a constant times its width. 
Moreover, this establishes that the natural value of “Plank’s Constant”, $\hbar$, is $2\pi$. 
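
The plank property can also be tested by simulation. The sketch below is an added illustration rather than anything from the original article; the function name, the sample planks and the Monte Carlo approach are all assumptions made here. It takes D to be the unit disk, describes a plank as the strip a ≤ x ≤ b, and estimates the area of the part of the unit sphere lying over that strip by sampling uniform random points on the sphere. The estimate should be close to $2\pi(b - a)$ wherever the plank sits.

```python
import math
import random


def plank_measure_estimate(a, b, samples=200_000, seed=0):
    """Monte Carlo estimate of nu(P) for the plank P = {a <= x <= b} in the unit disk.

    nu(P) is the area of the part of the unit sphere lying over P, so we sample
    points uniformly on the sphere and count those whose x-coordinate is in [a, b].
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(samples):
        # A normalized Gaussian vector is uniformly distributed on the sphere.
        x, y, z = rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)
        norm = math.sqrt(x * x + y * y + z * z)
        if a <= x / norm <= b:
            hits += 1
    return 4 * math.pi * hits / samples      # sphere area times hit fraction


if __name__ == "__main__":
    for a, b in [(-0.25, 0.25), (0.5, 1.0)]:  # a central plank and an edge plank
        print((a, b), plank_measure_estimate(a, b), 2 * math.pi * (b - a))
```

Both sample planks have width 1/2, so both estimates should land near $\pi$ even though the central plank covers far more ordinary area of the disk than the edge plank does.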


§3 GELFAND’S QUESTION. Questions about high-order digits of powers, Table 
P.3, provide a perfect setting for students to formulate and test numerical conjec- 
tures. Here are some “‘mysterious empirical facts” which can be found by judicious 
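use of a computer. For a positive x, let $\langle\langle x \rangle\rangle$ denote the high-order non-zero digit 
of the base-ten expansion of x. Thus $\langle\langle \text{Plank’s Constant} \rangle\rangle = 6$ and $\langle\langle e/\ldots \rangle\rangle$ 
equals 2. 
Mysterious observations? Studying the 2-column of Table P.3, 


$$2\;4\;8\;1\;3\;6\;1\;2\;5\;1\;2\;4\;8\;1\;3\;6\;1\;2\;5\;\ldots\;\langle\langle 2^n \rangle\rangle \ldots$$


one discovers that the nine digits do appear to have frequencies* and that they 
are unequal. Indeed, letting fr(d) denote the frequency of digit d, 


$$\operatorname{fr}(d) := \lim_{N\to\infty} \frac{1}{N} \bigl|\{\, n : 1 \le n \le N \text{ and } \langle\langle 2^n \rangle\rangle = d \,\}\bigr|,$$

the computer suggests that the frequencies decrease steadily as d increases from 1 
to 9: 
$$\operatorname{fr}(1) \approx .301, \quad \operatorname{fr}(2) \approx .176, \quad \ldots \quad \operatorname{fr}(9) \approx .045.$$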
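
A short program reproduces these empirical values. The Python sketch below is an added illustration; the function name and the cutoff of 3000 powers are arbitrary choices made here, and a finite count can only approximate the limit that defines fr(d). The multiplier m is left as a parameter so that other columns of the table can be examined in the same way.

```python
from collections import Counter


def leading_digit_frequencies(m: int, count: int) -> dict[int, float]:
    """Empirical frequency of each leading digit among m^1, m^2, ..., m^count."""
    tally = Counter()
    power = 1
    for _ in range(count):
        power *= m                           # exact integer arithmetic
        tally[int(str(power)[0])] += 1
    return {d: tally[d] / count for d in range(1, 10)}


if __name__ == "__main__":
    freqs = leading_digit_frequencies(2, 3000)
    for d in range(1, 10):
        print(d, round(freqs[d], 3))
```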


The first empirical surprise is that digit d occurs with the same frequency fr(d) in 
every column of the table. This, despite the fact that the columns are decidedly 


*The existence of frequencies is not a foregone conclusion. In the sequence of high-order digits 
$(\langle\langle n \rangle\rangle)_{n=1}^{\infty}$ of all natural numbers, the digit “1” does not have a frequency, since its upper-density is 
five times its lower-density. 
different: they all have patterns which appear in no other column. For example, 
the sequence “248136” occurs infinitely often in column 2, yet will never be 
witnessed by the other columns. Indeed, no sequence of length six which appears 
in a column, ever appears in any other column. 

Every digit occurs in every column of Table P.3, and yet an all-the-same row never 
appears. This suggests looking at joint frequencies. Let $F_{2:6:9}(d, d', d'')$ be the 
frequency of rows n which have d in column 2, digit $d'$ in column 6 and $d''$ in 
column 9. Some pairs of columns seem to be independent (columns 2:3 for 
instance) in that there is approximate equality 


$$F_{2:3}(d, d') \approx \operatorname{fr}(d)\operatorname{fr}(d').$$


Yet columns 4 and 5 appear strongly non-independent. Only one digit d ever 
occurs simultaneously in columns 2:5. Columns 2:4 don’t have independence, yet 
columns 3:6 do. But 3:9 do not. And so on... 

The real startler comes, however, when you vary the “seed.” $\mathrm{Row}_{n+1}$ is
obtained by multiplying $\mathrm{row}_n$ by the tuple (2,3,...,9). Table P.3 starts with a
“zero-th row” of (1,1,...,1). But there is no reason to use all the same
integer, and upon consideration, why even restrict to integers? We could start
with a “zero-th row” $(s_2,\dots,s_9)$ of positive real numbers as seeds and then study
frequencies; let the symbol $F^{s:s'}_{m:m'}(d,d')$ mean the frequency of those n for which
simultaneously $\langle\langle s\,m^n\rangle\rangle = d$ and $\langle\langle s'\,m'^{\,n}\rangle\rangle = d'$.

At first this seems like a waste of time, since the seed appears to have no effect
on individual columns; no matter which of 2,...,9 one takes for the multiplier m,
apparently $F^{s}_{m}(d) = \mathrm{fr}(d)$ independently of the seed s. But this independence
abruptly disappears for joint frequencies. An energetic computer will discover, for
example, the oddity that the mapping

$$s \mapsto F^{s:1}_{2:5}(3,3) \qquad (3.1)$$

is continuous but fails to be differentiable at exactly three points in the interval
(1, 10).


Converting to a dynamical system. The mathematics is simpler if we first just
analyze the statistics of a single column (the 2-column perhaps or, more generally,
the m-column for some fixed positive multiplier m). Effectively we are studying the
“multiply map”

$$T : \mathbb{R}_+ \to \mathbb{R}_+ : x \mapsto mx$$

applied to an initial seed s; we are following the T-orbit of s, taking the
measurement $\langle\langle T^n(s)\rangle\rangle$ along it. Since x and 10x have the same first digit, we
might as well identify numbers whose ratio is a power of 10. Doing this collapses
all these half-open intervals,

$$\dots,\ [\tfrac{1}{10}, 1),\ [1, 10),\ [10, 100),\ \dots$$

together, thus effectively wrapping the positive-reals $\mathbb{R}_+$ into a circle. Insofar as
our measurements are concerned, then, the multiply map is just some homeomor- 
phism of a circle. 

The easiest way to identify 10x with x is simply to take logarithms base-ten and
then discard the integer part. Letting $\psi$ denote this identification and letting
$\mathbb{K} = [0, 1)$ be the circle as before, we get this commutative diagram:

$$\begin{array}{ccc} \mathbb{R}_+ & \xrightarrow{\;T\;} & \mathbb{R}_+ \\ \downarrow{\scriptstyle\psi} & & \downarrow{\scriptstyle\psi} \\ \mathbb{K} & \xrightarrow{\;\rho_\alpha\;} & \mathbb{K} \end{array} \qquad \text{where } \psi(x) := (\log_{10} x) \bmod 1. \qquad (3.2)$$

Since the logarithm converts multiplication to addition, transformation
$\rho_\alpha := \psi\circ T\circ\psi^{-1}$ is simply rotation $x \mapsto x \oplus \alpha$ on the circle by
$\alpha := \log_{10}(m)$, where “$\oplus$” means addition modulo 1. Thus the high-order digit
of a number x is d if and only if $\psi(x)$ is in the half-open interval

$$I_d = [\log(d), \log(d+1)).$$

(Here, and henceforth, “log” means $\log_{10}$.) These nine intervals $\{I_d\}_{d=1}^{9}$ partition
the circle.
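
The conjugacy in diagram 3.2 can be illustrated with a short Python sketch. The function names `psi`, `digit_from_circle` and `rotated_digits` are illustrative only. Floating-point logarithms are used, so a seed whose orbit lands exactly on an endpoint $\log(d)$ (for example s = 1 with m = 2) could be misread by one digit; the example seed below avoids such endpoints.

```python
from math import log10, floor

def psi(x: float) -> float:
    """The identification psi(x) = (log10 x) mod 1, a point of the circle [0, 1)."""
    t = log10(x)
    return t - floor(t)

def digit_from_circle(z: float) -> int:
    """Index d of the interval I_d = [log d, log(d+1)) that contains z."""
    for d in range(1, 10):
        if z < log10(d + 1):
            return d
    return 9  # guard against rounding when z is extremely close to 1

def rotated_digits(s: float, m: float, count: int) -> list:
    """Leading digits of s*m^n for n = 0..count-1, read off the rotation by alpha = log10(m)."""
    alpha = log10(m)
    z = psi(s)
    digits = []
    for _ in range(count):
        digits.append(digit_from_circle(z))
        z = (z + alpha) % 1.0           # one step of the rotation rho_alpha
    return digits

if __name__ == "__main__":
    print(rotated_digits(7.0, 3.0, 10))                  # digits read from the circle
    print([int(str(7 * 3 ** n)[0]) for n in range(10)])  # digits read from exact integers
```

The two printed lists should coincide: the rotation never looks at the size of $s\,m^n$, only at its position on the circle.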


An isomorphism. The underlying transformation for Gelfand’s Question in the 
case of a single column, we’ve discovered, is a rigid rotation of a circle—this is the 
same transformation found hidden in Poncelet’s theorem. Can this isomorphism be 
made more explicit? Yes it can. 

The ellipses C and E in Figure 1.1 gave rise to a rotation number $\alpha$ and
commutative diagram 1.2. Use the $\varphi$ of that diagram to lift the intervals $\{I_d\}_1^9$ to a
partition $\{J_d\}_1^9$ on C, where $J_d := \varphi^{-1}(I_d)$, and let $m := 10^{\alpha}$.

Effectively $\varphi^{-1}\circ\psi$ is an isomorphism from the Gelfand m-system to the
Poncelet CE-system: Any question about the m-column with seed s is mapped to
the corresponding question about the Poncelet-orbit of $z := \varphi^{-1}(\psi(s))$. This
R-orbit of z lands in intervals

$$J_{d_0},\ J_{d_1},\ J_{d_2},\ \dots,\ J_{d_n},\ \dots$$

precisely so that the high-order digit of $s\,m^n$ is invariably $d_n$.


Equidistribution. Now that we know that the 2-column is isomorphic to the orbit 
of 0 on the circle under rotation by log(2), what do we know? Since log(2) is 
irrational this orbit is dense, courtesy remark 1.11, and so every digit appears 
infinitely often in column 2. We now find ourselves in the embarrassing
circumstance of knowing there are integers n where $2^n$ starts with “9”, without having
the foggiest notion of a single one.
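
In this particular instance the embarrassment is easy to remove by brute force. The sketch below (the function name and the search limit are arbitrary choices made here) looks for the first power of m whose decimal expansion begins with a given digit, using exact integers.

```python
def first_power_with_leading_digit(m: int, digit: int, limit: int = 1000):
    """Smallest n >= 1 such that m^n begins with `digit`, or None if none is found up to `limit`."""
    power = 1
    target = str(digit)
    for n in range(1, limit + 1):
        power *= m
        if str(power)[0] == target:
            return n
    return None

if __name__ == "__main__":
    n = first_power_with_leading_digit(2, 9)
    print(n, 2 ** n)
```

For m = 2 and digit 9 the search stops at n = 53, where $2^{53} = 9007199254740992$. Density alone promised infinitely many such n but gave no hint of how far one must look.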

However, denseness —a property which ignores the time when points appear in 
the orbit— is not sufficient to determine the frequency of “9”, or whether it has a 
frequency at all. A more refined “randomness” property is needed: A sequence 
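$z_0, z_1, \dots$ in the circle is equidistributed if

$$\text{For any interval } I \subset \mathbb{K}: \quad \lim_{N\to\infty} \frac{1}{N}\,\bigl|\{\,n \mid 0 \le n < N \ \&\ z_n \in I\,\}\bigr| = \mathrm{Length}(I). \qquad (3.3)$$

Weyl’s EQUIDISTRIBUTION THEOREM for the circle says that for any irrational $\alpha$:
    For any point z, the sequence $z,\ z\oplus\alpha,\ z\oplus 2\alpha,\ \dots,\ z\oplus n\alpha,\ \dots$
    is equidistributed in $\mathbb{K}$.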
For a multiplier $m \in \{2,\dots,9,\pi,e,\text{etc.}\}$ which is not a rational power of ten, the
corresponding rotation number $\alpha = \log(m)$ is irrational. Weyl’s theorem explains
the mysterious frequencies fr(d):

$$\mathrm{fr}(d) = \mathrm{Length}(I_d) = \log\Bigl(\frac{d+1}{d}\Bigr), \qquad \text{for } d = 1,2,\dots,9.$$

So the frequencies are irrational; in fact, transcendental. Weyl’s theorem also
explains why these frequencies do not depend on the seed.

Randomness and Ergodicity. An irrational rotation $\rho$ is “topologically random” in
the sense that each orbit is dense. Restated, the orbit of any point $a \in \mathbb{K}$ hits any
non-empty open set U: There exists n so that $a \in \rho^{-n}(U)$.

An analogue of this, a notion of measure-theoretic “randomness”, is that the
orbit of any set A of positive length hits any other positive-length set U: There
exists n so that $\lambda(A \cap \rho^{-n}(U)) > 0$. This measure-theoretic property makes sense
for any measure-preserving transformation $(T: X, \mu)$, is called ergodicity, and has
this equivalent formulation:


    Any T-invariant† set B is, up to a nullset, either empty
    or is the whole space. Either B or $X \setminus B$ is a nullset.


The connection between our irrational rotation p and ergodicity is twofold. On 
the one hand, an irrational rotation is ergodic. On the other hand, any ergodic 
transformation satisfies an abstract equidistribution theorem, due to Birkhoff,
called the POINTWISE ERGODIC THEOREM. This ergodic theorem is sufficiently
strong to imply both Weyl’s theorem and (a result we will need shortly) the
KRONECKER-WEYL THEOREM. A formulation of Birkhoff’s theorem appears in the
appendix, as well as a proof that irrational rotations are ergodic.

It is the possibility that a transformation can fail to be ergodic which gives rise
to the instability of frequencies $F^{s:s'}_{m:m'}(d,d')$ as the seeds s and s' are varied.


Non-equidistribution. When $m = 10^{p/q}$ is the multiplier then $\rho = \rho_\alpha$ is a rational
rotation. Take, for example, $\alpha = 1/3$. The $\rho$-orbit of any “seed” $z \in \mathbb{K}$ thus has
exactly 3 points. The observed frequency of a digit d does not remain constant as
the seed is varied; for example, the frequency that the z-orbit hits $I_d$ is $\tfrac13$ for any
$z \in I_d$ but is zero for suitable z outside $I_d$.

We also notice another failure: that $\rho_{1/3}$ is not ergodic. Given any $A \subset \mathbb{K}$, the
set $B := A \cup \rho^{-1}(A) \cup \rho^{-2}(A)$ is $\rho$-invariant. Taking an A of positive length less
than 1/3 gives a non-trivial invariant set B. This explains lack of constancy for
frequencies; the orbit of any seed $z \in B^{c}$ hits B with zero frequency and yet the
measure of B is positive.
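
A small numerical sketch makes the seed-dependence concrete for rotation by 1/3, that is, for the multiplier $m = 10^{1/3}$. The helper names are invented for this illustration, and the seeds are given directly as points z of the circle (the corresponding seed on the positive reals would be $s = 10^{z}$).

```python
from math import log10

def interval_index(z: float) -> int:
    """Index d of the interval I_d = [log d, log(d+1)) that contains z."""
    for d in range(1, 10):
        if z < log10(d + 1):
            return d
    return 9

def orbit_frequency(z: float, d: int, q: int = 3) -> float:
    """Frequency with which the orbit of z under rotation by 1/q visits I_d.

    The orbit is periodic with q points, so the frequency is an exact count over one period.
    """
    orbit = [(z + k / q) % 1.0 for k in range(q)]
    hits = sum(1 for w in orbit if interval_index(w) == d)
    return hits / q

if __name__ == "__main__":
    for z in (0.97, 0.5):
        print(z, [orbit_frequency(z, d) for d in range(1, 10)])
```

The point z = 0.97 lies in $I_9$ and its orbit visits $I_9$ one time in three, whereas the orbit of z = 0.5 never enters $I_9$ at all.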

Happily, any transformation can be decomposed into disjoint ergodic
transformations. In the case of rotation by 1/3, write the circle as a disjoint union

$$\mathbb{K} = \bigsqcup_{0 \le c < 1/3} K_c, \qquad \text{where } K_c = \{c,\ c\oplus\tfrac13,\ c\oplus\tfrac23\}, \qquad (3.4a)$$

of $\rho$-invariant 3-point sets $K_c$. For a point z in the circle, the particular $K = K_c$
containing z is called the ergodic component of z because, when K is equipped
with this $\rho$-invariant probability measure $\nu_K$,

$$\nu_K(\{c\}) = \nu_K(\{c\oplus\tfrac13\}) = \nu_K(\{c\oplus\tfrac23\}) = \tfrac13, \qquad (3.4b)$$

then the system $(\rho: K, \nu_K)$ is ergodic. As K ranges over the ergodic components of
$\rho$, the measures $\nu_K$ form a disintegration of arclength measure $\lambda$. This is why (3.4)
is called the ergodic decomposition of $(\rho: \mathbb{K}, \lambda)$.


Joint frequencies. Frequencies $F_{2:3}(d,d')$ are measurements made on the direct
product of two “multiply maps” and so $\psi\times\psi$ (see commutative diagram 3.2)
carries this system to a rotation of the torus $\mathbb{K}^{\times 2} = \mathbb{K}\times\mathbb{K}$,

$$\rho_\alpha\times\rho_\beta : (z,z') \mapsto (z\oplus\alpha,\ z'\oplus\beta),$$

where $\alpha = \log(2)$ and $\beta = \log(3)$. So $F^{s:s'}_{2:3}(d,d')$ is simply the frequency that the
$\rho_\alpha\times\rho_\beta$-orbit of $(z,z') = (\psi(s),\psi(s'))$ hits rectangle $I_d\times I_{d'}$.

†“T-invariant” can be taken to mean either $T^{-1}(B) = B$, or the superficially weaker statement that
the symmetric difference $B \,\triangle\, T^{-1}B$ is a nullset. It is easy to check that if B satisfies the latter
invariance, then B can be altered by a nullset to fulfill the stronger invariance.

Analogous to the one-dimensional case, if $\rho_\alpha\times\rho_\beta$ is “random” (is ergodic)
one might expect this frequency to be the measure of the rectangle, that is, its area
fr(d)fr(d'). This, as in Weyl’s theorem, is a statement of equidistribution and is
enunciated in the well-known Kronecker-Weyl theorem.

Real numbers $\{\alpha_1,\dots,\alpha_L\}$ are rationally independent if they are linearly
independent over $\mathbb{Q}$; the only integral solution to $N_1\alpha_1 + \cdots + N_L\alpha_L = 0$ is the
all-zero $N_1 = 0,\dots,N_L = 0$ solution. On $\mathbb{K}^{\times L}$, the L-dimensional torus of
L-dimensional volume 1, a sequence of points $x_0, x_1, \dots$ is equidistributed if it hits
every sub-block $I_1\times\cdots\times I_L$ of the torus with frequency equal to the subblock’s
volume.


Kronecker-Weyl Theorem. Numbers $1,\alpha_1,\dots,\alpha_L$ are rationally independent if and
only if under the action of rotation $\rho_{\alpha_1}\times\cdots\times\rho_{\alpha_L}$ on the L-dimensional torus,
every orbit is equidistributed. (This is also equivalent to ergodicity of this
toral-rotation.)


As an application, since {1, log(2), log(3)} are rationally independent, we have
that

$$F^{s:s'}_{2:3}(d,d') = \log\Bigl(1+\frac{1}{d}\Bigr)\,\log\Bigl(1+\frac{1}{d'}\Bigr),$$

regardless of the seeds s and s'.
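
A rough empirical check of this product formula, with seeds s = s' = 1, might look as follows. The function `joint_frequency` is a name chosen here, exact integers are used, and convergence on the torus is slow, so only loose agreement should be expected for a few thousand rows.

```python
from math import log10

def leading_digit(x: int) -> int:
    """High-order digit of a positive integer."""
    return int(str(x)[0])

def joint_frequency(m1: int, m2: int, d1: int, d2: int, N: int) -> float:
    """Fraction of n in 1..N for which m1^n begins with d1 and m2^n begins with d2."""
    p1 = p2 = 1
    hits = 0
    for _ in range(N):
        p1 *= m1
        p2 *= m2
        if leading_digit(p1) == d1 and leading_digit(p2) == d2:
            hits += 1
    return hits / N

if __name__ == "__main__":
    observed = joint_frequency(2, 3, 1, 1, 3000)
    predicted = log10(2) * log10(2)     # fr(1) * fr(1)
    print(observed, predicted)
```

Replacing the multipliers (2, 3) by (2, 5) in the same function shows the product formula failing, which is the subject of the next paragraphs.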

What happens with non-independence? Take, for example, $\alpha = \log(2)$ and
$\beta = \log(5)$. Here, the equality 2 · 5 = 10 translates to the non-zero rational
relation $\alpha + \beta = 1$, and so $\rho_\alpha\times\rho_\beta$ is not ergodic. To see this, regard the
2-dimensional torus [0, 1) × [0, 1) as a square. Then the ergodic components are
“wrap-around” diagonals running northwest to southeast on this square; each number
$c \in [0, 1)$ gives us a diagonal $K_c$ which is $(\rho_\alpha\times\rho_\beta)$-invariant:

$$\mathbb{K}^{\times 2} = \bigsqcup_{0 \le c < 1} K_c, \qquad \text{where } K_c := \{(x,y) \mid x \oplus y = c\}.$$

Each diagonal $K_c$ is topologically a circle whose ergodic measure $\lambda_c$ is simply
normalized arclength,

$$\lambda_c(A) = \frac{\mathrm{Length}(A \cap K_c)}{\mathrm{Length}(K_c)}, \qquad \text{where } A \subset \mathbb{K}^{\times 2}.$$

In this 2:5 case, the system $(\rho_\alpha\times\rho_\beta: K_c, \lambda_c)$ is isomorphic to irrational rotation
on the diagonal “circle” $K_c$ by rotation number $\alpha$ (or, equivalently, $\beta$).

Weyl’s theorem then asserts that $F^{s:s'}_{2:5}(d,d')$ is the $\lambda_c$-length of the intersection
of rectangle $I_d\times I_{d'}$ with the diagonal $K_c$ which contains the $\psi$-image of (s, s').
This is the diagonal with $c := \psi(s)\oplus\psi(s')$. For example, the 2:5 column-pair of
Table P.3 corresponds to s = s' = 1 and so c = 0; therefore $K = K_0$ is the main
diagonal x + y = 1. It should be easy to figure out the frequency of seeing a
doubled digit (d, d), no?

Scanning the 2 and 5 columns of Table P.3 shows that “3” occurs doubled,
$2^5 = 32$ and $5^5 = 3125$. It is not surprising that no other digit occurs doubled
since, if d ≠ 3, then the main diagonal does not pass through the square $I_d\times I_d$
and consequently the frequency of (d,d) is zero. Conversely, if you traverse the
main diagonal K by heading southeast, you will enter the $I_3\times I_3$ square at the
point (log(3), 1 − log(3)) and exit the square at (1 − log(3), log(3)). The fraction of
your time spent inside the square was

$$F_{2:5}(3,3) = [1 - \log(3)] - \log(3) = 1 - \log(9),$$

which is irrational. (By the way, this picture of a northwest to southeast diagonal
explains the three points of non-differentiability of the function $g(s) := F^{s:1}_{2:5}(3,3)$
mentioned in (3.1). As the seed s increases, the corresponding diagonal moves
northeast and the length of it within the $I_3\times I_3$ square varies linearly except when
the diagonal passes over a corner of $I_3\times I_3$.)
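
The doubled-digit claim for columns 2:5 is easy to test with exact integers. In the sketch below, `doubled_digit_counts` is a hypothetical helper that counts, for rows n = 1,...,N, how often $2^n$ and $5^n$ share a leading digit; the observed frequency of doubled 3's is then compared with $1 - \log(9) \approx 0.0458$.

```python
from math import log10

def leading_digit(x: int) -> int:
    """High-order digit of a positive integer."""
    return int(str(x)[0])

def doubled_digit_counts(N: int) -> dict:
    """For n = 1..N, count how often 2^n and 5^n share the same leading digit d."""
    counts = {d: 0 for d in range(1, 10)}
    p2 = p5 = 1
    for _ in range(N):
        p2 *= 2
        p5 *= 5
        d = leading_digit(p2)
        if d == leading_digit(p5):
            counts[d] += 1
    return counts

if __name__ == "__main__":
    N = 3000
    counts = doubled_digit_counts(N)
    print(counts)
    print(counts[3] / N, 1 - log10(9))
```

Only the digit 3 ever appears doubled in rows n ≥ 1, in line with the geometric picture of the main diagonal crossing a single square $I_d\times I_d$.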


The general case. What are the ergodic components of toral rotation
$\rho_{\alpha_1}\times\cdots\times\rho_{\alpha_L}$? (It being clear now that to compute joint-frequencies of the
form $F^{s_1:\cdots:s_L}_{m_1:\cdots:m_L}$ one needs a geometric description of these ergodic components.)
The Kronecker-Weyl theorem tells us how, in principle, to do this. However,
computing frequencies for a specific set of $\alpha_i$ will require a bit of geometry.

Consider a maximal subset of $\{\alpha_1,\dots,\alpha_L\}$ which, were “1” adjoined, would
comprise a rationally independent collection; suppose it has $M \ge 1$ members. Let
G be the closure of the orbit of $(0,\dots,0)$; in other words, the closure in the
L-dimensional torus of tuples

$$\bigl((n\alpha_1) \bmod 1,\ (n\alpha_2) \bmod 1,\ \dots,\ (n\alpha_L) \bmod 1\bigr)$$

as n ranges over the integers. This G will be an M-dimensional subtorus (indeed,
a subgroup of $\mathbb{K}^{\times L}$). So the ergodic components of $\rho_{\alpha_1}\times\cdots\times\rho_{\alpha_L}$ are merely the
translates of this subtorus. Consequently, the frequency

$$F^{s_1:\cdots:s_L}_{m_1:\cdots:m_L}(d_1,\dots,d_L)$$

is the M-dimensional ‘area’ of the cross-section of the block (rectangular
parallelepiped) $I_{d_1}\times\cdots\times I_{d_L}$ which is sliced by the particular subtorus that contains
the point $(z_1,\dots,z_L)$. This is the subtorus $G \oplus (z_1,\dots,z_L)$.

Even for the case of three rotation numbers with exactly two being rationally 
independent, determining frequencies might be tricky. Here one wants to compute 
the area of the region which is the intersection of a plane slicing through a 
3-dimensional block. Depending on the tilt of this plane, the intersection could be 
a triangle, a quadrilateral, a pentagon or a hexagon. 

So we end up by finding another point of commonality among the problems of 
Poncelet, Tarski and Gelfand: Existence of a natural invariant measure does not, 
alas, imply that the measure is going to be easy to compute... 


§A APPENDIX. Here we give a terse history of the three questions, as well as some
related conundrums. A technical note: In this article all measures were tacitly
Borel measures, and all sets and functions were Borel measurable.


Poncelet’s theorem, history. Jean-Victor Poncelet was an officer of engineers in 
Napoleon’s army during the invasion of Russia. Like other invasions of Russia this 
one failed, and on November eighteenth of 1812, Poncelet was captured in the 
retreat from Moscow. During his 1812-1814 captivity, which he later wrote about 
in PONCELET 1862, he developed the notions of Projective Geometry which led him 
to the Closure theorem. (For a fuller history, see BELL 1937.) 


Commentary. The idea of using an invariant to prove Poncelet’s theorem goes
back to Jacobi and Bertrand, according to SCHOENBERG 1983; and Schoenberg’s
beautiful paper gives an efficient proof using an invariant integral, which in this
context is the same thing as an invariant measure.

Nonetheless, Schoenberg’s proof uses non-elementary notions: The Brouwer 
fixed-point theorem and facts in projective geometry concerning the polar line of a 
point. In contrast, by focusing on an invariant measure for Poncelet’s transforma- 
tion, even the small amount of computation in Schoenberg is entirely avoided. This 
is merely a change in viewpoint, not a change in proof, yet it has the advantage of 
yielding a theorem even in the case that the given pair of ellipses have no 
circuminscribed polygon, and thereby suggests the link with Gelfand’s question. By 
avoiding Brouwer’s theorem, which is non-constructive in nature, one can actually
compute the invariant density, e.g. (1.10), and thus in principle determine whether
the rotation number is rational so as to ascertain whether a circuminscribed 
polygon exists. 

An extensive history and “pre-history” of Poncelet’s theorem, as well as several
proofs, appear in a paper by BOS, KERS, OORT, RAVEN 1987, a paper which I came
upon after this article was written.

There is a connection between the Poncelet-measure determined by confocal 
ellipses C and E, and the invariant “billiard measure” of an elliptical billiard table. 
The ergodic components of the billiard measure of C turn out to be the
1-parameter family of Poncelet CE-measures obtained by varying E, the inner confocal
ellipse. Billiard measure is defined in a companion article to this one, [KING],
where it is used to study a billiard question on a table with a cusp at infinity.


Plank Problem, history. Archimedes proved in proposition 3 of Book II of On the 
Sphere and the Cylinder, see HEATH 1897, that a plane orthogonal to a diameter of 
a sphere cuts the sphere into two regions with areas in the same ratio as the 
lengths of the two pieces of the diameter. Since the areas of the sphere and 
cylinder were known to Archimedes, this proposition is tantamount to showing 
that radial projection is area-preserving. I am indebted to C. E. Thompson for 
indicating this proposition to me. 

Given here was the elementary case of Tarski’s Plank Problem, proposed in 
1932. What Tarski had asked was this: 


Suppose a convex compact region D in the plane is covered by countably many 
planks. Must the sum of their widths dominate the width of D? 


(The width of a shape is the width of the narrowest single plank which covers it.) 
This problem does not immediately follow from the disk case, since some convex
shapes (the equilateral triangle being a notorious example) have a width
exceeding the diameter of their inscribed circle. Almost 20 years later, Tarski’s
problem was solved by an ingenious argument of THØGER BANG, 1951. The
elementary version of Tarski’s problem suggests the following question which, to
my knowledge, is open.


Question A.1. What is the infimum, W(r), of total widths $\sum_{n=1}^{\infty} \mathrm{Width}(P_n)$
taken over all systems $\{P_n\}_{n=1}^{\infty}$ of planks which cover the annulus of outer-radius
1 and inner-radius r?


Gelfand’s Question, history. The “frequencies of $\langle\langle 2^n\rangle\rangle$” problem appears on
page 37 of AVEZ 1966, where it is attributed to Gelfand.

Weyl’s theorem is usually proved by noting that definition (3.3) of equidistribution
is equivalent to this formulation: The sequence $x_0, x_1, \dots$ is equidistributed in
the circle $\mathbb{K}$ if

$$\text{For any continuous } g: \mathbb{K} \to \mathbb{C}: \quad \lim_{N\to\infty} \frac{1}{N}\sum_{n=0}^{N-1} g(x_n) = \int_{\mathbb{K}} g\,d\lambda. \qquad (A.2a)$$

Weyl’s theorem concerns those g for which, for each z, the sequence $\{z \oplus n\alpha\}_{n=0}^{\infty}$
satisfies (A.2a). Since the set of such g forms a closed subspace (in the
supremum-norm) of the space of all continuous functions, the Stone-Weierstrass
theorem tells us that it is enough to check this when g is a group-character, where
it is easily verified.


The Pointwise Ergodic Theorem. There is a more general “equidistribution
theorem” which applies when T is a measure-preserving transformation on probability
space $(X, \mu)$. In the special case when T is ergodic, Birkhoff’s theorem states that
for any $g \in L^1(\mu)$ the limit and equality below exist.

$$\text{For a.e. } x \in X: \quad \lim_{N\to\infty} \frac{1}{N}\sum_{n=0}^{N-1} g(T^n x) = \int_X g\,d\mu. \qquad (A.2b)$$

(The nullset of “bad” x violating equality will generally depend on g; there is no
universal full-measure set working for all functions. An earlier ergodic theorem,
convergence in the $L^2$-norm, was proven by von Neumann.) The pointwise ergodic
theorem is a far-reaching generalization of the Strong Law of Large Numbers
(SLLN). It also quickly implies Weyl’s theorem (that continuous functions g
have no bad points) once irrational rotations are shown to be ergodic. We show
that now.


Proposition A.3. Any irrational rotation $\rho = \rho_\alpha$ is ergodic.


Proof: Suppose $\rho$-invariant set B has positive mass. Take a short open interval I
in the circle such that

$$\lambda(I \cap B) > (.99)\,\lambda(I).$$

By the density of the orbit of an endpoint of I, we can find times $k_1,\dots,k_M$ such
that the translated intervals $\rho^{k_m}(I)$ are disjoint, and

$$\lambda\Bigl(\bigsqcup_{m=1}^{M} \rho^{k_m}(I)\Bigr) > .98.$$

But $B = \bigcup_{m=1}^{M} \rho^{k_m}(B) \supset \bigsqcup_{m=1}^{M} \rho^{k_m}(I \cap B)$. Consequently

$$\lambda(B) \ \ge\ \sum_{m=1}^{M} \lambda\bigl(\rho^{k_m}(I \cap B)\bigr) \ \ge\ \sum_{m=1}^{M} (.99)\,\lambda\bigl(\rho^{k_m}(I)\bigr) \ >\ (.99)(.98) \ >\ .97.$$

Of course this “.97” could have been made as close to 1 as desired, so $\lambda(B^{c}) = 0$. □


The upshot is that Birkhoff’s theorem simultaneously contains the probabilistic
equidistribution of SLLN and the topological equidistribution of Weyl.

Poncelet-measure is unique. Although we will not go into detail (see for example
PETERSEN 1983), ergodicity and equidistribution ideas answer question 1.12
affirmatively by showing that


    Under an irrational rotation $\rho_\alpha$, arclength $\lambda$ is the unique invariant Borel
    probability measure.


This gives uniqueness of Poncelet-measure $\mu$, when the rotation number $\alpha$,
coming from ellipses C and E, is irrational. But even in the case when $\alpha$ is rational,
$\mu$ must still simply be the image, under $\varphi$, of the circle’s arclength measure:
Dilating the inner ellipse E slightly causes the rotation number to vary
continuously, and only requires a small change in the affine map. Hence the
Poncelet-measure for a CE-pair with rational $\alpha$ is the limit of Poncelet-measures from
nearby irrational rotation numbers, and all these measures are the $\varphi$-image of
arclength.


The Kronecker—Weyl theorem. The preceding proposition derived ergodicity of an 
irrational rotation directly from the density of every orbit. Exactly the same proof 
would show that a toral rotation is ergodic, once orbits are known to be dense. 
Also analogous to the 1-dimensional case of the circle, ergodicity together with 
Birkhoff’s theorem can be used to demonstrate equidistribution, which in the 
multi-dimensional case is Kronecker-Weyl. Thus the fundamental idea in proving
K-W is to show that rational independence of the rotation numbers implies that
the orbit of the origin is dense.

Let us conclude by proving density in the 2-dimensional case, so as to illustrate 
how density can be lifted from the 1-dimensional case. 


Warmup for the K-W theorem. Suppose numbers $\alpha$, $\beta$, 1 are rationally
independent. Then under toral-rotation $\rho_\alpha\times\rho_\beta$, the orbit of (0, 0) is dense.


Proof: Fixing $\varepsilon$ we show that the $(\alpha,\beta)$-orbit of (0, 0) is $\varepsilon$-dense. Pick N > 1 such
that

$$(\hat\alpha, \hat\beta) := [\rho_\alpha\times\rho_\beta]^{N}(0,0)$$

is within $\varepsilon$ of (0,0) in the torus; this, by the same Pigeon-hole argument used in
(1.11). This $\hat\alpha$ is of the form $N\alpha$ minus an integer, and likewise for $\hat\beta$. Rational
independence implies that $(\hat\alpha,\hat\beta)$ is not (0,0) and thus

    the $(\hat\alpha,\hat\beta)$-orbit of (0,0) is $\varepsilon$-dense in the line
    $L = \{(x,y) \in \mathbb{R}\times\mathbb{R} \mid x\hat\beta = y\hat\alpha\}$

in the “unwrapped” torus, i.e., in the plane. Wrapping the plane back up, this line
L winds densely around the torus, since its slope $\hat\beta/\hat\alpha$ is irrational; this again
follows from rational independence. In consequence, the $(\hat\alpha,\hat\beta)$-orbit of (0,0) is
$\varepsilon$-dense in the torus, and therefore so is the $(\alpha,\beta)$-orbit. □


Final remark. We know the frequency of leading-digit “9” in the doubling 
sequence 2, 4,8, 16,32,.... But what is the frequency of leading-digit ‘9’ in the 
sequence 


2,4, 16, 256, 65536,... (*) 


where, instead of doubling, the next term is always the square of the current
term?

Map $\psi$ of diagram 3.2 carries squaring to the map S(x) := 2x (mod 1) on [0, 1).
This 2-to-1 map, an endomorphism of the circle group, preserves Lebesgue
measure and is ergodic, and so Birkhoff’s theorem tells us that the orbit of almost
every point x visits each $I_d$ interval with Lebesgue frequency. But sequence (*)
asks this question of a specific point, x = log(2). Is log(2) a “bad” point (relative to
the $I_d$ intervals) for Birkhoff’s theorem?
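
One can at least look at the beginning of the orbit. The sketch below follows x ↦ 2x (mod 1) starting from log(2) using Python's `decimal` module. Everything here is an illustrative assumption: the function name, the working precision, and the small guard term that protects against rounding when a term such as 2 or 4 sits exactly on an interval endpoint. Each doubling costs about 0.3 decimal digits of accuracy, so the number of terms must stay well below the working precision, and no finite computation of this kind settles the question.

```python
from decimal import Decimal, localcontext, ROUND_FLOOR

def squaring_leading_digits(count: int, precision: int = 200) -> list:
    """Leading digits of 2, 4, 16, 256, ... (repeated squaring), via x -> 2x mod 1."""
    if count > precision:
        raise ValueError("count must not exceed the working precision")
    with localcontext() as ctx:
        ctx.prec = precision
        x = Decimal(2).log10()                   # psi(2) = log10(2)
        ten = Decimal(10)
        guard = ten ** -(precision // 2)         # absorbs rounding at exact endpoints
        digits = []
        for _ in range(count):
            digits.append(int(ten ** x + guard)) # x in [0, 1), so 10^x lies in [1, 10)
            x = 2 * x
            x -= x.to_integral_value(rounding=ROUND_FLOOR)
        return digits

if __name__ == "__main__":
    digits = squaring_leading_digits(150)
    print(digits[:12])
    print({d: digits.count(d) for d in range(1, 10)})
```

The first terms can be checked against exact integers such as `2 ** (2 ** k)` for small k, which is a useful sanity test of the precision bookkeeping.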

To this day, nobody knows...


The n-Queens Problem


Igor Rivin, Ilan Vardi, and Paul Zimmermann 


1. INTRODUCTION. The n-queens problem asks how many ways one can put n
queens on an n × n chessboard so that no two queens attack each other. In other
words, in how many ways can n points be placed on an n × n grid so that no two are
on the same row, column, or diagonal (see Figure 1)?


Figure 1. Chess Queen. 


This question was first posed for the ordinary 8 X 8 chessboard as an anony- 
mous problem [3], later attributed to Max Bezzel [1, p. 211]. The problem received 
wide attention, however, when posed by Franz Nauck in 1850 [22]. Writing about 
the problem in a letter to the astronomer Schumacher, Gauss conjectured that 
there were 72 solutions [9]. Soon after this 92 solutions were published which 
convinced Gauss that he had been incorrect. The 92 solutions are commonly 
represented by 12 “fundamental” solutions, that is, solutions that are not
reflections and rotations of each other (see Figure 2). That 92 was the right answer,
however, was not proved formally until 1874 by Dr. Glaisher [10], [27], using an
idea of Günther (see Section 5). For a history of early results see [1] and the
bibliography of [28].

These days the 8-queens problem is most often encountered as an exercise in 
introductory artificial intelligence programming courses. In fact, the n-queens 
problem is one of the benchmarks by which backtracking algorithms have been 
compared [32], [12], [11]. 

For the general n × n case denote by Q(n) the number of solutions. It is not
immediately obvious whether Q(n) > 0 for general n and there have been a
number of independent proofs showing that Q(2) = Q(3) = 0, Q(n) > 0, n > 3.

Figure 2. Fundamental solutions to 8 × 8 problem.


The first proof seems to be by Ahrens [1], but proofs by authors unaware of this
reference appear in [34], [14], [6]. Other proofs can be found in [28], [5], [23]. It is
interesting that none of these notes that Q(1) = 1 [35].

The precise nature of Q(n) seems very difficult to understand and a more
tractable problem appears to be the toroidal n-queens problem: How many ways
can one place n queens on an n × n chessboard so that no two queens are on
the same row, column, or extended diagonal (see Figure 3)? This problem was first
studied by Pólya [1, p. 363-374] who showed that T(n) > 0 if and only if
(n,6) = 1, where T(n) denotes the number of n × n toroidal queens solutions.

Figure 3. Toroidal Queen.


Finding closed expressions for Q(n) and T(n) seems to be an intractable 
problem, so our results deal instead with estimating the asymptotic order of these 
quantities. 

First, note that every queen solution is also a rook solution, i.e., no two queens
can be on the same column or row, and that each rook solution corresponds to a
distinct permutation of {1,...,n} and there are n! such permutations. It follows
that the trivial upper bounds are $T(n) \le Q(n) \le n! \le e^{n\log n}$.

It appears that the only previous non-trivial lower bound is the one of Lucas [17]
stating that $T(p) \ge p(p-3)$ if p is prime. In this paper we will show


Theorem 1.


(a) Let p be a prime such that (p − 1)/2 is not prime, then $T(p) > 2^{(p-1)/(2d)}$,
where d is the smallest nontrivial divisor of (p − 1)/2. In particular, if p ≡ 1
(mod 4), then $T(p) > 2^{(p-1)/4}$, but in general one only has the bound
$T(p) > 2^{\sqrt{(p-1)/2}}$.

(b) If n is divisible by a prime ≡ 1 (mod 4) then $T(n) > 2^{n/5}$.


Note that the above results also hold for Q(n) since Q(n) ≥ T(n), while (b) holds
for almost all n, since the set of integers not divisible by a prime ≡ 1 (mod 4) has
density zero [13].


We have not been able to find super exponential bounds for Q(n) and T(n) but we 
believe 


Conjecture 1.

Finally, T(n) can also be thought of as the number of arrays of non-negative
integers with row, column, and broken diagonals summing to one, i.e., a
pandiagonal magic square [27] with common sum equal to one. Following ideas of [30] we
propose


Conjecture 2. The generating function $\sum_n (T(n)/n!)\,x^n$ has a closed form.


2. SURVEY OF PREVIOUS RESULTS. First, write a queens solution as a function
f(k), k = 0,...,n − 1, so that the k’th queen is placed at the (k, f(k)) coordinate
of the chessboard. It follows immediately that f represents a toroidal solution if
and only if $k \mapsto f(k)$, $k \mapsto f(k) + k \pmod n$, and $k \mapsto f(k) - k \pmod n$ are all
one to one.

Similarly, f(k) represents a (not necessarily toroidal) queens solution if and
only if $k \mapsto f(k)$, $k \mapsto f(k) + k$, and $k \mapsto f(k) - k$ are one to one.
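
These two characterizations translate directly into code. The sketch below represents a solution as a sequence f of length n and counts solutions by brute force over all permutations, which is only feasible for small boards. The names `is_queens`, `is_toroidal` and `count_solutions` are chosen here for illustration.

```python
from itertools import permutations
from math import gcd

def is_queens(f) -> bool:
    """k -> f(k), f(k)+k and f(k)-k are all one to one."""
    n = len(f)
    return (len(set(f)) == n
            and len({f[k] + k for k in range(n)}) == n
            and len({f[k] - k for k in range(n)}) == n)

def is_toroidal(f) -> bool:
    """Same three conditions, with the diagonals taken mod n."""
    n = len(f)
    return (len(set(f)) == n
            and len({(f[k] + k) % n for k in range(n)}) == n
            and len({(f[k] - k) % n for k in range(n)}) == n)

def count_solutions(n: int, toroidal: bool = False) -> int:
    """Q(n), or T(n) when toroidal=True, by exhaustive search over permutations."""
    check = is_toroidal if toroidal else is_queens
    return sum(1 for f in permutations(range(n)) if check(f))

if __name__ == "__main__":
    for n in range(1, 9):
        print(n, count_solutions(n), count_solutions(n, toroidal=True), gcd(n, 6) == 1)
```

For n = 8 the ordinary count reproduces the classical 92 solutions, and for n up to 8 the toroidal count is positive exactly when (n, 6) = 1.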

We now present an elegant proof [5], [2], that there is always a queens solution 
for n > 3. The proof splits up according to the residue class of n (mod 6). 


(a) If n = 6m + 1 or n = 6m + 5 then (n, 6) = 1 and one lets f(k) be given by
f(k) = 2k (mod n). This is clearly a toroidal solution (thus an ordinary
solution). Note that this is what one would ordinarily consider as putting
queens one “knight’s move” apart.

(b) If n = 6m or n = 6m + 4 then one takes the solution of (a) for the
(n + 1) × (n + 1) board and removes the queen in the (0, 0) position (i.e.,
the leftmost column and bottom row). The resulting position is an n × n
solution.

(c) If n = 6m + 2 or n = 6m + 3 first construct a 6m + 2 solution as follows:
Put a queen at (k, f(k)), where

$$f(k) = \begin{cases} 2k + (n-2)/2 \pmod n, & \text{if } 0 \le k \le (n-2)/2,\\ n - 1 - f(n-1-k), & \text{if } n/2 \le k \le n-1. \end{cases}$$

One checks easily that this is a solution. Note that this is a straightforward
generalization of solution (10) in Figure 2 for the 8 × 8 case.

Since this solution does not have a queen on the main diagonal, one can construct
a 6m + 3 solution by adding a row and column to the edge of the board and
putting a queen on the new corner. □
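
The three cases can be turned into a short constructive routine. The following is a sketch under the reading of case (c) given above, with index ranges 0 ≤ k ≤ (n − 2)/2 and n/2 ≤ k ≤ n − 1; the function name `construct` is arbitrary, and the checker is repeated so that the block runs on its own.

```python
def is_queens(f) -> bool:
    """k -> f(k), f(k)+k and f(k)-k are all one to one."""
    n = len(f)
    return (len(set(f)) == n
            and len({f[k] + k for k in range(n)}) == n
            and len({f[k] - k for k in range(n)}) == n)

def construct(n: int) -> list:
    """A queens solution f[0..n-1] for n >= 4, chosen by the residue class of n mod 6."""
    if n < 4:
        raise ValueError("the construction needs n >= 4")
    r = n % 6
    if r in (1, 5):                      # case (a): knight's-move pattern
        return [(2 * k) % n for k in range(n)]
    if r in (0, 4):                      # case (b): drop the (0, 0) queen of the (n+1)-board
        big = construct(n + 1)
        return [big[k + 1] - 1 for k in range(n)]
    if r == 2:                           # case (c), the 6m + 2 board
        half = (n - 2) // 2
        f = [None] * n
        for k in range(half + 1):
            f[k] = (2 * k + half) % n
        for k in range(n // 2, n):
            f[k] = n - 1 - f[n - 1 - k]
        return f
    # r == 3: extend the 6m + 2 solution by a queen on the new corner
    return construct(n - 1) + [n - 1]

if __name__ == "__main__":
    for n in range(4, 16):
        f = construct(n)
        print(n, f, is_queens(f))
```

For n = 8 the routine returns the placement [3, 5, 7, 1, 6, 0, 2, 4], one queen per column with rows listed in order.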

Turning to T(n), we prove Pólya’s result characterizing n for which T(n) > 0.
This is also proved in [5], [19], [2]. An extension is given in [20].

Let (n,6) = 1, then, as before, f(k) = 2k (mod n) is a toroidal solution, so
T(n) > 0. Conversely, one shows that T(n) > 0 implies that n is not divisible by 2
or 3.

Assume that f(k) represents an n × n toroidal solution so f(k) − k (mod n) is
a permutation of 0,...,n − 1 and

$$\sum_{k=0}^{n-1} (f(k) - k) \equiv \sum_{k=0}^{n-1} k = \frac{n(n-1)}{2} \pmod n.$$

But this sum is also

$$\sum_{k=0}^{n-1} (f(k) - k) = \sum_{k=0}^{n-1} f(k) - \sum_{k=0}^{n-1} k = 0,$$

since f(k) is a permutation of 0,...,n − 1. Therefore n divides n(n − 1)/2 and it
follows that n is odd.


One similarly shows that n is not divisible by 3 by using the more elaborate sum

$$\sum_{k=0}^{n-1} (f(k) - k)^2 + \sum_{k=0}^{n-1} (f(k) + k)^2 - 4\sum_{k=0}^{n-1} k^2 \equiv 0 \pmod n.$$

Since each of the first two sums is congruent to $\sum_{k=0}^{n-1} k^2$ (mod n), this says that n
divides $2\sum_{k=0}^{n-1} k^2 = n(n-1)(2n-1)/3$, which fails when 3 divides n. □


The next result, also due to Pólya [1], shows how to compose solutions, i.e., if an
m × m solution and an n × n solution are given then one tries to construct an
mn × mn solution by placing a copy of the m × m solution where each queen
appears in the n × n solution (see Figure 4).


Figure 4. Composed solution. 


Let m,n > 3, where (n,6) = 1. Then if g is a toroidal n Xn solution and f is an 
ordinary m X m Solution, then one can compose these to an mn X mn solution. 


Proof: Every integer mod mn can be written uniquely as an + b, a = 0,...,m − 1,
b = 0,...,n − 1. The claim is that h(an + b) = f(a)n + g(b) is an mn × mn
solution. For example, checking the condition that h(k) + k is one to one:

The assumption

$$h(an + b) + an + b = h(a'n + b') + a'n + b' \qquad (1)$$

is seen to imply g(b) + b ≡ g(b') + b' (mod n), which gives b = b' since g is an
n × n toroidal solution. Equation (1) then gives that

$$f(a)n + an = f(a')n + a'n,$$

so a = a' since f is a queens solution. The other cases are exactly similar, giving
the result.

We now turn to the Lucas estimate $T(p) \ge p(p-3)$ which is proved by noting
that for p a prime, each pair a ≢ 0, ±1 (mod p) and b = 0,...,p − 1, generates
the distinct toroidal solution f(k) = ak + b, and the number of such solutions is
p(p − 3).
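
The Lucas family is simple enough to enumerate outright. In this sketch the function `linear_toroidal_solutions` (a name invented here) lists the maps f(k) = ak + b (mod p) for a ≢ 0, ±1, and the toroidal checker is repeated so that the block is self-contained.

```python
def is_toroidal(f) -> bool:
    """k -> f(k), f(k)+k and f(k)-k are all one to one mod n."""
    n = len(f)
    return (len(set(f)) == n
            and len({(f[k] + k) % n for k in range(n)}) == n
            and len({(f[k] - k) % n for k in range(n)}) == n)

def linear_toroidal_solutions(p: int) -> list:
    """All maps f(k) = a*k + b (mod p) with a not congruent to 0, 1 or -1."""
    solutions = []
    for a in range(2, p - 1):            # residues other than 0, 1 and p - 1
        for b in range(p):
            solutions.append(tuple((a * k + b) % p for k in range(p)))
    return solutions

if __name__ == "__main__":
    for p in (5, 7, 11, 13):
        sols = linear_toroidal_solutions(p)
        print(p, len(set(sols)), p * (p - 3), all(is_toroidal(f) for f in sols))
```

For p = 5 and p = 7 exhaustive search gives T(5) = 10 and T(7) = 28, so the linear solutions are the only ones there and the bound is attained.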

We end this section by mentioning results that are tangentially related to the 
n-queens problem. 

(i) Upper bounds for T(n) might be obtained by replacing the toroidal queen 
with a toroidal semiqueen, a piece that moves like a toroidal queen but cannot 
travel on negative diagonals. The toroidal semiqueen problem can be expressed 
very simply in terms of permanents and this question was studied by I. Rivin and 
I. Vardi in [33, Chapter 6]. 

(ii) A simpler question than the n-queens problem is to compute how many
ways one can place k nonattacking queens on an n × n chessboard. Call this
number $Q_k(n)$. For k = 2, 3 there are closed forms [1], [28],


$$Q_2(n) = \frac{n(n-1)(n-2)(3n-1)}{6},$$

$$Q_3(n) = \frac{(n-1)(n-3)(2n^4 - 12n^3 + 25n^2 - 14n + 1)}{12} \quad (n \text{ odd}).$$


In general, one can show [31, Problem 4.15] that for fixed k the generating
function


$$\sum_{n} Q_k(n)\, x^n$$


is a rational function.

(iii) The ideas in Pólya’s paper have been used to give another proof of Fermat’s
result that every prime ≡ 1 (mod 4) is a sum of two squares [16].
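The closed forms in item (ii) can be compared with a direct count on small boards. The sketch below is illustrative only: the function names are made up here, the denominator 12 and the restriction to odd n in the k = 3 formula are assumptions of this sketch, and the brute-force search is feasible only for small n.

```python
from itertools import combinations


def attacks(q1, q2):
    """True if two queens on squares (row, col) attack each other."""
    (r1, c1), (r2, c2) = q1, q2
    return r1 == r2 or c1 == c2 or abs(r1 - r2) == abs(c1 - c2)


def count_placements(n, k):
    """Number of ways to place k mutually nonattacking queens on n x n."""
    squares = [(r, c) for r in range(n) for c in range(n)]
    return sum(1 for qs in combinations(squares, k)
               if not any(attacks(a, b) for a, b in combinations(qs, 2)))


def q2_closed(n):
    return n * (n - 1) * (n - 2) * (3 * n - 1) // 6


def q3_closed_odd(n):
    # Assumed valid for odd n only.
    return (n - 1) * (n - 3) * (2 * n**4 - 12 * n**3 + 25 * n**2 - 14 * n + 1) // 12


if __name__ == "__main__":
    for n in range(1, 7):
        print(n, count_placements(n, 2), q2_closed(n))
    for n in (3, 5, 7):
        print(n, count_placements(n, 3), q3_closed_odd(n))
```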


3. COMPOSING NEW SOLUTIONS. Examining a typical composition con- 
structed by Theorem 3 one notes that there are large areas of the board that have 
no queens (see Figure 4). 

We noted that these regions can be used effectively to construct many more 
solutions with only a slight variant of the basic composition idea. 


Theorem 2. Let m, n > 3 be given, (n,6) = 1, and let $f_1, f_2, \ldots, f_{Q(m)}$ be all
m × m queens solutions, and let g be a toroidal n × n solution. Then for each map
π: {0,...,n − 1} → {1,..., Q(m)} the function $h(an + b) = f_{\pi(b)}(a)\,n + g(b)$ gives
a distinct mn × mn solution.


Proof: The proof that each of these gives a queens solution is exactly as in the
previous section, while the fact that each solution is distinct is clear from the
definition. □
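The extra freedom in Theorem 2 comes from choosing a possibly different m × m solution for each value of b. A minimal sketch for m = 4 and n = 5 follows; the two 4 × 4 solutions, the toroidal solution g(b) = 2b mod 5, and all names are choices made for this illustration.

```python
from itertools import product


def is_queens_solution(h):
    """Ordinary board: a permutation with distinct sums and differences."""
    size = len(h)
    return (sorted(h) == list(range(size))
            and len({h[k] + k for k in range(size)}) == size
            and len({h[k] - k for k in range(size)}) == size)


def compose_with_choice(fs, g, pi):
    """h(a*n + b) = fs[pi[b]](a)*n + g(b), one choice of f per value of b."""
    m, n = len(fs[0]), len(g)
    return tuple(fs[pi[b]][a] * n + g[b] for a in range(m) for b in range(n))


def all_compositions(fs, g):
    """One composed solution for every map pi: {0..n-1} -> indices of fs."""
    return [compose_with_choice(fs, g, pi)
            for pi in product(range(len(fs)), repeat=len(g))]


if __name__ == "__main__":
    fs = [[1, 3, 0, 2], [2, 0, 3, 1]]     # both 4 x 4 queens solutions
    g = [(2 * b) % 5 for b in range(5)]   # a toroidal 5 x 5 solution
    sols = all_compositions(fs, g)
    print(len(set(sols)), all(is_queens_solution(h) for h in sols))
```

With Q(4) = 2 and n = 5 this produces 2^5 = 32 distinct 20 × 20 solutions from a single g, which is the count used in Corollary 1.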


Corollary 1. Let (n, 6) = 1, m > 3; then $Q(mn) > [Q(m)]^n\, T(n)$. In particular, if N
is a number divisible by 5, and (N,6) = 1, then $Q(N) > 4^{N/5}$.


Proof: The first part follows directly from counting the number of solutions
generated by Theorem 2. The second result follows by letting n = 5, m = N/5,
and noting that Q(5) = 4. One then checks the special cases N = 10, 15. □

Remark. Corollary 1 gives the first example of a set of n’s for which Q(n) grows
faster than a polynomial in n.


4. USING THE MULTIPLICATIVE STRUCTURE OF Z/pZ. In the previous
section we constructed mn × mn solutions out of m × m solutions and n × n
solutions. Such a method will not work for the case of p × p chessboards, when p
is prime, so constructing an exponential number of solutions in this case requires a
new idea.

The basic idea of composition was to generate solutions using an additive
subgroup of Z/nZ. We will now use the multiplicative structure by constructing
“quasi-linear” solutions of the form $f(k) = c_k k$, where $c_k$ is constant on cosets of
a multiplicative subgroup of (Z/pZ)*.

The simplest case is when p ≡ 1 (mod 4). It is well known [13, page 85] that for
such p there is a number i mod p with the property that $i^2 \equiv -1$ (mod p).

Consider an equivalence relation on {1,...,p − 1} by a ~ b if $a = i^k b$ for
some k. This defines (p − 1)/4 equivalence classes $\langle a_1\rangle, \langle a_2\rangle, \ldots, \langle a_{(p-1)/4}\rangle$, for
some sequence of numbers $0 < a_1, \ldots, a_{(p-1)/4} < p$.


Now consider $f(k) = c_k k$, where $c_k = i$ or $c_k = 1/i$, and $c_k$ is constant on each
set ⟨k⟩. The claim is that f(k) is a toroidal queens solution.


Theorem 3 (i). Each map σ: {1,2,...,(p − 1)/4} → {±1} yields a distinct toroidal
p × p solution.


Proof: For each σ identify {1,...,(p − 1)/4} with the distinct classes
$\langle a_1\rangle, \ldots, \langle a_{(p-1)/4}\rangle$ and define f by $f(k) = i^{\sigma(\langle k\rangle)} k$.
To see that f is a toroidal solution we check the three conditions of Section 2:


(a) Assume that f(k) = f(k′), then
$$i^{\sigma(\langle k\rangle)} k = i^{\sigma(\langle k'\rangle)} k' \;\Rightarrow\; k = i^{\sigma(\langle k'\rangle) - \sigma(\langle k\rangle)} k' \;\Rightarrow\; \langle k\rangle = \langle k'\rangle \;\Rightarrow\; k = k'$$
so f is one to one.

(b) Assume that f(k) + k = f(k′) + k′, then $i^{\sigma(\langle k\rangle)} k + k = i^{\sigma(\langle k'\rangle)} k' + k'$, so
$$k(1 + i^{\pm 1}) = k'(1 + i^{\pm 1})$$
for one of four choices of ±1, ±1. All these cases lead to ⟨k⟩ = ⟨k′⟩, for
example, if k(1 + 1/i) = k′(1 + i) then the identity 1 + 1/i = (1 + i)/i
gives k = ik′. It follows that ⟨k⟩ = ⟨k′⟩ so k = k′.

(c) If f(k) − k = f(k′) − k′, then one gets that k = k′ as in part (b). The only
difference is that $i^2 \equiv -1$ (mod p) is needed to take care of the cases when
k(i − 1) = k′(1/i − 1) and k(1/i − 1) = k′(i − 1).

Finally, it is routine to check that for each distinct σ one gets a distinct f. □
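The quasi-linear construction can be carried out explicitly for a small prime. The sketch below is an illustration, not the authors' code: the helper names are hypothetical, f(0) = 0 is assumed, and the inverse of i is computed with Fermat's little theorem.

```python
from itertools import product


def is_toroidal_solution(f):
    """Sums f(k)+k and differences f(k)-k must be distinct modulo p."""
    p = len(f)
    return (len(set(f)) == p
            and len({(f[k] + k) % p for k in range(p)}) == p
            and len({(f[k] - k) % p for k in range(p)}) == p)


def sqrt_minus_one(p):
    """Smallest i with i*i = -1 (mod p); exists when p = 1 (mod 4)."""
    return next(i for i in range(2, p) if (i * i + 1) % p == 0)


def classes(p, i):
    """Classes of {1,...,p-1} under multiplication by powers of i."""
    seen, out = set(), []
    for a in range(1, p):
        if a not in seen:
            cls = {a * pow(i, e, p) % p for e in range(4)}
            seen |= cls
            out.append(cls)
    return out


def quasi_linear_solutions(p):
    """One map f(k) = c_k * k for every choice of c = i or 1/i per class."""
    i = sqrt_minus_one(p)
    i_inv = pow(i, p - 2, p)  # inverse of i modulo p
    cls = classes(p, i)
    sols = []
    for choice in product((i, i_inv), repeat=len(cls)):
        f = [0] * p  # f(0) = 0 is assumed in this sketch
        for c, mult in zip(cls, choice):
            for k in c:
                f[k] = mult * k % p
        sols.append(tuple(f))
    return sols


if __name__ == "__main__":
    for p in (13, 17):
        sols = quasi_linear_solutions(p)
        print(p, len(set(sols)), all(is_toroidal_solution(f) for f in sols))
```

For p = 13 there are three classes and hence 2^3 = 8 distinct solutions; for p = 17 there are 2^4 = 16.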


In the general case let q be the smallest divisor of p − 1 that is even and
greater than two. It is known [13] that $x^q = 1$ has q solutions mod p and that
these form a cyclic group. Let ξ be a generator and, as before, define an
equivalence relation by a ~ b if $a = \xi^k b$ for some k. This gives (p − 1)/q
equivalence classes $\langle a_1\rangle, \ldots, \langle a_{(p-1)/q}\rangle$, where ⟨a⟩ represents the equivalence
class of a (i.e., $\{a, \xi a, \xi^2 a, \ldots, \xi^{q-1} a\}$). The result is


Theorem 3 (ii). Each map σ: {1,...,(p − 1)/q} → {±1} leads to a distinct toroidal
p × p solution.

Proof: The proof proceeds exactly as in part (i) of the theorem. It is important to
note that as in part (c) above, one needs to have $\xi^j = -1$ for some j. This explains
why q must be chosen to be an even divisor of p − 1. □

Theorem 1(a) now follows immediately by counting the number of solutions
generated by Theorem 3.


Remark 1. We have found no improvement on the lower bound p(p — 3) for 
primes of the form p = 2q + 1,q prime (Cunningham primes). 


Remark 2. For a given q dividing p − 1, the solutions constructed by this method
have the same cycle structure when taken as permutations, i.e., a product of
(p − 1)/q cycles of length q.


Remark 3. This method of constructing solutions can be used to give more
complicated forms of compositions of solutions. For simplicity consider two distinct
primes $p_1, p_2 \equiv 1$ (mod 4) (the general case is similar).


Theorem 4. For each map $\sigma: \{1, \ldots, (p_1 - 1)/4\} \to \{1, \ldots, T(p_2)\}$ and
$\tau: \{0, \ldots, p_2 - 1\} \to \{1, \ldots, 2^{(p_1-1)/4}\}$ there is a distinct $p_1p_2 \times p_1p_2$ toroidal solution.


Proof: Let $f_1, \ldots, f_{T(p_2)}$ be the toroidal $p_2 \times p_2$ solutions and $g_1, \ldots, g_{2^{(p_1-1)/4}}$
be the solutions as constructed in Theorem 6 (i). One can write each number
(mod $p_1p_2$) uniquely as $ap_1 + bp_2$ where $a = 0, \ldots, p_2 - 1$, $b = 0, \ldots, p_1 - 1$. It
can then be shown that for each σ, τ the function


$$h(ap_1 + bp_2) = f_{\sigma(\langle b\rangle)}(a)\,p_1 + g_{\tau(a)}(b)\,p_2$$


gives a distinct toroidal solution. □

Counting the number of solutions generated by Theorem 4, one gets that


$$T(p_1p_2) \ge T(p_2)^{(p_1-1)/4}\; 2^{p_2(p_1-1)/4}.$$


This can be extended to more complicated compositions for products of more than
two primes. A computation shows that in the limit this gives the lower bound
$T(n) > 2^{(1-\varepsilon)n/3}$, where ε → 0 as the number of prime factors of n that are ≡ 1
(mod 4) goes to infinity.

We now turn to the proof of Theorem 1 (b). Consider n, (n,6) = 1, and n is
divisible by a prime p ≡ 1 (mod 4). It follows from Corollary 1 and p ≥ 5 that


$$T(n) = T((n/p)p) > T(p)^{n/p}\, T(n/p) > 2^{((p-1)/4)(n/p)} \ge 2^{n/5}. \qquad \square$$


Remark. The reader may have noted that our techniques have been elementary. It 
is an interesting question why nontrivial lower bounds have escaped the large 
literature on this subject. We believe that there are two reasons for this. 

The first comes from the original formulation of the problem on the 8 × 8
chessboard. Note that in the proof of Theorem 1 the hardest case was for
numbers ≡ 2 (mod 6), since they cannot be reached from the more tractable
toroidal problem. This might also explain why the toroidal problem has not been
extensively investigated.

The second reason is the emphasis on classifying “fundamental” solutions, i.e.,
solutions that are not rotations or reflections of each other. This problem is
difficult even in the much simpler case of the rook’s problem, and counting the
number of fundamental solutions has proved to be nontrivial [18], [4], [21], [26].
Note that fundamental solutions under toroidal symmetries for the toroidal case
are very easy to classify: they are the ones with a queen at (0, 0).


5. COMPUTATIONAL RESULTS. As mentioned in the introduction there has
been much interest in the computation of Q(n) and T(n) using backtracking
algorithms. A different algorithm has been advanced by Igor Rivin and Ramin
Zabih [24]. Their idea is similar to a method proposed by Günther: Consider
independent variables $X_0, \ldots, X_{2n-2}, Y_{-n+1}, \ldots, Y_{n-1}$ and the matrix $\|X_{i+j} Y_{i-j}\|$;
then the squarefree term in X, Y of the permanent of this matrix (determinant
with no minus signs) gives the number of queens solutions. As before the toroidal
case is much cleaner. One has 2n variables and considers the squarefree term of
the permanent of $\|X_{i+j \bmod n}\, Y_{i-j \bmod n}\|$.

To estimate the running time of this method in the toroidal case, note that the
standard methods for evaluating a permanent give a running time of about $2^n$
multiplications, and the terms are squarefree polynomials in 2n variables and of
degree ≤ 2n. It follows that there are at most $2^{2n}$ terms. Since a multiplication
takes time about $8^n$, this gives a running time on the order of $16^n$, but with space
requirement of $4^n$ (the running time can be reduced to $8^n$ [25]).

Since backtracking algorithms generate all solutions these take at least T(n)
steps to compute T(n). It follows that a lower bound $T(n) > \gamma^n$, where γ > 8,
would show that the Rivin-Zabih algorithm always runs faster than backtracking (if
Conjecture 1 holds, then this algorithm will be much faster than backtracking).

On the other hand, backtracking takes very little space so it is still the more
practical method for computing large values of T(n) and Q(n), and all the values
in Figure 5 were computed this way (Q(19) and Q(20) were computed by
A. Shapira [29]). For example, the value T(23) = 128850048 was computed using
backtracking and a number of implementation shortcuts. Toroidal symmetries
reduced the number to be computed to 1482252 solutions. A further saving was to
eliminate impossible consecutive triplets. The computation was done in LeLisp as
a distributed computation over a network of 20 Suns at INRIA, Rocquencourt,
and took 267 days of CPU time.


 n        T(n)   log T(n)/(n log n)          Q(n)   log Q(n)/(n log n)
 4                                              2         0.125
 5          10         0.286                   10         0.286
 6                                              4         0.129
 7          28         0.245                   40         0.271
 8                                             92         0.272
 9                                            352         0.297
10                                            724         0.286
11          88         0.170                 2680         0.299
12                                          14200         0.321
13        4524         0.252                73712         0.336
14                                         365596         0.347
15                                        2279184         0.360
16                                       14772512         0.372
17      140692         0.246             95815104         0.382
18                                      666090624         0.391
19      820496         0.243           4968057848         0.399
20                                    39029188884         0.407
23   128850048         0.259


Figure 5. Values of T(n), Q(n).
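For readers who want to reproduce the small entries of Figure 5, here is a minimal backtracking sketch for Q(n). It is a generic textbook-style search, not the implementation used for the computations reported above, and the function names are illustrative.

```python
from math import log


def count_queens(n):
    """Count n-queens solutions on the ordinary board by backtracking,
    placing one queen per row and tracking used columns and diagonals."""
    cols, diag_sum, diag_diff = set(), set(), set()

    def place(row):
        if row == n:
            return 1
        total = 0
        for c in range(n):
            if c in cols or (row + c) in diag_sum or (row - c) in diag_diff:
                continue
            cols.add(c)
            diag_sum.add(row + c)
            diag_diff.add(row - c)
            total += place(row + 1)
            cols.remove(c)
            diag_sum.remove(row + c)
            diag_diff.remove(row - c)
        return total

    return place(0)


def growth_ratio(count, n):
    """The quantity log(count) / (n log n) tabulated in Figure 5."""
    return log(count) / (n * log(n))


if __name__ == "__main__":
    for n in range(4, 9):
        q = count_queens(n)
        print(n, q, round(growth_ratio(q, n), 3))
```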


Note that the computational evidence supports Conjecture 1 since the values of
log Q(n)/(n log n) and log T(n)/(n log n) seem to be monotonically increasing.


The Prince of Algebra


Madam Professor, 

Let me introduce myself— 
I’m Albert James, 

whom you may know 

by my test score 

that’s lower than my age. 


Your algebra tests 
are too long for me 
in fifty minutes, 

but I am proud 

of my attendance— 
I never miss class, 
never come late. 

I am preparing 

for a new career. 


For thirty years I was 
with the Postal Service 
never absent, 

never late. 


Your mathematics 
is important! 

It runs the clock 
which runs the mail. 
Now I train to be 

a first grade teacher. 


I will teach 
mathematics 

by punctuality 

and perfect attendance. 


From Intersections: Poems by JoAnne Growney, 
Kadet Press, Bloomsburg, PA, 1993, pp. 52-53.


What’s the Difference Between Cantor Sets?


Roger L. Kraft 


In this article, we will look at a family of sets called the middle-α Cantor sets and
we will try to show that not all of these sets are of the same “size.” We will do this
without computing any of the myriad kinds of fractal dimensions. Instead, we will
prove three theorems about what are called the difference sets of the middle-α
Cantor sets and these theorems will provide the evidence we need to conclude that
some middle-α Cantor sets are “larger” than others. The proofs of these theorems
will use methods that emphasize the ideas of self-similarity, rescaling and
renormalization.

The middle-α Cantor sets are a straightforward generalization of the classical
middle third Cantor set (see [B, pp. 33-36] or [E, pp. 1-6]). To define a middle-α
Cantor set in the interval [0, 1], first choose $\alpha \in (0,1)$, let $\beta = (1 - \alpha)/2$ and then
define the affine maps


$$T_0(x) = \beta x \quad\text{and}\quad T_1(x) = \beta x + (1 - \beta).$$

Let $I_0 = [0,1]$ and then, for n ≥ 1, inductively define

$$I_n = T_0(I_{n-1}) \cup T_1(I_{n-1}).$$


Then the middle-α Cantor set in the interval [0, 1] is defined as

$$\Gamma_\alpha = \bigcap_{n=0}^{\infty} I_n.$$


Let’s look at this definition more carefully. Notice that $T_0(I_0) = [0, \beta]$ and
$T_1(I_0) = [1 - \beta, 1]$, so $I_1 = [0, \beta] \cup [1 - \beta, 1]$. The “hole” in $I_1$ has length $1 - 2\beta = \alpha$.
So another way of describing $I_1$ is to say that we remove from the middle of
$I_0$ the open interval of length α, leaving two closed intervals of length β. Now
notice that $T_0(I_1) = [0, \beta^2] \cup [\beta(1 - \beta), \beta]$ and that
$T_1(I_1) = [1 - \beta, \beta^2 + (1 - \beta)] \cup [(1 - \beta) + \beta(1 - \beta), 1]$. So we get $I_2$ from $I_1$ by removing from the middle
of each component of $I_1$ an open interval of length αβ. Each of the four
components of $I_2$ has length $\beta^2$. In general, $I_n$ is a disjoint union of $2^n$ closed
intervals of length $\beta^n$ and we get $I_{n+1}$ by removing from the middle of each
component of $I_n$ an open interval of length $\alpha\beta^n$. The collection $\{I_n\}_{n=0}^{\infty}$ is a nested
sequence of compact subsets of [0, 1] and it has the finite intersection property, so
by the compactness of [0, 1], $\Gamma_\alpha$ is a nonempty set. Each $\Gamma_\alpha$ is a compact, nowhere
dense, perfect subset of the real line and hence is a Cantor set. (How would you
modify $T_0$ and $T_1$ to define a middle-α Cantor set in the interval [a, b]?)
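The inductive definition is easy to experiment with. The following sketch builds the components of $I_n$ by applying $T_0$ and $T_1$ repeatedly; the function names are illustrative and floating-point arithmetic is used, so lengths are compared only approximately.

```python
def cantor_level(beta, n):
    """Components of I_n as a sorted list of (left, right) pairs."""
    intervals = [(0.0, 1.0)]  # I_0
    shift = 1.0 - beta
    for _ in range(n):
        left = [(beta * a, beta * b) for a, b in intervals]                   # T_0
        right = [(beta * a + shift, beta * b + shift) for a, b in intervals]  # T_1
        intervals = left + right
    return intervals


def hole_lengths(intervals):
    """Lengths of the gaps between consecutive components."""
    return [intervals[i + 1][0] - intervals[i][1]
            for i in range(len(intervals) - 1)]


if __name__ == "__main__":
    beta = 0.3
    alpha = 1 - 2 * beta
    comps = cantor_level(beta, 4)
    print(len(comps), comps[0][1] - comps[0][0], beta ** 4)
    print(min(hole_lengths(comps)), alpha * beta ** 3)
```

With β = 0.3 and n = 4 there are 16 components of length $\beta^4$, and the smallest holes have length $\alpha\beta^3$, as described above.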

Fix a choice of $\alpha \in (0, 1)$ and let x be a point in $\Gamma_\alpha$. Define the address in $\Gamma_\alpha$ of
x to be a sequence $\{s_0, s_1, s_2, \ldots\}$ where each $s_i$ is either 0 or 1. To determine $s_i$,
consider x as a point in $I_{i+1}$, and if x is to the left of the nearest hole of length
$\alpha\beta^i$, then $s_i = 0$ and if x is to the right of the nearest hole of length $\alpha\beta^i$, then
$s_i = 1$. The primary usefulness of the address of a point x from $\Gamma_\alpha$ is the fact,
which you should check, that x has a unique representation of the form


$$x = \sum_{i=0}^{\infty} s_i \beta^i (1 - \beta) \quad\text{where } s_i \in \{0, 1\},$$


and the “digits” in this representation are the terms of the address. (Hint: The $2^n$
points of the form $\sum_{i=0}^{n-1} s_i \beta^i (1 - \beta)$ are the left-hand endpoints of the $2^n$
components of $I_n$.) You should also check that when α = β = 1/3, this representation is
equivalent to the standard ternary representation of the points in the middle third
Cantor set.
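The hint can be checked numerically before proving it. The sketch below compares the finite sums over all digit strings with the left-hand endpoints produced by the maps $T_0$ and $T_1$; the function names are illustrative and the comparison uses a small tolerance because of floating-point rounding.

```python
from itertools import product


def left_endpoints(beta, n):
    """Left-hand endpoints of the 2**n components of I_n, in increasing order."""
    pts = [0.0]
    for _ in range(n):
        pts = [beta * x for x in pts] + [beta * x + (1 - beta) for x in pts]
    return pts


def address_sum(beta, digits):
    """Finite sum of s_i * beta**i * (1 - beta) over the given digits."""
    return sum(s * beta ** i * (1 - beta) for i, s in enumerate(digits))


def sums_match_endpoints(beta, n, tol=1e-12):
    """Compare all 2**n finite sums with the left endpoints of I_n."""
    sums = sorted(address_sum(beta, d) for d in product((0, 1), repeat=n))
    return all(abs(s - e) < tol for s, e in zip(sums, left_endpoints(beta, n)))


if __name__ == "__main__":
    print(sums_match_endpoints(0.3, 5), sums_match_endpoints(0.45, 5))
    # Middle third case: digits (1, 0, 1) correspond to ternary 0.202 = 20/27.
    print(address_sum(1 / 3, (1, 0, 1)), 20 / 27)
```

In the middle third case the digit $s_i$ corresponds to the ternary digit $2s_i$ in position i + 1, which is the equivalence mentioned above.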

You may have noticed that the number β is more useful than α in describing
$\Gamma_\alpha$. This is because $\beta^{-1}$ is a scaling factor for $\Gamma_\alpha$. By this we mean that
$\beta^{-1}(\Gamma_\alpha \cap [0, \beta]) = \Gamma_\alpha$ (where, if γ is a real number and A is a subset of the real
line, then $\gamma A = \{\gamma x \mid x \in A\}$). In the rest of this article, when referring to a
middle-α Cantor set, we will freely go back and forth between the two numbers
β = (1 − α)/2 and α = 1 − 2β.
Compare the middle-9/10 and the middle-1/10 Cantor sets. One seems “larger”
than the other. But how can we demonstrate this? We could try comparing the
number of points in the two sets but it’s not too hard to show that all the middle-α
Cantor sets have the same cardinality (in fact, any two Cantor sets are
homeomorphic; see [HY, pp. 97-100]). So we can’t use cardinality to compare the sizes of
different middle-α Cantor sets. Another way to compare the sizes of two sets is to
compare the values of their Lebesgue measure. But we’ll now show that each of
the middle-α Cantor sets has Lebesgue measure zero. Choose $\alpha \in (0, 1)$. Each set
$I_n$ covers $\Gamma_\alpha$. $I_n$ contains $2^n$ components, each of length $\beta^n$. So the total length of
$I_n$ is $2^n\beta^n = (2\beta)^n$. So the total length of $\Gamma_\alpha$ is less than or equal to $(2\beta)^n$ for all n.
Now $\alpha \in (0,1)$ implies that $\beta \in (0, 1/2)$, so 2β < 1 and therefore the total
lengths of the $I_n$ go to zero as n goes to infinity. And this is what it means to say
that the Lebesgue measure of $\Gamma_\alpha$ is zero. So we can’t use Lebesgue measure to
compare the sizes of the middle-α Cantor sets. (It is not true, however, that every
Cantor set has measure zero; see [B, pp. 63-64].)

So we can’t use cardinality or Lebesgue measure to distinguish between different
middle-α Cantor sets. But it would be nice if we could find some way to use
these two simple notions to demonstrate that the middle-1/10 Cantor set is
“larger” than the middle-9/10 Cantor set. And that is exactly what we will be able
to do, by applying cardinality and Lebesgue measure to the difference set, $\Gamma_\alpha - \Gamma_\alpha$,
of a middle-α Cantor set. The name and notation for the difference set come from
the definition,


$$\Gamma_\alpha - \Gamma_\alpha = \{x - y \mid x, y \in \Gamma_\alpha\}.$$

Here is a more “dynamical” way of defining the difference set,

$$\Gamma_\alpha - \Gamma_\alpha = \{t \mid \Gamma_\alpha \cap (\Gamma_\alpha + t) \ne \emptyset\}$$


where $\Gamma_\alpha + t = \{x + t \mid x \in \Gamma_\alpha\}$. In this second definition, $\Gamma_\alpha + t$ is a translate of
$\Gamma_\alpha$ and $\Gamma_\alpha - \Gamma_\alpha$ tells us which of these translates intersect with $\Gamma_\alpha$. Imagine a copy
of $\Gamma_\alpha$ moving down the number line with constant velocity one such that it
coincides with $\Gamma_\alpha$ when t = 0. The difference set describes those times when the
moving copy of $\Gamma_\alpha$ intersects with $\Gamma_\alpha$. Notice that for any choice of $\alpha \in (0, 1)$,
$\Gamma_\alpha - \Gamma_\alpha \subset [-1, 1]$.

How can difference sets be used to show that one middle-α Cantor set is larger
than another? Intuitively, the larger a set is, the more often it should intersect with
a translate of itself, so the larger its difference set should be. Also, the larger a set
is, the larger we would expect the intersection that set has with its translates to be.
So what we will do is apply cardinality and Lebesgue measure to the difference
sets $\Gamma_\alpha - \Gamma_\alpha$ and to the intersections $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ for $t \in \Gamma_\alpha - \Gamma_\alpha$. We will show
that, as α decreases, the Lebesgue measure of $\Gamma_\alpha - \Gamma_\alpha$ will increase, and the
minimum cardinality of $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ for $t \in \Gamma_\alpha - \Gamma_\alpha$ will increase. However, the
price we pay for using such simple concepts as Lebesgue measure and cardinality is
that we can only demonstrate changes in the size of $\Gamma_\alpha$ for a few values of α. We
will not be able to demonstrate a continuous increase in the size of the middle-α
Cantor sets as α decreases, which is really the case.

Our first theorem is about the measure of the difference sets. We will state the
theorem in terms of β instead of α. Notice that the sets $\Gamma_\alpha$ “grow” as β increases.


Theorem 1. If β < 1/3, then $\Gamma_\alpha - \Gamma_\alpha$ is a Cantor set of measure zero. If β ≥ 1/3,
then $\Gamma_\alpha - \Gamma_\alpha = [-1, 1]$.


In other words, if β < 1/3, then $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is empty for almost all $t \in [-1, 1]$
and if β ≥ 1/3, then $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is nonempty for all $t \in [-1, 1]$. So there is a
significant change in the way that the middle-α Cantor sets act when β (or α)
crosses 1/3. We will see later that this change is even more dramatic than is
indicated by this theorem.


Proof: A very elegant proof by picture for the case when β = 1/3 can be found in
[B, p. 110]. That proof can easily be seen to generalize to all $\beta \in [1/3, 1/2)$. So we
will leave it to the reader to look up the proof of this part of the theorem.

We will prove the part of the theorem where β < 1/3. We will use the fact
(which you should prove) that for $\alpha \in (0, 1)$,

$$\Gamma_\alpha - \Gamma_\alpha = \bigcap_{n=0}^{\infty} (I_n - I_n).$$

We will show, by induction, that $I_n - I_n$ is a disjoint union of $3^n$ closed intervals
each of which has length $2\beta^n$. This implies that $\Gamma_\alpha - \Gamma_\alpha$ has measure zero by an
argument very similar to the one used to prove that $\Gamma_\alpha$ has measure zero. (Each
$I_n - I_n$ is a cover of $\Gamma_\alpha - \Gamma_\alpha$ and the total length of $I_n - I_n$ is $3^n \cdot 2\beta^n = 2(3\beta)^n$,
which goes to zero as n goes to infinity since β < 1/3.) The fact that $\Gamma_\alpha - \Gamma_\alpha$ is a
Cantor set will follow from the way that $I_{n+1} - I_{n+1}$ is derived from $I_n - I_n$.

To make clearer the geometric structure of $\Gamma_\alpha - \Gamma_\alpha$, we will begin the induction
with n = 1, rather than with n = 0. If β < t < 1 − 2β, then it’s easy to see that
$I_1 \cap (I_1 + t) = \emptyset$ (notice that if β < 1/3, then α > β; now consider the following
picture of $I_1$ and $I_1 + t$).


[Picture omitted: $I_1$ and $I_1 + t$, with the points t, β + t and 1 + t marked.]

And if −1 + 2β < t < −β, then we again have $I_1 \cap (I_1 + t) = \emptyset$.


For any other choice of $t \in [-1, 1]$ we will have $I_1 \cap (I_1 + t) \ne \emptyset$. So $I_1 - I_1$ is
the disjoint union of three closed intervals, and each of these intervals has length
2β, i.e.,

$$I_1 - I_1 = [-1, -1 + 2\beta] \cup [-\beta, \beta] \cup [1 - 2\beta, 1].$$

Now suppose that $I_n - I_n$ is the disjoint union of $3^n$ closed intervals, each of
length $2\beta^n$.

For n ≥ 1 let $I_n^L = I_n \cap [0, \beta]$ and let $I_n^R = I_n \cap [1 - \beta, 1]$, so $I_n^L$ and $I_n^R$ are
the left and right “halves” of $I_n$. Because of the self-similarity of $\Gamma_\alpha$, we have
$I_n = \beta^{-1} I_{n+1}^L$. So

$$\beta^{-1} I_{n+1}^L - \beta^{-1} I_{n+1}^L = I_n - I_n$$
or
$$I_{n+1}^L - I_{n+1}^L = \beta (I_n - I_n)$$
and so the induction hypothesis implies that $I_{n+1}^L - I_{n+1}^L$ is the disjoint union of $3^n$
closed intervals each of length $2\beta^{n+1}$. Since $I_{n+1}^R = I_{n+1}^L + (1 - \beta)$, we have


$$I_{n+1}^L - I_{n+1}^R = (I_{n+1}^L - I_{n+1}^L) - (1 - \beta) = \beta (I_n - I_n) - (1 - \beta)$$
and similarly,


$$I_{n+1}^R - I_{n+1}^L = (I_{n+1}^L - I_{n+1}^L) + (1 - \beta) = \beta (I_n - I_n) + (1 - \beta)$$


so the induction hypothesis implies that both $I_{n+1}^L - I_{n+1}^R$ and $I_{n+1}^R - I_{n+1}^L$ are a
disjoint union of $3^n$ closed intervals each of length $2\beta^{n+1}$. Now notice that

$$I_{n+1} - I_{n+1} = (I_{n+1}^L - I_{n+1}^R) \cup (I_{n+1}^L - I_{n+1}^L) \cup (I_{n+1}^R - I_{n+1}^L)$$


and, since β < 1/3, the unions are disjoint. So $I_{n+1} - I_{n+1}$ is a disjoint union of
$3^{n+1}$ closed intervals each of length $2\beta^{n+1}$. The following picture illustrates
$I_1 - I_1$, $I_2 - I_2$ and $I_3 - I_3$.


[Picture omitted: $I_1 - I_1$, $I_2 - I_2$ and $I_3 - I_3$ drawn one above the other, with the
points −1, −1 + 2β, −β, β, 1 − 2β and 1 marked.]
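The induction can be watched numerically. The sketch below forms all pairwise differences of components of $I_n$ and merges overlapping or coincident intervals; the function names and the merging tolerance are assumptions of this illustration.

```python
def cantor_level(beta, n):
    """Components of I_n as a sorted list of (left, right) pairs."""
    intervals = [(0.0, 1.0)]
    shift = 1.0 - beta
    for _ in range(n):
        intervals = ([(beta * a, beta * b) for a, b in intervals]
                     + [(beta * a + shift, beta * b + shift) for a, b in intervals])
    return intervals


def merge(intervals, tol=1e-12):
    """Union of closed intervals, merging any that overlap or touch."""
    merged = []
    for a, b in sorted(intervals):
        if merged and a <= merged[-1][1] + tol:
            merged[-1] = (merged[-1][0], max(merged[-1][1], b))
        else:
            merged.append((a, b))
    return merged


def difference_level(beta, n):
    """I_n - I_n as a list of disjoint closed intervals."""
    comps = cantor_level(beta, n)
    # [a, b] - [c, d] = [a - d, b - c]
    return merge([(a - d, b - c) for a, b in comps for c, d in comps])


if __name__ == "__main__":
    print(len(difference_level(0.25, 4)))  # beta < 1/3
    print(difference_level(0.4, 4))        # beta >= 1/3
```

For β = 0.25 and n = 4 this gives $3^4 = 81$ intervals of length $2\beta^4$, while for β = 0.4 everything merges into the single interval [−1, 1].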


Exercise. As β increases from 0 to 1/3, the sets $\Gamma_\alpha - \Gamma_\alpha$ grow in much the same
way that the sets $\Gamma_\alpha$ grow as β increases from 0 to 1/2. Show that if β < 1/5,
then the difference set of the difference set $(\Gamma_\alpha - \Gamma_\alpha) - (\Gamma_\alpha - \Gamma_\alpha)$ is a Cantor set
of measure zero and if β ≥ 1/5, then $(\Gamma_\alpha - \Gamma_\alpha) - (\Gamma_\alpha - \Gamma_\alpha) = [-2, 2]$.

When β ≥ 1/3 we know that $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is nonempty for all $t \in [-1, 1]$.
Now we will show that for some values of β and for some $t \in (-1, 1)$, $\Gamma_\alpha \cap (\Gamma_\alpha + t)$
will be as small as a nonempty set can be, i.e., a single point. Consider the
following picture.


[Picture omitted: $I_1$ with the points 0, β, 1 − β and 1 marked and, below it, $I_1 + t$
with the points t and 1 + t marked.]


This picture represents $I_1$ and $I_1 + t$ for some, as yet, unspecified values of β and
t. If we can choose β and t such that $I_1^R$ and $I_1^L + t$ are positioned, relative to
each other, the same way that $I_0$ and $I_0 + t$ are positioned, then, when we look at
$I_2^R$ and $I_2^L + t$, the above picture will reproduce itself, on a smaller scale, inside
the intervals $I_1^R$ and $I_1^L + t$. See the following picture (notice that the above
picture, when it is reproduced below in the intervals $I_1^R$ and $I_1^L + t$, has its
orientation reversed).


[Picture omitted: labels t and 1 + t.]


So if β and t can be chosen so that the kind of self-similarity described above
occurs, then for all n, $I_n$ and $I_n + t$ will have only one pair of overlapping
intervals. And then we will have only one point in $\Gamma_\alpha \cap (\Gamma_\alpha + t)$. The self-similarity
we want can be described by looking at the proportion of [0, 1] that lies to the
left of [t, 1 + t] and equating it to the proportion of [1 − β, 1] that lies to the right
of [t, β + t]. This can be expressed as


$$\frac{t}{1} = \frac{1 - (\beta + t)}{\beta}.$$

Solving for t we get

$$t = \frac{1 - \beta}{1 + \beta}. \qquad (1)$$


But we also need to require that β < t. If we take the value of t given in equation
(1) and put it in this last inequality, we get $\beta^2 + 2\beta - 1 < 0$, which is solved by
$\beta < \sqrt{2} - 1$. So when $\beta < \sqrt{2} - 1$ and we give t the value (1 − β)/(1 + β), then
$\Gamma_\alpha \cap (\Gamma_\alpha + t)$ will contain exactly one point. (Exercise: Find the value of this
point.) When $\beta = \sqrt{2} - 1$, the intersection of $\Gamma_\alpha$ and $\Gamma_\alpha + t$ will be a countable
number of endpoints along with the unique point that is in the interior of all the
overlapping pairs of intervals. Notice that $\sqrt{2} - 1 > 1/3$.
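The claim that $I_n$ and $I_n + t$ have only one pair of overlapping components can be tested directly for this choice of t. The sketch below counts overlapping pairs level by level; the function names are illustrative, and the test values of β are arbitrary choices below $\sqrt{2} - 1$.

```python
def cantor_level(beta, n):
    """Components of I_n as a sorted list of (left, right) pairs."""
    intervals = [(0.0, 1.0)]
    shift = 1.0 - beta
    for _ in range(n):
        intervals = ([(beta * a, beta * b) for a, b in intervals]
                     + [(beta * a + shift, beta * b + shift) for a, b in intervals])
    return intervals


def overlapping_pairs(beta, t, n):
    """Number of pairs (B, B') of components of I_n with B meeting B' + t."""
    comps = cantor_level(beta, n)
    return sum(1 for a, b in comps for c, d in comps
               if a <= d + t and c + t <= b)


if __name__ == "__main__":
    for beta in (0.3, 0.4):
        t = (1 - beta) / (1 + beta)
        print(beta, [overlapping_pairs(beta, t, n) for n in range(1, 9)])
```

For both values of β the count stays at one for every level tested, which is consistent with the intersection being a single point.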
For any $\beta \in (0, \sqrt{2} - 1)$ and any n, if we let

$$t_n = \beta^n\, \frac{1 - \beta}{1 + \beta}$$

then the intersection of $\Gamma_\alpha$ and $\Gamma_\alpha + t_n$ will contain exactly $2^n$ points. For
example, consider the following picture, where n = 1.

This picture shows $I_1$ and $I_1 + t_1$. $I_1^L$ and $I_1^L + t_1$ (and also $I_1^R$ and $I_1^R + t_1$) are
positioned, relative to each other, the same way that $I_0$ and $I_0 + t$ were in the last
example. So the intersection of $\Gamma_\alpha$ and $\Gamma_\alpha + t_1$ will contain exactly two points.

These examples leave us with two questions. First, for $\beta \in [1/3, \sqrt{2} - 1)$ how
often is the intersection of $\Gamma_\alpha$ and $\Gamma_\alpha + t$ a finite number of points? Second, for
$\beta \ge \sqrt{2} - 1$, can the intersection of $\Gamma_\alpha$ and $\Gamma_\alpha + t$ be finite for any choice of
$t \in (-1, 1)$? Before answering these questions, let’s look at a simpler one; for
β ≥ 1/3, how often is the intersection of $\Gamma_\alpha$ and $\Gamma_\alpha + t$ a single point?

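For any $\alpha \in (0,1)$, let $\Lambda_\alpha$ denote the middle-α Cantor set defined in the
interval [−1, 1] (so $\Lambda_\alpha = 2\Gamma_\alpha - 1$). $\Lambda_\alpha$ can also be defined by


$$\Lambda_\alpha = \Bigl\{ \sum_{i=0}^{\infty} u_i \beta^i (1 - \beta) \Bigm| u_i \in \{-1, 1\} \text{ for all } i \Bigr\}.$$


(What is the geometric significance for $\Lambda_\alpha$ of the finite sums $\sum_{i=0}^{n-1} u_i \beta^i (1 - \beta)$
where $u_i \in \{-1, 1\}$?) Notice that since $\Lambda_\alpha$ is a middle-α Cantor set, it has zero
measure.

The next lemma shows that $\Lambda_\alpha$ contains all the points of $\Gamma_\alpha - \Gamma_\alpha$ for which
$\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is a single point. So we have an answer to our last question above. If
β ≥ 1/3, then for almost all $t \in [-1, 1]$, $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ contains more than one
point.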


Lemma 2. Choose $\beta \in (0, 1/2)$. If $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is a single point, then $t \in \Lambda_\alpha$.


Proof: Let x denote the single point in $\Gamma_\alpha \cap (\Gamma_\alpha + t)$. Let $\{s_0, s_1, s_2, \ldots\}$ be the
address in $\Gamma_\alpha$ of x and let $\{r_0, r_1, r_2, \ldots\}$ be the address in $\Gamma_\alpha$ of x − t. Then


$$t = x - (x - t) = \sum_{i=0}^{\infty} (s_i - r_i) \beta^i (1 - \beta).$$


To show that $t \in \Lambda_\alpha$, we must show that $s_i - r_i \in \{-1, 1\}$ for all i ≥ 0, i.e., we
need to show that $s_i \ne r_i$ for all i. Suppose there is an i such that $s_i = r_i$. Let B
and B′ be the components of $I_i$ that contain x and x − t respectively. Assume
that $s_i = r_i = 0$. Consider the following picture of $I_{i+1} \cap B$ and $(I_{i+1} \cap B') + t$.


(The two holes have length $\alpha\beta^i$ and the four closed intervals have length $\beta^{i+1}$.)
Since $s_i = r_i = 0$, the point x is to the left of the open intervals (if $s_i = r_i = 1$,
then the point x will be to the right of the open intervals). The two “halves” of the
above picture are translations of each other, so in the right half of the picture the
point $y = x + \beta^i (1 - \beta)$ will also be in $\Gamma_\alpha \cap (\Gamma_\alpha + t)$, contradicting that x is the
only point in this intersection. □

Exercise. If $\Gamma_\alpha \cap (\Gamma_\alpha + t) = \{x\}$, then t = 2x − 1.

Let $\tilde{\Lambda}_\alpha$ denote the union of the images of $\Lambda_\alpha$ under all the affine maps of the
form


$$T(x) = \beta^n x + \sum_{i=0}^{n-1} v_i \beta^i (1 - \beta)$$


where n is any integer greater than 0 and $v_i \in \{-1, 0, 1\}$ for i = 0,...,n − 1.
Then


$$\tilde{\Lambda}_\alpha = \Bigl\{ \sum_{i=0}^{\infty} s_i \beta^i (1 - \beta) \Bigm| \text{for some } n > 0,\ s_i \in \{-1, 0, 1\} \text{ for } 0 \le i \le n - 1 \text{ and } s_i \in \{-1, 1\} \text{ for } i \ge n \Bigr\}.$$


Let’s notice several things about $\tilde{\Lambda}_\alpha$. First of all, $\tilde{\Lambda}_\alpha \subset [-1, 1]$. In fact,
$\tilde{\Lambda}_\alpha \subset \Gamma_\alpha - \Gamma_\alpha$ (why?). $\tilde{\Lambda}_\alpha$ has measure zero since it is a countable union of sets of measure
zero (the affine image of a set of measure zero has measure zero). The series
representations of points in $\tilde{\Lambda}_\alpha$ are not unique. For example, if $\beta = \sqrt{2} - 1$ and
t = (1 − β)/(1 + β), then the sequences {0, 1, 1, 1, ...} and {1, −1, 1, −1,
1, −1, ...} can both be used to represent t.
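The non-uniqueness example is easy to confirm numerically. The sketch below truncates both series after N terms; the function names and the truncation length are choices made for this illustration.

```python
from math import sqrt, isclose


def partial_sum(beta, digits):
    """Finite sum of d_i * beta**i * (1 - beta)."""
    return sum(d * beta ** i * (1 - beta) for i, d in enumerate(digits))


def two_representations(n_terms=200):
    """Return (sum of 0,1,1,1,..., sum of 1,-1,1,-1,..., t) for beta = sqrt(2) - 1."""
    beta = sqrt(2) - 1
    t = (1 - beta) / (1 + beta)
    seq_a = [0] + [1] * (n_terms - 1)
    seq_b = [(-1) ** i for i in range(n_terms)]
    return partial_sum(beta, seq_a), partial_sum(beta, seq_b), t


if __name__ == "__main__":
    print(two_representations())
```

The first series sums to β and the second to (1 − β)/(1 + β); these agree exactly when $\beta^2 + 2\beta - 1 = 0$, that is, when $\beta = \sqrt{2} - 1$.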

The significance of $\tilde{\Lambda}_\alpha$ is that it contains all the points of $\Gamma_\alpha - \Gamma_\alpha$ for which
$\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is a finite set of points. This will be shown in the proof of the
following theorem.


Theorem 3. If β ≥ 1/3, then $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is a Cantor set for almost all $t \in [-1, 1]$.


Note. So $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ goes from being almost always empty when β < 1/3, to
being almost always a Cantor set when β ≥ 1/3. This shows that the change in the
difference sets described by Theorem 1 is even more dramatic than was indicated
by that theorem.


Proof: Let $\beta \in (0, 1/2)$. We will show that if $t \in (\Gamma_\alpha - \Gamma_\alpha) \setminus \tilde{\Lambda}_\alpha$, then
$\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is a Cantor set. The intersection of any two Cantor sets is a compact, totally
disconnected set. To show that $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is a Cantor set, we only need to show
that it is a perfect set.

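Suppose that $t \in \Gamma_\alpha - \Gamma_\alpha$ and $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is not a perfect set, i.e., there is a
point $x \in \Gamma_\alpha \cap (\Gamma_\alpha + t)$ and an ε > 0 such that $(x - \varepsilon, x + \varepsilon) \cap \Gamma_\alpha \cap (\Gamma_\alpha + t) = \{x\}$,
or in other words, x is an isolated point in $\Gamma_\alpha \cap (\Gamma_\alpha + t)$. We’ll show that
$t \in \tilde{\Lambda}_\alpha$. Choose n large enough so that $\beta^n < \varepsilon$. Let B denote the component of $I_n$
that contains x and let B′ be the component of $I_n$ that contains x − t. Then
$\Gamma_\alpha \cap B$ and $(\Gamma_\alpha \cap B') + t$ are two middle-α Cantor sets whose intersection
contains only the point x. Let $\{s_0, \ldots, s_{n-1}, 0, 0, 0, \ldots\}$ be the address in $\Gamma_\alpha$ of the left
hand endpoint of B and let $\{r_0, \ldots, r_{n-1}, 0, 0, 0, \ldots\}$ be the address in $\Gamma_\alpha$ of the
left hand endpoint of B′. Let T denote the affine map


$$T(y) = \beta^{-n} \Bigl( y - \sum_{i=0}^{n-1} s_i \beta^i (1 - \beta) \Bigr).$$


The image of $\Gamma_\alpha \cap B$ under T is $\Gamma_\alpha$. The image of $(\Gamma_\alpha \cap B') + t$ under T is
$\Gamma_\alpha + t'$ where

$$t' = T\Bigl( t + \sum_{i=0}^{n-1} r_i \beta^i (1 - \beta) \Bigr). \qquad (2)$$

Because T is an affine map, we have $\Gamma_\alpha \cap (\Gamma_\alpha + t') = T\bigl((\Gamma_\alpha \cap B) \cap ((\Gamma_\alpha \cap B') + t)\bigr)$,
so $\Gamma_\alpha \cap (\Gamma_\alpha + t')$ contains only one point. Then the previous lemma implies
that $t' \in \Lambda_\alpha$. By the definition of $\Lambda_\alpha$, t′ has a unique series representation of the
form


$$t' = \sum_{i=0}^{\infty} u_i \beta^i (1 - \beta) \quad\text{where } u_i \in \{-1, 1\} \text{ for all } i. \qquad (3)$$

Now solve for t in Equation (2) using the series (3) to represent t′. We get

$$t = T^{-1}(t') - \sum_{i=0}^{n-1} r_i \beta^i (1 - \beta)$$
$$\;\; = \beta^n t' + \sum_{i=0}^{n-1} (s_i - r_i) \beta^i (1 - \beta)$$
$$\;\; = \beta^n \sum_{i=0}^{\infty} u_i \beta^i (1 - \beta) + \sum_{i=0}^{n-1} (s_i - r_i) \beta^i (1 - \beta)$$
$$\;\; = \sum_{i=0}^{n-1} (s_i - r_i) \beta^i (1 - \beta) + \sum_{i=n}^{\infty} u_{i-n} \beta^i (1 - \beta)$$


which implies that $t \in \tilde{\Lambda}_\alpha$.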

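So far, we have only assumed that $\beta \in (0, 1/2)$ and we have shown that
whenever $t \in (\Gamma_\alpha - \Gamma_\alpha) \setminus \tilde{\Lambda}_\alpha$, then $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is a Cantor set. When β ≥ 1/3,
$\Gamma_\alpha - \Gamma_\alpha = [-1, 1]$. So when β ≥ 1/3 and $t \in [-1, 1] \setminus \tilde{\Lambda}_\alpha$, then $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ is
a Cantor set. The fact that $\tilde{\Lambda}_\alpha$ has measure zero completes the proof. □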


We have now answered one of our earlier questions. For $\beta \in [1/3, \sqrt{2} - 1]$, the
intersection of $\Gamma_\alpha$ and $\Gamma_\alpha + t$ contains an infinite number of points for almost all t
in [−1, 1]. The next theorem answers the other question.


Theorem 4. If $\beta \in (\sqrt{2} - 1, 1/2)$, then $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ contains a Cantor set for all
$t \in (-1, 1)$.

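Proof: For $\beta \in (\sqrt{2} - 1, 1/2)$, define a function $f_\beta: [-1, 1] \to [-1, 1]$ by

$$f_\beta(t) = \begin{cases} \beta^{-1} t + (\beta^{-1} - 1) & \text{for } -1 \le t \le -\beta \\ \beta^{-1} t & \text{for } -\beta < t < \beta \\ \beta^{-1} t - (\beta^{-1} - 1) & \text{for } \beta \le t \le 1. \end{cases}$$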


Let’s see how $f_\beta$ was derived. If $t \in [-1, -\beta]$, then $I_1^L$ and $I_1^R + t$ overlap but
$I_1^L + t$ and $I_1^L$ do not overlap (except when t = −β and they have a common
endpoint).

[Picture omitted: $I_1$ with the points 0, β and 1 − β marked and, below it, the
translate $I_1 + t$.]

If we concentrate only on the intervals $I_1^L$ and $I_1^R + t$, then we can renormalize (or
rescale) this pair of intervals by mapping $I_1^L$ onto [0, 1] and mapping $I_1^R + t$ onto
an interval [t′, 1 + t′] in such a way that the relation of [0, 1] to [t′, 1 + t′] is the
same as the relation of $I_1^L$ to $I_1^R + t$. The required value of t′ is given by $f_\beta(t)$. If
$t \in (-\beta, \beta)$, then $I_1^L$ and $I_1^L + t$ overlap and of course $I_1^R$ and $I_1^R + t$ also overlap
(it may be that the pair $I_1^L$ and $I_1^R + t$ or the pair $I_1^R$ and $I_1^L + t$ overlap, but we
will ignore these pairs in this case).


[Picture omitted: $I_1$ with the points 0, β, 1 − β and 1 marked and, below it, the
translate $I_1 + t$.]


The value of $f_\beta(t)$ gives us, in this case, the renormalization of the pair of intervals
$I_1^L$ and $I_1^L + t$ (and simultaneously the renormalization of $I_1^R$ and $I_1^R + t$). If
$t \in [\beta, 1]$, then only $I_1^R$ and $I_1^L + t$ overlap, and $f_\beta(t)$ renormalizes this pair of
intervals.

The orbit $\{f_\beta^n(t)\}_{n=0}^{\infty}$ of the point t gives us information about $\Gamma_\alpha \cap (\Gamma_\alpha + t)$.
For example, if the nth iterate of t is in the interval (−β, β), then $\Gamma_\alpha \cap (\Gamma_\alpha + t)$
contains at least two points (why?). If $f_\beta^{n_i}(t) \in (-\beta, \beta)$ for some finite set of
integers $\{n_i\}_{i=1}^{k}$, then $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ contains at least $2^k$ points. What we will prove
below is that if $f_\beta^{n_i}(t) \in (-\beta, \beta)$ for some infinite sequence of integers $\{n_i\}_{i=1}^{\infty}$,
then $\Gamma_\alpha \cap (\Gamma_\alpha + t)$ contains a Cantor set. But first let’s show that for any
$\beta \in (\sqrt{2} - 1, 1/2)$ and for any $t \in (-1, 1)$, there is a sequence $\{n_i\}_{i=1}^{\infty}$ of integers
such that $f_\beta^{n_i}(t) \in (-\beta, \beta)$ for all i ≥ 1.

To prove this fact, it suffices to show that if $t \in (-1, -\beta] \cup [\beta, 1)$, then for
some $n$, $f_\beta^n(t) \in (-\beta, \beta)$ (so the orbit of $t$ will always return to $(-\beta, \beta)$ after it
leaves it). Notice that the points $-1$ and $1$ are both fixed points for $f_\beta$. Suppose
that $t \in [\beta, 1)$ (the proof for $t \in (-1, -\beta]$ is similar). The image under $f_\beta$ of
$[\beta, 1)$ is the interval $[2 - \beta^{-1}, 1)$ which, for $\beta > \sqrt{2} - 1$, is a subset of $(-\beta, 1)$. So
the point $f_\beta(t)$ is contained in either $(-\beta, \beta)$ (in which case $n = 1$ and we are
done) or it’s still in $[\beta, 1)$. Notice that the distance of $f_\beta(t)$ from 1, i.e., $1 - f_\beta(t) =
1 - (\beta^{-1} t - (\beta^{-1} - 1)) = \beta^{-1}(1 - t)$, is strictly greater than the distance that $t$
was from 1, i.e., $1 - t$ (since $\beta^{-1} > 1$). So as long as $f_\beta^n(t)$ stays in $[\beta, 1)$, its
distance from 1 is $\beta^{-n}(1 - t)$. But eventually $\beta^{-n}(1 - t)$ will be strictly greater
than $1 - \beta$ (the length of $[\beta, 1)$) and $f_\beta^n(t)$ will be in $(-\beta, \beta)$.
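The return argument above can be explored numerically. The following Python sketch is only an illustration, not part of the proof: the function names `f` and `steps_to_return`, the sample value $\beta = 0.45$, and the iteration cap are all assumptions made for this example. It iterates the map $f_\beta$ and reports the first $n \ge 1$ with $f_\beta^n(t) \in (-\beta, \beta)$.

```python
import math


def f(beta, t):
    """The renormalization map f_beta on [-1, 1], as defined in the proof."""
    if t <= -beta:
        return t / beta + (1 / beta - 1)
    if t < beta:
        return t / beta
    return t / beta - (1 / beta - 1)


def steps_to_return(beta, t, max_steps=10_000):
    """Smallest n >= 1 with f_beta^n(t) in (-beta, beta), or None if none is found."""
    for n in range(1, max_steps + 1):
        t = f(beta, t)
        if -beta < t < beta:
            return n
    return None


beta = 0.45  # hypothetical sample value in (sqrt(2) - 1, 1/2)
assert math.sqrt(2) - 1 < beta < 0.5
for t in (0.5, 0.9, 0.999, -0.7, -0.9999):
    print(t, steps_to_return(beta, t))
```

Starting points close to $\pm 1$ take longer to return, in line with the estimate $\beta^{-n}(1 - t)$ for the distance from 1. Floating-point arithmetic is used, so this is a sanity check rather than a verification.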

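Now let’s show that if $\beta \in (\sqrt{2} - 1, 1/2)$ and $t \in (-1, 1)$, then $\Gamma_\beta \cap (\Gamma_\beta + t)$
contains a Cantor set. If $B$ is a component of $I_n$, then $B$ contains two components
of $I_{n+1}$. Let $(B \cap I_{n+1})^-$ denote the left hand component of $I_{n+1}$ contained in $B$
and let $(B \cap I_{n+1})^+$ denote the right hand component. Let $B_0 = B_0' = [0, 1]$. For
$n \ge 0$, define inductively the following sequences of nested closed intervals

\[
B_{n+1} =
\begin{cases}
(B_n \cap I_{n+1})^- & \text{if } f_\beta^n(t) \in [-1, \beta) \\
(B_n \cap I_{n+1})^+ & \text{if } f_\beta^n(t) \in [\beta, 1]
\end{cases}
\]

and

\[
B_{n+1}' =
\begin{cases}
(B_n' \cap I_{n+1})^+ & \text{if } f_\beta^n(t) \in [-1, -\beta] \\
(B_n' \cap I_{n+1})^- & \text{if } f_\beta^n(t) \in (-\beta, 1].
\end{cases}
\]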

648 WHAT’S THE DIFFERENCE BETWEEN CANTOR SETS? [August-September 


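The lengths of the $B_n$ go to zero, so $\bigcap_{n=0}^{\infty} B_n$ is a single point in $\Gamma_\beta$. Let $x$ denote
this point. Similarly, let $y = \bigcap_{n=0}^{\infty} B_n'$. $B_n$ and $B_n'$ have been defined so that $B_n$
and $B_n' + t$ are the overlapping pair of components from $I_n$ and $I_n + t$ that are
renormalized by applying $f_\beta$ to $f_\beta^{n-1}(t)$. In the case where $f_\beta^{n-1}(t) \in (-\beta, \beta)$, so
$f_\beta$ is simultaneously renormalizing two pairs of overlapping intervals, $B_n$ and
$B_n' + t$ are defined to be the left-hand pair. It follows that $\bigcap_{n=0}^{\infty} B_n = \bigcap_{n=0}^{\infty} (B_n' + t)$,
i.e., that $x = y + t$. Let $\{s_0, s_1, s_2, \ldots\}$ and $\{r_0, r_1, r_2, \ldots\}$ denote the addresses
in $\Gamma_\beta$ of $x$ and $y$ respectively. So

\[
s_n =
\begin{cases}
0 & \text{if } f_\beta^n(t) \in [-1, \beta) \\
1 & \text{if } f_\beta^n(t) \in [\beta, 1]
\end{cases}
\qquad \text{and} \qquad
r_n =
\begin{cases}
1 & \text{if } f_\beta^n(t) \in [-1, -\beta] \\
0 & \text{if } f_\beta^n(t) \in (-\beta, 1].
\end{cases}
\]

Notice that $s_n = r_n$ if and only if $f_\beta^n(t) \in (-\beta, \beta)$ and $s_n = r_n = 0$.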

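Let $\{n_i\}_{i=1}^{\infty}$ be the set of integers for which $f_\beta^{n_i}(t) \in (-\beta, \beta)$. Let $\mathcal{A}$ denote the
set of all sequences $\{a_0, a_1, a_2, \ldots\} \in \{0, 1\}^{\mathbb{N}}$ such that $a_n = 0$ if $n \notin \{n_i\}_{i=1}^{\infty}$. For
$a \in \mathcal{A}$, let

\[
x_a = \sum_{n=0}^{\infty} (s_n + a_n)\beta^n(1 - \beta) \qquad \text{and} \qquad y_a = \sum_{n=0}^{\infty} (r_n + a_n)\beta^n(1 - \beta).
\]

For any $a \in \mathcal{A}$, $x_a, y_a \in \Gamma_\beta$ (recall that $s_n = r_n = 0$ if $n \in \{n_i\}_{i=1}^{\infty}$), $x_a - y_a = x -
y = t$ and so $x_a \in \Gamma_\beta \cap (\Gamma_\beta + t)$. Let $X_{\mathcal{A}} = \{x_a : a \in \mathcal{A}\}$. So $X_{\mathcal{A}} \subset \Gamma_\beta \cap (\Gamma_\beta + t)$.
Let’s show that $X_{\mathcal{A}}$ is a Cantor set. Since $X_{\mathcal{A}} \subset \Gamma_\beta$, it is totally disconnected. We
need to show that it is closed and perfect. If $\{z_n\}$ is a sequence of points from $X_{\mathcal{A}}$
that converge to $z$, then $z \in \Gamma_\beta$ (since $\Gamma_\beta$ is closed). It’s not hard to show that for
any $M$ there is an $N$ such that for $n > N$ the addresses of $z_n$ and $z$ will agree in
the first $M$ places. This implies that $z \in X_{\mathcal{A}}$, so $X_{\mathcal{A}}$ is closed. To show that $X_{\mathcal{A}}$
is perfect, choose $a \in \mathcal{A}$ and define $a^k = \{a_0^k, a_1^k, a_2^k, \ldots\} \in \mathcal{A}$ by $a_n^k = a_n$ if
$n \le k$ and $a_n^k = 0$ if $n > k$. Then for all $k$, $x_{a^k} \in X_{\mathcal{A}}$ and the sequence $\{x_{a^k}\}_{k=0}^{\infty}$
converges to $x_a$. So $X_{\mathcal{A}}$ is a perfect set. □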


Exercise. When $\beta < 1/3$, show that every $t \in \Gamma_\beta - \Gamma_\beta$ has a unique series
representation

\[
t = \sum_{i=0}^{\infty} s_i \beta^i (1 - \beta) \qquad \text{where } s_i \in \{-1, 0, 1\}.
\]

Call $\{s_0, s_1, s_2, \ldots\}$ the address of $t$. Show that if $\{s_0, s_1, s_2, \ldots\}$ is the address of a
point $t$ in $\Gamma_\beta - \Gamma_\beta$ and $a$ is the cardinality of $\{n \mid s_n = 0\}$, then the cardinality of
$\Gamma_\beta \cap (\Gamma_\beta + t)$ is equal to $2^a$. (What about the case $\beta = 1/3$? See also [BM].)

Let’s summarize what we have proven. If $\beta < 1/3$, then $\Gamma_\beta - \Gamma_\beta$ is a Cantor set
with Lebesgue measure zero. If $\beta \in [1/3, 1/2)$, then $\Gamma_\beta - \Gamma_\beta = [-1, 1]$. In addition,
if $\beta \in [1/3, \sqrt{2} - 1)$, then for almost all $t \in [-1, 1]$, $\Gamma_\beta \cap (\Gamma_\beta + t)$ is a
Cantor set, but for some $t \in (-1, 1)$, $\Gamma_\beta \cap (\Gamma_\beta + t)$ is as small as one point. If
$\beta = \sqrt{2} - 1$, then $\Gamma_\beta \cap (\Gamma_\beta + t)$ is a Cantor set for all but a countably infinite
subset of $[-1, 1]$, and for $t \in (-1, 1)$, the smallest cardinality that $\Gamma_\beta \cap (\Gamma_\beta + t)$
can have is countably infinite. And if $\beta \in (\sqrt{2} - 1, 1/2)$, then $\Gamma_\beta \cap (\Gamma_\beta + t)$
contains a Cantor set for all $t \in (-1, 1)$. (The case where $\beta = \sqrt{2} - 1$ is actually
an exercise for the reader.) These results can be generalized to arbitrary Cantor
sets embedded in the real line if we replace the parameter $\beta$ with something called
the thickness of a Cantor set (see [K]).

In this article we have used the difference sets of middle-$\alpha$ Cantor sets as a way
to make concrete the intuitive idea that a middle-$\alpha$ Cantor set with $\alpha$ close to zero
is larger than a middle-$\alpha$ Cantor set with $\alpha$ close to one. We did this using just
cardinality and Lebesgue measure. Of course, what we lose by using such simple
concepts of size is the ability to distinguish between, say, the middle-$\alpha_1$ and
middle-$\alpha_2$ Cantor sets when both $\alpha_1$ and $\alpha_2$ are in $(1/3, 1)$. For this, more
sophisticated concepts such as Hausdorff or box-counting dimension are needed (see [E]).


Excerpt from “‘Billiards Is a Good Game’: Gamesmanship and America’s First
Nobel Prize Scientist” by Norman Maclean, The University of Chicago Magazine 67
(Summer 1975, pp. 19-23).


For instance, Leonard Eugene Dickson, the outstanding mathematician, who at 
the time was writing his classic works on the theory of numbers, was sometimes a 
poor card player. Anton J. Carlson was also not a good bridge player, although he 
was nationally famous as an exponent of the scientific method of biological 
sciences. 


Dickson, the master of numbers, was sometimes expectedly brilliant in a game 
where only 13 × 4 numbers were involved; his habitual troubles were at least
partly environmental—he had come here by way of Texas. He almost consistently 


overbid and, when he lost three or four hands in a row, he would slam his cards
down on the table and leave the room in a rage, always denouncing Carlson on the 
way out. No matter who had misplayed—Carlson, Michelson, or himself—he 
always denounced Carlson. While the cards were still shivering on the table he 
would shout, “Why the hell, Carlson, don’t you go back to your lab and feed your 
dogs? And don’t let Irene Castle catch you killing any of them.” 

Overbidding three or four hands in a row and then blaming the great biologist 
seemed to put the great mathematician in the right state of mind to race back to 
his office and resume his classic studies on the theory of numbers. 


Contributed by Jeffrey C. Lagarias 
AT&T Bell Laboratories 

600 Mountain Avenue, P.O. Box 636 
Murray Hill, NJ 07974-0636


Morphisms, Squarefree Strings, and the
Tower of Hanoi Puzzle


Jean-Paul Allouche, Dan Astoorian, Jim Randall, 
and Jeffrey Shallit 


1. INTRODUCTION. The Tower of Hanoi puzzle consists of three numbered pegs 
and N disks. Initially, the disks, which have radius 1,2,..., N, are all placed on 
peg 1 in increasing order of size, such that the smallest disk is on top, and the 
largest disk is at the bottom. See Figure 1.

Figure 1. The Tower of Hanoi Puzzle

A move of the puzzle consists of taking the top disk off one peg and moving it
to another. A move is called legal if it does not involve covering a smaller disk with
a larger one. The object of the puzzle is to transfer all of the disks from peg 1 to
one of the other two pegs, using only legal moves. We are most interested in the
optimal solutions to the puzzle; that is, solutions for N disks that use the smallest
number of legal moves.

At each step there are six possible moves: moving a disk from peg 1 to peg 2; 
from peg 2 to peg 3; from peg 3 to peg 1; and their inverses. (Of course, not all of 
these may be legal at any given time.) We code these moves using the letters $a, b, c$
and $\bar{a}, \bar{b}, \bar{c}$ respectively.

Thus, for example, we can transfer 3 disks from peg 1 to peg 2 using the 
following sequence of moves: 


\[ a\,\bar{c}\,b\,a\,c\,\bar{b}\,a. \]


This sequence is optimal—no shorter sequence of legal moves will work. Actually, 
it is easy to construct an optimal solution for any number of disks; we discuss this 
further in the next section. 

Fred Dixon, a student in an undergraduate course on formal languages taught 
by the fourth author, posed the following natural question: 

When performing an optimal solution to the Tower of Hanoi with N disks, does 
there ever occur a sequence of moves that is immediately repeated? 

In this article, we answer this question negatively. More precisely, we provide a 
self-contained proof, based on the theory of iterated morphisms, that optimal 
solutions to the puzzle are always squarefree. (More general results, including a 
discussion of the so-called “cyclic Tower of Hanoi’, will be given in [2].) We will 
use the Tower of Hanoi as our motivation to explore some concepts from formal 
language theory and combinatorics on words. 


History. The Tower of Hanoi puzzle has been around for at least a hundred years, 
and is a favorite subject of recreational mathematicians. It was apparently invented 
by Francois-Edouard-Anatole Lucas (1842-1891), a famous French number theo- 
rist and recreational mathematician [14]. 

A nearly complete set of references to the puzzle may be obtained by consulting 
the articles of Allouche and Dress [4] and Hinz [12]. 


2. THE CLASSICAL OPTIMAL SOLUTION. We now describe the classical solu- 
tion to the puzzle, which has the virtue of being optimal. To move N disks from 
peg 1 to peg 2, say, first move N — 1 disks (recursively) from peg 1 to peg 3, using 
peg 2 as intermediate storage. Once this is completed, move disk N from peg 1 to 
peg 2, and then finally move the N — 1 disks on peg 3 to peg 2 (recursively), using 
peg 1 as intermediate storage. Letting $T_N$ be the number of moves in this solution,
we see that $T_1 = 1$, and $T_N = 2T_{N-1} + 1$. By induction, one can easily prove that
$T_N = 2^N - 1$.
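The recursive description above translates directly into a short program. The Python sketch below is an illustration only; the names `MOVE_CODE` and `solve` are hypothetical, and since plain strings cannot carry bars, uppercase letters are used here (an assumption of this example) to stand for the barred moves, so that `A` means $\bar{a}$, and so on.

```python
# Move codes from the introduction: a: 1->2, b: 2->3, c: 3->1.
# Uppercase letters stand in for the barred (inverse) moves, e.g. "A" is 2->1.
MOVE_CODE = {(1, 2): "a", (2, 3): "b", (3, 1): "c",
             (2, 1): "A", (3, 2): "B", (1, 3): "C"}


def solve(n, src=1, dst=2):
    """Return the classical solution for n disks from src to dst as a string."""
    if n == 0:
        return ""
    spare = 6 - src - dst  # the peg that is neither src nor dst
    return (solve(n - 1, src, spare)      # move n - 1 disks out of the way
            + MOVE_CODE[(src, dst)]       # move disk n
            + solve(n - 1, spare, dst))   # move the n - 1 disks on top of it


for n in range(1, 8):
    assert len(solve(n)) == 2**n - 1  # T_N = 2^N - 1

print(solve(3))  # the 3-disk example from the introduction, with C, B for barred c, b
```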

That the classical solution is indeed optimal is not hard to see (although much 
fuss has been made on this point; see [10, 19]). To transfer N disks from one peg to 
another, we must at some point move disk N at least once. In order to move disk 
N, it must be alone on its peg, and some other peg must be empty; hence the 
remaining peg must contain the N — 1 smaller disks. Hence no algorithm can do 
better than to (i) move the first N — 1 disks to the same peg, using the optimal 
strategy; (ii) move disk N; and (iii) move the N — 1 disks again, using the optimal 
strategy, and covering disk N. Hence, if $T'(N)$ denotes the total number of moves
used by the optimal strategy to transfer N disks, we see that $T'(N) \ge 1 + 2T'(N - 1)$.
Since $T'(1) = 1$, we see that $T'(N) \ge 2^N - 1$, as desired.


3. MORPHISMS. One of the most important concepts in formal language theory
is that of the homomorphism, or just morphism for short. A morphism is a map $h$
that assigns a string $h(\ell)$ to each symbol $\ell$ of a finite alphabet $\Sigma$. This map is
extended to $\Sigma^*$, the set of all finite strings of symbols chosen from $\Sigma$, using the
rule $h(xy) = h(x)h(y)$.

For example, consider the map $\tau(0) = 01$ and $\tau(1) = 10$. Then $\tau(0110) =
\tau(0)\tau(1)\tau(1)\tau(0) = 01101001$.
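In code, a morphism on a finite alphabet can be stored as a table from letters to strings and extended to words letter by letter. The sketch below is one possible encoding; the name `apply_morphism` and the use of a Python dictionary are assumptions made for illustration.

```python
def apply_morphism(h, word):
    """Extend a letter-to-string table h to whole words via h(xy) = h(x)h(y)."""
    return "".join(h[letter] for letter in word)


tau = {"0": "01", "1": "10"}
print(apply_morphism(tau, "0110"))

# The defining rule h(xy) = h(x)h(y) holds by construction:
x, y = "01", "10"
assert apply_morphism(tau, x + y) == apply_morphism(tau, x) + apply_morphism(tau, y)
```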

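If the length $|h(\ell)|$ of the string $h(\ell)$ equals a constant $k$ for all letters $\ell \in \Sigma$,
then the morphism $h$ is said to be $k$-uniform. A simple example of a 1-uniform
morphism on the alphabet $\{a, b, c, \bar{a}, \bar{b}, \bar{c}\}$ is the morphism $\sigma$ defined as
follows:

\[
\begin{array}{ll}
\sigma(a) = b & \sigma(\bar{a}) = \bar{b} \\
\sigma(b) = c & \sigma(\bar{b}) = \bar{c} \\
\sigma(c) = a & \sigma(\bar{c}) = \bar{a}.
\end{array}
\]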


This particular morphism has the following interpretation in terms of the Tower
of Hanoi puzzle: if a string $x$ has the effect of moving some disks from peg 1 to
peg 2, using peg 3 as intermediate storage, then $\sigma(x)$ moves the same configuration
of disks from peg 2 to peg 3, using peg 1 as intermediate storage. This
morphism will play a major role in our results.

We define the morphism $\sigma^{-1}$ in the obvious way, namely:

\[
\begin{array}{ll}
\sigma^{-1}(a) = c & \sigma^{-1}(\bar{a}) = \bar{c} \\
\sigma^{-1}(b) = a & \sigma^{-1}(\bar{b}) = \bar{a} \\
\sigma^{-1}(c) = b & \sigma^{-1}(\bar{c}) = \bar{b}.
\end{array}
\]


4. FIXED POINTS OF MORPHISMS. Consider the morphism $\tau(0) = 01$ and
$\tau(1) = 10$ mentioned in Section 3. We can iterate such a map, using the definition
$\tau^{n+1}(x) = \tau(\tau^n(x))$ for $n \ge 0$, and $\tau^0(x) = x$. In this case, it is easy to verify that
the string $\tau^n(0)$ is a prefix of the string $\tau^{n+1}(0)$. Hence there is a unique infinite
sequence

\[ t = t_0 t_1 t_2 \cdots = 0110100110010110 \cdots \]

of which all of $\tau(0), \tau^2(0), \ldots$ are prefixes. Since $\tau(t) = t$, we call $t$ a fixed point
of the map $\tau$. (An annoying technicality is that we have only defined $\tau$ for finite
strings, but the extension of $\tau$ to infinite sequences is easy to define.) There is one
other fixed point $\bar{t}$ of this map $\tau$, obtained by replacing every 0 in $t$ with a 1, and
every 1 with a 0.

The infinite sequence $t$ is the celebrated Prouhet-Thue-Morse sequence
[16, 18, 15]. Our method for constructing $t$ is a particular case of the following
observation: if there is some symbol $a \in \Sigma$ such that $h(a) = ax$, where $x$ is
nonempty, then $h$ has a unique fixed point $p$ starting with $a$. If further $h$ is
non-erasing (i.e. $h(b)$ is not the empty string for any $b \in \Sigma$), then $p$ is an infinite
sequence. Note that the morphism $\tau$ that generates the Prouhet-Thue-Morse
sequence is 2-uniform.
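The prefix property that produces the fixed point is easy to observe by iterating $\tau$ on the seed 0. The following sketch is illustrative only; `apply_morphism` and `iterate` are hypothetical helper names, not notation from the article.

```python
def apply_morphism(h, word):
    """Apply a letter-to-string table h to every letter of word."""
    return "".join(h[letter] for letter in word)


def iterate(h, seed, n):
    """Return h^n(seed), with h^0(seed) = seed."""
    word = seed
    for _ in range(n):
        word = apply_morphism(h, word)
    return word


tau = {"0": "01", "1": "10"}
words = [iterate(tau, "0", n) for n in range(8)]

# Each tau^n(0) is a prefix of tau^(n+1)(0), and tau is 2-uniform.
for shorter, longer in zip(words, words[1:]):
    assert longer.startswith(shorter)
    assert len(longer) == 2 * len(shorter)

# The other fixed point starts from 1 and is the letterwise complement.
swap = {"0": "1", "1": "0"}
assert iterate(tau, "1", 7) == apply_morphism(swap, words[7])

print(words[4])
```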

Cobham [7] was the first to study in detail the properties of uniform morphisms. 
See also [1, 9]. 


5. CODING THE SOLUTION TO THE TOWER OF HANOI. As we have seen in
Section 2, for each integer $N \ge 0$, there exists a sequence of $2^N - 1$ moves that
constitutes an optimal solution to the Tower of Hanoi puzzle with N disks.
Actually, there are two different solutions: one that results in the disks ending up
on peg 2, and another that leaves all the disks on peg 3.

As above, we can code these solutions as strings of symbols, where each symbol 
represents a move. In what follows, we only consider the solution to the puzzle 
which takes the disks from peg 1 to peg 2 if N, the number of disks, is odd, and 
from peg 1 to peg 3 if N is even. This choice might at first seem unnatural, but its 
advantage is that the sequence of moves for N + 1 disks begins with the sequence 
of moves for N disks. Hence there is actually an infinite string of symbols 


\[ H = h_0 h_1 h_2 \cdots = a\,\bar{c}\,b\,a\,c\,\bar{b}\,a\,\bar{c}\,b\,\bar{a}\,c\,b\,a\,\bar{c}\,b \cdots \]


which codes the solution to the puzzle for N = 1,2,3,... disks. Another interpre- 
tation is that H solves the puzzle for an infinite number of disks! 

The infinite string H can be described as the limit of the sequence of strings
$(H_i)_{i \ge 0}$ where each $H_i$ is a string of length $2^i - 1$ that gives the solution to the
puzzle for $i$ disks. We can obtain a recursive formula for the $H_i$ using the
description of the optimal solution obtained previously:


Observation 1. We have $H_0 = \epsilon$, the empty string, and

\[ H_{2N+1} = H_{2N}\, a\, \sigma^{-1}(H_{2N}), \quad \text{for } N \ge 0; \tag{1} \]
\[ H_{2N} = H_{2N-1}\, \bar{c}\, \sigma(H_{2N-1}), \quad \text{for } N \ge 1. \tag{2} \]


The recursion formulas may appear mysterious at first, but they are actually
quite simple. For example, Eq. (2) says in order to solve the puzzle for 2N disks,
first move 2N − 1 disks from peg 1 to peg 2. Then, using the move $\bar{c}$, move the
2Nth disk from peg 1 to peg 3. Finally, move 2N − 1 disks from peg 2 to peg 3;
note this is accomplished using the morphism $\sigma$ that was introduced in Section 3.

Using these recursion formulas, we get, for example:

\[ H_1 = a; \]
\[ H_2 = a\,\bar{c}\,b; \]
\[ H_3 = a\,\bar{c}\,b\,a\,c\,\bar{b}\,a; \]
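The recursion of Observation 1 can be run mechanically. In the sketch below, which is only an illustration, uppercase letters stand for barred moves (an assumption of this example, since plain strings cannot carry bars), and `SIGMA`, `SIGMA_INV` and `hanoi_by_recursion` are hypothetical names.

```python
# sigma from Section 3; "A", "B", "C" stand for the barred letters.
SIGMA = {"a": "b", "b": "c", "c": "a", "A": "B", "B": "C", "C": "A"}
SIGMA_INV = {image: letter for letter, image in SIGMA.items()}


def apply_morphism(h, word):
    """Apply a letter-to-letter (or letter-to-string) table to a word."""
    return "".join(h[letter] for letter in word)


def hanoi_by_recursion(n):
    """Build H_n from H_0 = empty string using Eqs. (1) and (2)."""
    word = ""
    for k in range(1, n + 1):
        if k % 2 == 1:   # Eq. (1): H_k = H_{k-1} a sigma^{-1}(H_{k-1}), k odd
            word = word + "a" + apply_morphism(SIGMA_INV, word)
        else:            # Eq. (2): H_k = H_{k-1} c-bar sigma(H_{k-1}), k even
            word = word + "C" + apply_morphism(SIGMA, word)
    return word


for n in range(1, 5):
    print(n, hanoi_by_recursion(n))
```

Each $H_n$ produced this way is a prefix of $H_{n+1}$, which is what makes the infinite string H well defined.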


The discerning reader will note that the sequence H does not appear to be
periodic, but there is a kind of “pseudoperiodicity” of period 6. More precisely, we
have the following lemma (where the notation $x^\omega$ represents the infinite string
$xxx \cdots$):


Lemma 2. We have

\[ H = (aCbAcB)^\omega, \]

where $A$ is a symbol denoting either $a$ or $\bar{a}$, and similarly for the symbols $B$ and $C$.


Proof: It follows easily by induction on N that

\[ H_{2N+1} = (aCbAcB)^{(2^{2N} - 1)/3}\, a, \quad \text{for } N \ge 0; \tag{3} \]
\[ H_{2N} = (aCbAcB)^{(2^{2N-1} - 2)/3}\, aCb, \quad \text{for } N \ge 1. \tag{4} \]
□


6. SQUAREFREE STRINGS. We say a finite string $y$ is a factor of a (finite or
infinite) string $u$ if we can write $u = xyz$, where $x$ is a finite string. We say a (finite
or infinite) string $u$ is squarefree (or repeat-free) if it has no factors of the form $yy$,
where $y$ is a finite, nonempty string. While it is easy to see that no string of length
4 or more over an alphabet of two letters can be squarefree, Thue produced an
infinite squarefree string $s$ over a three-letter alphabet.

Similarly, we say a (finite or infinite) string $u$ is cubefree if it has no factors of
the form $yyy$, where $y$ is a finite, nonempty string. Thue showed that the infinite
sequence $t = 01101001 \cdots$, discussed in Section 4, is cubefree. Actually, he
showed even more: it is overlap-free. That is, it has no factors of the form $ayaya$,
where $a$ is a single letter.


History. Axel Thue initiated formal language theory in 1906 by studying squarefree 
and cubefree strings; see [18]. The book of Guy [11, Sect. E21] contains many 
references to papers on this topic. Also recommended is the paper of Bean, 
Ehrenfeucht, and McNulty [5] and the survey of Berstel [6]. 


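7. DESCRIBING THE HANOI SEQUENCE H VIA MORPHISMS. In this section,
we will prove the following surprising observation: we can describe the Hanoi
sequence H as the fixed point of a certain 2-uniform morphism $\varphi$, defined below.

\[
\begin{array}{ll}
\varphi(a) = a\bar{c} & \varphi(\bar{a}) = ac \\
\varphi(b) = c\bar{b} & \varphi(\bar{b}) = cb \\
\varphi(c) = b\bar{a} & \varphi(\bar{c}) = ba.
\end{array}
\]

This fact was first noted in the paper of Allouche, Betrema, and Shallit [3]; also
see Allouche and Dress [4].

Now it is readily observed that

\[ \varphi(H_0) = \epsilon; \]
\[ \varphi(H_1) = a\,\bar{c}; \]
\[ \varphi(H_2) = a\,\bar{c}\,b\,a\,c\,\bar{b}; \]
\[ \varphi(H_3) = a\,\bar{c}\,b\,a\,c\,\bar{b}\,a\,\bar{c}\,b\,\bar{a}\,c\,b\,a\,\bar{c}; \]

so we see that $\varphi(H_0)a = H_1$, $\varphi(H_1)b = H_2$, $\varphi(H_2)a = H_3$, and $\varphi(H_3)b = H_4$. We
might reasonably guess that this pattern repeats, and that $\varphi(H_n)a = H_{n+1}$ for $n$
even, and $\varphi(H_n)b = H_{n+1}$ for $n$ odd.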

To prove this guess, we first prove a lemma: 


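Lemma 3. Let $\Sigma = \{a, b, c, \bar{a}, \bar{b}, \bar{c}\}$. Then for $w \in \Sigma^*$, we have

\[ \varphi(\sigma(w)) = \sigma^{-1}(\varphi(w)); \tag{5} \]
\[ \varphi(\sigma^{-1}(w)) = \sigma(\varphi(w)). \tag{6} \]

Proof: It suffices to verify the two equations for each letter in $\Sigma$. □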


We can now prove our guess about the form of $\varphi(H_i)$:


Lemma 4. For all $i \ge 0$, we have $\varphi(H_{2i})a = H_{2i+1}$ and $\varphi(H_{2i+1})b = H_{2i+2}$.

Proof: By induction on $i$. The assertions are easily verified for $i = 0$. Then for all
$i > 0$,

\[
\begin{aligned}
\varphi(H_{2i})\,a &= \varphi(H_{2i-1}\,\bar{c}\,\sigma(H_{2i-1}))\,a && \text{(by (2))} \\
&= \varphi(H_{2i-1})\,b\,a\,\varphi(\sigma(H_{2i-1}))\,a \\
&= \varphi(H_{2i-1})\,b\,a\,\sigma^{-1}(\varphi(H_{2i-1})\,b) && \text{(by (5))} \\
&= H_{2i}\,a\,\sigma^{-1}(H_{2i}) && \text{(by induction)} \\
&= H_{2i+1}. && \text{(by (1))}
\end{aligned}
\]

The proof that $\varphi(H_{2i+1})b = H_{2i+2}$ is similar and is left to the reader. This
completes the proof of Lemma 4. □


As a corollary, we have 


Corollary 5. The Tower of Hanoi sequence H is a fixed point of the map $\varphi$; i.e.
$H = \varphi(H)$.


Proof: It suffices to give an infinite sequence of prefixes of H that are mapped by
$\varphi$ into longer prefixes of H. But this follows from Lemma 4, for we see that $\varphi(H_{2i})$
is a prefix of length $2^{2i+1} - 2$ of $H_{2i+1}$; hence a prefix of H. A similar statement
holds for $\varphi(H_{2i+1})$. □
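Lemma 4 and Corollary 5 can be spot-checked on finite prefixes. The Python sketch below is an illustration under stated assumptions: uppercase letters stand for barred moves, `PHI` encodes the morphism $\varphi$ of this section in that convention, and `hanoi_word` is a hypothetical helper that generates $H_n$ with the classical recursive solution (target peg 2 for odd $n$, peg 3 for even $n$).

```python
# phi from Section 7, with "A", "B", "C" standing for the barred letters.
PHI = {"a": "aC", "A": "ac", "b": "cB", "B": "cb", "c": "bA", "C": "ba"}
MOVE_CODE = {(1, 2): "a", (2, 3): "b", (3, 1): "c",
             (2, 1): "A", (3, 2): "B", (1, 3): "C"}


def apply_morphism(h, word):
    """Apply a letter-to-string table to every letter of word."""
    return "".join(h[letter] for letter in word)


def solve(n, src, dst):
    """Classical recursive solution for n disks from src to dst."""
    if n == 0:
        return ""
    spare = 6 - src - dst
    return solve(n - 1, src, spare) + MOVE_CODE[(src, dst)] + solve(n - 1, spare, dst)


def hanoi_word(n):
    """H_n: disks end on peg 2 if n is odd and on peg 3 if n is even."""
    return solve(n, 1, 2 if n % 2 == 1 else 3)


# Lemma 4: phi(H_n) followed by a (n even) or b (n odd) equals H_{n+1}.
for n in range(10):
    suffix = "a" if n % 2 == 0 else "b"
    assert apply_morphism(PHI, hanoi_word(n)) + suffix == hanoi_word(n + 1)

# Corollary 5 on a prefix: phi(H_9) is itself a prefix of H_10.
assert hanoi_word(10).startswith(apply_morphism(PHI, hanoi_word(9)))
```

A finite check of this kind does not replace the induction, but it catches transcription mistakes in the table for $\varphi$.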


8. THE HANOI SEQUENCE H IS SQUAREFREE. We are now ready to tackle 
the problem of showing that the Tower of Hanoi sequence H is squarefree. The 
proof we will present is due to the second and third authors, who were undergrad- 
uates at the University of Waterloo. 

We will need three lemmas. The first two lemmas, given in [4], follow easily 
from Lemma 2: 


Lemma 6. Ignoring the bars, the symbols in H are periodic of period 3. 
This lemma can also be found in [17]. 


Lemma 7.

\[
h_i =
\begin{cases}
a, & \text{if } i \equiv 0 \pmod 6; \\
b, & \text{if } i \equiv 2 \pmod 6; \\
c, & \text{if } i \equiv 4 \pmod 6.
\end{cases}
\]

Hence the symbols appearing in even-numbered positions in H are unbarred.
This lemma (essentially) can be found in [8].
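Lemmas 6 and 7 are statements about positions in H, so they are easy to test on a long prefix. In the sketch below (illustrative only), uppercase letters stand for barred moves and `hanoi_word` is a hypothetical helper built on the classical recursive solution; positions are numbered from 0, as in Lemma 7.

```python
MOVE_CODE = {(1, 2): "a", (2, 3): "b", (3, 1): "c",
             (2, 1): "A", (3, 2): "B", (1, 3): "C"}


def solve(n, src, dst):
    """Classical recursive solution for n disks from src to dst."""
    if n == 0:
        return ""
    spare = 6 - src - dst
    return solve(n - 1, src, spare) + MOVE_CODE[(src, dst)] + solve(n - 1, spare, dst)


def hanoi_word(n):
    """H_n: disks end on peg 2 if n is odd and on peg 3 if n is even."""
    return solve(n, 1, 2 if n % 2 == 1 else 3)


H = hanoi_word(10)  # a prefix of H of length 2^10 - 1

# Lemma 6: ignoring bars (here: ignoring case), H reads a c b a c b ...
lemma6_holds = all(H[i].lower() == "acb"[i % 3] for i in range(len(H)))

# Lemma 7: every even-numbered position carries an unbarred symbol.
lemma7_holds = all(H[i].islower() for i in range(0, len(H), 2))

print(lemma6_holds, lemma7_holds)
```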


Lemma 8. H does not contain four consecutive unbarred symbols. 


Proof: For if it did have four consecutive unbarred symbols, say $h_i h_{i+1} h_{i+2} h_{i+3}$,
then $i$ is either even or odd. If $i$ is even, say $i = 2k$, then since H is a fixed point
of $\varphi$, we can write $\varphi(h_k h_{k+1}) = h_i h_{i+1} h_{i+2} h_{i+3}$. But since only a barred symbol
has an image under $\varphi$ that consists of unbarred symbols, $h_k$ and $h_{k+1}$ must both
be barred, contradicting Lemma 7.

If $i$ is odd, then by Lemma 7, $h_{i-1}$ must be unbarred, and we can repeat the
same argument on $h_{i-1} h_i h_{i+1} h_{i+2}$. □

Here is the main theorem of our paper:


Theorem 9. The Tower of Hanoi sequence H contains no factor of the form xx, for x 
a nonempty string. 


Proof: Suppose H did contain a “square” factor

\[ xx = x_1 x_2 \cdots x_n x_1 x_2 \cdots x_n. \]

Then without loss of generality, we may assume that $xx$ is (i) as short as the
shortest square occurring in H and (ii) among all squares with this length, $xx$
occurs “earliest” in H.

We now divide the proof into three cases: (A) $|x|$ is odd; (B) $|x|$ is even, and
begins at an even-numbered position in H, and (C) $|x|$ is even, and begins at an
odd-numbered position in H.

Case (A). If $n = |x|$ is odd, then it is easy to see from Lemma 7 that all the
symbols of $x$ must be unbarred. (For example, if $xx = h_j h_{j+1} \cdots
h_{j+n-1} h_{j+n} h_{j+n+1} \cdots h_{j+2n-1}$ for $j$ even, then $h_j = x_1$, $h_{j+2} = x_3, \ldots, h_{j+n-1} =
x_n$ are all unbarred, and $h_{j+n+1} = x_2$, $h_{j+n+3} = x_4, \ldots, h_{j+2n-2} = x_{n-1}$ are also
unbarred. A similar argument applies if $j$ is odd.) This contradicts Lemma 8 if
$n \ge 3$, and if $n = 1$, it contradicts Lemma 6.

Case (B). Suppose $n = |x|$ is even, say $n = 2m$, and that our supposed square
$xx$ begins at an even-numbered position in H. More precisely, assume that

\[ xx = h_{2j} h_{2j+1} \cdots h_{2j+2m-1} h_{2j+2m} h_{2j+2m+1} \cdots h_{2j+4m-1}. \]

Then since H is a fixed point of $\varphi$, we have

\[ \varphi(h_j h_{j+1} \cdots h_{j+m-1} h_{j+m} h_{j+m+1} \cdots h_{j+2m-1}) = xx. \]

Since each letter has a distinct image under $\varphi$, we must have $h_j = h_{j+m}$, $h_{j+1} =
h_{j+m+1}, \ldots, h_{j+m-1} = h_{j+2m-1}$. Thus $h_j h_{j+1} \cdots h_{j+2m-1} = yy$, where $y = h_j h_{j+1}
\cdots h_{j+m-1}$. But then $yy$ is a nonempty square that is shorter than $xx$, a
contradiction.


Case (C). Suppose $n = |x|$ is even, and that $xx$ begins at an odd-numbered
position in H. Thus we can write

\[ q\,xx = q\, h_{2j+1} \cdots h_{2j+n} h_{2j+n+1} \cdots h_{2j+2n} \]

for some symbol $q = h_{2j}$. Now by Lemma 6, $n$ must be divisible by 3; hence $n \equiv 0$
(mod 6). Then by Lemma 7, $q = h_{2j+n}$. Hence $q = x_n$. Thus

\[ x_n x_1 x_2 \cdots x_{n-1} x_n x_1 x_2 \cdots x_{n-1} = h_{2j} \cdots h_{2j+2n-1} \]

is a square of the same length as $xx$, but occurring earlier in H, a contradiction.
This completes the proof of our theorem. □
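The statement of Theorem 9 can be tested by brute force on a prefix of H. The following sketch is a naive illustration, not an efficient algorithm; `has_square` and `hanoi_word` are hypothetical names, and uppercase letters stand for barred moves, so `a` and `A` count as different symbols.

```python
MOVE_CODE = {(1, 2): "a", (2, 3): "b", (3, 1): "c",
             (2, 1): "A", (3, 2): "B", (1, 3): "C"}


def solve(n, src, dst):
    """Classical recursive solution for n disks from src to dst."""
    if n == 0:
        return ""
    spare = 6 - src - dst
    return solve(n - 1, src, spare) + MOVE_CODE[(src, dst)] + solve(n - 1, spare, dst)


def hanoi_word(n):
    """H_n: disks end on peg 2 if n is odd and on peg 3 if n is even."""
    return solve(n, 1, 2 if n % 2 == 1 else 3)


def has_square(word):
    """Return True if word contains a factor yy with y nonempty."""
    n = len(word)
    for length in range(1, n // 2 + 1):
        for start in range(n - 2 * length + 1):
            middle = start + length
            if word[start:middle] == word[middle:middle + length]:
                return True
    return False


print(has_square(hanoi_word(9)))      # prefix of H of length 511
print(has_square("aCbaCb"))           # an immediately repeated block of moves
```

Ignoring the bars destroys the property: by Lemma 6 the unbarred word is periodic of period 3, so the distinction between a move and its inverse is essential to the theorem.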


9. CONCLUSIONS. We have given a brief tour of the theory of iterated morphisms
and squarefree strings, and used this theory to prove that no sequence of
moves is ever immediately repeated when performing the optimal solution to the
Tower of Hanoi puzzle.

There are many interesting topics we could not explore further in this short
article. For example, the fixed points of k-uniform homomorphisms are strongly
related to the so-called “automatic sequences,” which have many interesting
number-theoretic properties. The interested reader is referred to the survey of
Allouche [1].

The study of squarefree and cubefree sequences properly belongs to the field of
“combinatorics on words”; for further information, the reader may want to study
the book of Lothaire [13].


NOTES


Edited by: John Duncan 


Sierpinski’s Theorem is Deducible 
from Euler and Dirichlet 


A. A. Ageev 


A famous theorem of Dirichlet says that the sequence

\[ an + b, \quad n = 1, 2, \ldots \]

contains infinitely many primes if $a$ and $b$ are relatively prime integers. It is a
similar long-standing conjecture that the quadratic sequence

\[ n^2 + t, \quad n = 1, 2, \ldots \]

contains an infinite number of primes for any positive integer $t$. The first result in
this direction is due to Sierpinski [6] who showed that for any $M$ there exists $t'$
such that the sequence

\[ n^2 + t', \quad n = 1, 2, \ldots \]

contains at least $M$ primes. Recently several new proofs and extensions of
Sierpinski’s theorem have appeared [4][3][1].

In this note we show that a slightly stronger result can be easily derived from 
the following well-known number theory facts: 


Theorem 1 (Euler) ([5, Theorem 251]) Every prime of the form $4n + 1$ is representable
as a sum of two squares. This representation is unique up to the order of the
summands.
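Euler's theorem can be checked for small primes by listing representations as an odd square plus an even square. The sketch below is illustrative only: `is_prime` and `representations` are hypothetical names, trial division is used for simplicity, and the search bound 2000 is arbitrary.

```python
from math import isqrt


def is_prime(n):
    """Trial-division primality test, adequate for small n."""
    if n < 2:
        return False
    for d in range(2, isqrt(n) + 1):
        if n % d == 0:
            return False
    return True


def representations(p):
    """All pairs (k, l) of nonnegative integers with p = (2k + 1)^2 + 4 l^2."""
    reps = []
    k = 0
    while (2 * k + 1) ** 2 <= p:
        rest = p - (2 * k + 1) ** 2
        if rest % 4 == 0:
            l = isqrt(rest // 4)
            if 4 * l * l == rest:
                reps.append((k, l))
        k += 1
    return reps


for p in range(2, 2000):
    if is_prime(p) and p % 4 == 1:
        assert len(representations(p)) == 1  # exactly one odd^2 + even^2 form

print(representations(13), representations(29))
```

Since one of the two squares must be odd and the other even, fixing which summand is odd removes the ambiguity in the order of the summands.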


Theorem 2 (Dirichlet) ([2, p. 34]) The series

\[ \sum_{\text{prime } p \equiv b \ (\mathrm{mod}\ a)} 1/p, \]

where $a$, $b$ are relatively prime integers, diverges.


Denote by $P$ the set of primes of the form $4n + 1$. Note that by Theorem 1 the
set of primes in $\{(2k + 1)^2 + 4l^2 \mid k, l \in Z_+\}$ coincides with $P$. For any $k \in Z_+$,
denote by $P_k$ the set of primes in $\{4l^2 + 4k(k + 1) + 1 \mid l \in Z_+\}$. Now Theorem 1
can be reformulated in the following way:

\[ \bigcup_{k \in Z_+} P_k = P, \tag{1} \]
\[ P_{k'} \cap P_{k''} = \emptyset \quad \text{for all } k', k'' \in Z_+ \text{ such that } k' \ne k''. \tag{2} \]

Theorem 3. For any $k_0, M \ge 0$ and any $\delta \in [0, 1[$ there exists a positive integer
$k^* \ge k_0$ such that

\[ |P_{k^*}| \ge M(2k^* + 1)^\delta. \]

(To obtain Sierpinski’s theorem, take $k_0 = \delta = 0$.)


Proof: By (1) and (2) we may formally write

\[ \sum_{p \in P} 1/p = \sum_{k \in Z_+} \sum_{p \in P_k} 1/p. \tag{3} \]

Note first that

\[ \sum_{p \in P_k} 1/p \le \sum_{l \in Z_+} \frac{1}{4l^2 + 1} < +\infty \quad \forall k \in Z_+. \tag{4} \]

Now assume to the contrary that, for some $M, k_0 \ge 0$ and some $\delta \in [0, 1[$,

\[ |P_k| < M(2k + 1)^\delta \quad \forall k \in Z_+ \cap [k_0, +\infty[. \]

This implies that, for any $k \in Z_+ \cap [k_0, +\infty[$,

\[ \sum_{p \in P_k} 1/p \le \frac{|P_k|}{4k(k + 1) + 1} < \frac{M}{(2k + 1)^{2 - \delta}} \le \frac{M}{(k + 1)^{2 - \delta}}. \tag{5} \]

It follows from (4) and (5) that the series in the right hand side of (3) converges
and thereby so does the series in the left hand side. But this contradicts the
statement of Theorem 2. □


The following generalization is in fact an easy consequence of Theorem 3. 


Corollary 1. For any $k_0, M, r, s \ge 0$ and any $\delta \in [0, 1[$ there exists a positive
integer $k^* \ge k_0$ such that

\[ |P_{k^*}| \ge M(rk^* + s)^\delta. \]
□


On Nonnegativity of Symmetric
Polynomials


F. Matus 


The nonnegativity of elementary symmetric polynomials (cf. [5])

\[ x_1 + x_2 + \cdots + x_n, \quad x_1 x_2 + x_1 x_3 + \cdots + x_{n-1} x_n, \quad \ldots, \quad x_1 x_2 \cdots x_n \]

implies the nonnegativity of all $x_1, x_2, \ldots, x_n$ in the domain of real numbers, see
[3], Problem 10C. For complex numbers this ceases to be valid and thus to get the
same conclusion we must assume more. The nonnegativity of the polynomials
below in all powers $k \ge 1$ of the arguments

\[ x_1^k + x_2^k + \cdots + x_n^k, \quad x_1^k x_2^k + x_1^k x_3^k + \cdots + x_{n-1}^k x_n^k, \quad \ldots, \quad x_1^k x_2^k \cdots x_n^k \]

is also not yet enough. Namely, the choice of all $n$-th roots of unity, $n \ge 3$ prime,
renders $x_1^k, x_2^k, \ldots, x_n^k$ to be all the roots or all ones, and having in mind the Viète
formulae for roots of unity we see immediately the claimed nonnegativity.
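The roots-of-unity counterexample can be checked numerically. The sketch below is an illustration in floating-point arithmetic, so comparisons use a small tolerance; the names `elementary_symmetric` and `power_polynomials`, and the tolerance value, are assumptions made for this example.

```python
import cmath


def elementary_symmetric(values):
    """Return [e_0, e_1, ..., e_n], the elementary symmetric polynomials of values."""
    coeffs = [1 + 0j]
    for v in values:
        nxt = coeffs + [0j]
        for j in range(1, len(nxt)):
            nxt[j] += coeffs[j - 1] * v
        coeffs = nxt
    return coeffs


def power_polynomials(n, k):
    """e_1, ..., e_n evaluated at the k-th powers of all n-th roots of unity."""
    roots = [cmath.exp(2j * cmath.pi * m / n) for m in range(n)]
    return elementary_symmetric([r ** k for r in roots])[1:]


TOL = 1e-9
for n in (3, 5, 7):                      # odd primes
    for k in range(1, 2 * n + 1):
        for value in power_polynomials(n, k):
            assert abs(value.imag) < TOL and value.real > -TOL

print([round(v.real, 6) for v in power_polynomials(5, 1)])
print([round(v.real, 6) for v in power_polynomials(5, 5)])
```

When $n$ does not divide $k$ the values are $0, \ldots, 0, 1$ up to rounding, and when $n$ divides $k$ they are binomial coefficients; in both cases they are nonnegative although most of the $x_i$ are not even real.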

Where $\alpha = (\alpha_1, \ldots, \alpha_n)$ is an $n$-tuple of nonnegative integers and $x =
(x_1, \ldots, x_n)$ is an $n$-tuple of complex numbers, we shall write $x^\alpha = \prod_{i=1}^{n} x_i^{\alpha_i}$. The
symmetric polynomials (called in [6] “Potenzproduktsummen”)

\[ p_{[\alpha]}(x_1, \ldots, x_n) = p_{[\alpha]}(x) = \sum_{\beta \in [\alpha]} x^\beta, \quad x \in C^n, \]

where $\beta$ ranges over the set $[\alpha]$ of all permutations of $\alpha$, include all the
polynomials displayed above. For them $\alpha$ is to be taken as $k\iota_M$, where $M$ is a
nonempty subset of $N = \{1, 2, \ldots, n\}$ and $\iota_M$ is the indicator function of $M$. A
natural question asks which of these polynomials must be supposed nonnegative in
order to conclude that all the arguments are nonnegative. More formally, denoting
by $\mathcal{A}$ the family of all our $n$-tuples $\alpha$, find all $\mathcal{B} \subset \mathcal{A}$ such that $x \in C^n$ and
$p_{[\alpha]}(x) \ge 0$, $\alpha \in \mathcal{B}$, imply $x_i \ge 0$, $i \in N$.

The aim of this short note is first to show that any $\mathcal{B}$ with finite complement in
$\mathcal{A}$ will suffice. Then we explore the same question replacing the complex numbers
by square complex matrices with either the usual matrix product under commutativity
assumption or the coordinatewise Hadamard product. At the same time the
nonnegativity is considered either in all coordinates at once or as positive
semidefiniteness.


Theorem. Let $x \in C^N$ and let the values $p_{[\alpha]}(x)$ be nonnegative for all $\alpha \in \mathcal{A}$.
Then necessarily all coordinates $x_i$, $i \in N$, are nonnegative.


Proof: We proceed by induction on the cardinality $n$ of the set $N$. The case $n = 1$
is trivial. Let $n \ge 2$ and $x \in C^N$, having at least one nonzero coordinate, fulfill the
assumptions of the theorem. If the support $M = \{i \in N;\ x_i \ne 0\} \ne \emptyset$ of $x$ is a
proper subset of $N$ then the restriction $y = x|_M \in C^M$ satisfies $p_{[\alpha|_M]}(y) = p_{[\alpha]}(x)$
for all $\alpha \in \mathcal{A}$ such that $\alpha|_{N - M} = 0$, and then the induction argument yields
$x_i = y_i > 0$, $i \in M$. So we now suppose that $\prod_{i \in N} x_i \ne 0$.

Let us suppose first that we can partition the set $N$ into two nonempty subsets $I$
and $J$ such that $|x_i| > |x_j|$ for $i \in I$ and $j \in J$. Where $b$ is an upper bound of the
set $\{|x_i|;\ i \in N\}$ and an $n$-tuple $\alpha$ is zero on $I$, we can estimate the difference


Phare) []x;" — Pre ( *lz) 
LE 


< E {lx T]lx,|-*; B € [a + kt], aj EJ: B; > 0| 
1el 


De 
< n!bie% r{ TT lx,“ TT “3 ON, #7, II! = i}, k>1. 
iel 


vel’ 
The point is, due to the choice of the partition, that 


Pratku;\(*) 


Pia *I1) = iim. xy ker 


However, x‘! is a positive number since the numbers x,, i © N, being roots of the 
polynomial equation £7_,(-1)'c,x”~' = 0 with the nonnegative coefficients c, = 
Piax) (a’s are indicators), can be either nonnegative or match into pairs of 
mutually adjoint complex numbers. This enables effective use of the induction 
argument to establish x, > 0,i € J. A little reflexion about the vector y = (x; '');-y 
and the polynomial values 


1 xP Piki ~a(*) 
a0) op a 
Bé[a] Belkiy—a] 


(k is taken sufficiently high) persuades us that the above reasoning, applied to y, 
gives x, > 0, ) EJ. 

It remains to analyse the case |x,| =a, i€ N. Due to homogeneity of the 
polynomials there is no loss of generality in assuming a = 1. Denoting by %, the 
family {a € YL; a; < m, i © N} we can write 


a*)= YL pai= Y YxF= YO [][ xe =T] Ex), mao. 


[a]o,, [elo Bela] aeH, 1EN iEN 1=0 


Now, no x;, i © N, can be a root of unity different from 1 for x” = 1 would imply 
Iim-%x) = 0, k = 1, and this yields p,,(x) = 0 for every a € &, contradicting 
Il,<*; = 1. If one of the coordinates x; is, however, not a root of unity then an 
old number-theoretic fact says the sequence (x*, k > 1) is dense in the unit circle 
and, by compactness, for some numbers k, < k, < --: every sequence Cx}, 
1 > 1) converges to some y,, j € N. The limit point y, can be adjusted to — 1. The 
continuity of the polynomials implies that y = (y,);-y satisfies the nonnegativity 
assumptions, which contradicts the presence of —1 among its coordinates. It 
follows that x, = 1,1 EN. = 


Remarks. 1. When the word “nonnegative” is replaced in the formulation of the 
theorem by “positive” the new claim becomes, obviously, valid. Let us mention 
that we encountered the need for these assertions during an analysis of special 
symmetric random arrays. 

2. The assumptions of the theorem can be further weakened. For example, the
nonnegativity of the polynomials can be required only for $\alpha \in \mathcal{B} = \mathcal{A} - \mathcal{A}_m$
(apply the theorem to $(x_i^{m+1})_{i \in N}$ and $(x_i^{m+2})_{i \in N}$, consecutively).

Corollary. Let $X_1, X_2, \ldots, X_n$ be normal and pairwise commuting square complex
matrices. The polynomials $p_{[\alpha]}(X_1, \ldots, X_n)$, $\alpha \in \mathcal{A}$, are positively semidefinite if and
only if all the matrices $X_1, \ldots, X_n$ possess this property.


Proof: By [2] (Ch. IX, Par. 15) there exists a unitary matrix $U$ ($UU^*$ is the identity
matrix) such that all matrices $U X_i U^*$, $i \in N$, are diagonal. Then $p_{[\alpha]}(X_1, \ldots, X_n)
\ge 0$ (a self-explaining abbreviation for positive semidefiniteness), $\alpha \in \mathcal{A}$, if and
only if

\[ U p_{[\alpha]}(X_1, \ldots, X_n) U^* = p_{[\alpha]}(U X_1 U^*, \ldots, U X_n U^*) \ge 0, \quad \alpha \in \mathcal{A}, \]

and by the theorem this is equivalent to the nonnegativity of all elements of the
diagonal matrices $U X_i U^*$, $i \in N$, i.e., to $U X_i U^* \ge 0$ and finally to $X_i \ge 0$, $i \in N$.
□


Remarks. 1. The assumption of normality cannot be omitted as is easily seen from 
the example of the following two commuting matrices 


xX, = F 1 and X, = F | 
Note that any symmetric polynomial of X, and X, is a multiple of the identity 
matrix. 
2. If we want to drop the assumption of commutativity we must redefine the 
symmetric polynomials Pray No reasonable generalization of the corollary can be 
expected as there exist two matrices 


x= (9 1 | = 0 x= (1 1] = 0. 


such that the Hermitian matrix $X_1X_2 + X_2X_1$ is not positively semidefinite (see
[1]).

3. If the matrices of $X = (X_i)_{i \in N}$ are nonnegative (coordinatewise) then
trivially also all polynomials $p_{[\alpha]}(X)$ are nonnegative, but the converse is not valid
as the following matrices


17, 4 tf 4-1 
= - X. =- 
* si 1 2 |i 1 
testify (X, and X, are idempotent and they commute with X,X, = 0). 
4. Let us consider the symmetric polynomials $p_{[\alpha]}(X_1, \ldots, X_n)$ built as above
but with the Hadamard product in place of the usual one (see [4]). Since this
product preserves positive semidefiniteness, once the matrices $X = (X_i)_{i \in N}$ have this
property all their polynomials $p_{[\alpha]}(X)$ share it, too. The converse is, however, not
the case: the polynomials $p_{[\alpha]}$ in the two matrices


41 1 _10 O 
w=19 0) %=[f || 
are positively semidefinite. 
New Tricks for Old Trees: Maps and the
Pigeonhole Principle 


N. Graham, R. C. Entringer, and L. A. Szekely 


A TRICKY PROOF. The second author conjectured and the first author proved
the following theorem on spanning trees of the hypercube $Q_n$:


Theorem 1. For every spanning tree T of the hypercube $Q_n$, there exists an edge e in
$E(Q_n) - E(T)$ whose addition to T forms a cycle of length at least 2n.


Proof: Let T be any spanning tree in $Q_n$. For any vertex $v$ of $Q_n$ there is an
antipodal vertex $\bar v$ in $Q_n$. There is a unique path in T from $v$ to $\bar v$; orient its first
edge towards $\bar v$ and repeat this process for every vertex $v$. Since the tree T has
fewer edges than vertices, the pigeonhole principle implies there is an edge $(u, v)$
which has received two orientations. The distance between $u$ and $\bar u$ (and between
$v$ and $\bar v$) is n in $Q_n$ and hence is at least n in T. Finally, if $(u, v)$ is an edge in $Q_n$
then $(\bar u, \bar v)$ also is an edge in $Q_n$. Hence $(u, v)$ and $(\bar u, \bar v)$ and the edges of the
$u$-$\bar u$ and $v$-$\bar v$ paths yield the cycle sought. □
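
The orientation argument is constructive, so it can be tried out on small cases. The following is a minimal sketch, not part of the original note: the function names and the randomized search used to produce spanning trees are assumptions made for illustration. Vertices of $Q_n$ are encoded as the integers 0 to 2^n - 1, so the antipode of v is v with all n bits flipped. The sketch assumes n ≥ 2.

```python
from collections import deque
import random


def hypercube(n):
    """Adjacency lists of Q_n; vertices are the integers 0 .. 2**n - 1."""
    return {v: [v ^ (1 << i) for i in range(n)] for v in range(1 << n)}


def random_spanning_tree(adj, seed=0):
    """Spanning tree grown by a randomized graph search (illustrative helper)."""
    rng = random.Random(seed)
    root = next(iter(adj))
    seen = {root}
    tree = {v: set() for v in adj}
    frontier = [root]
    while frontier:
        v = frontier.pop(rng.randrange(len(frontier)))
        for w in adj[v]:
            if w not in seen:
                seen.add(w)
                tree[v].add(w)
                tree[w].add(v)
                frontier.append(w)
    return tree


def tree_path(tree, s, t):
    """Unique path from s to t in the tree, as a list of vertices."""
    parent = {s: None}
    queue = deque([s])
    while queue:
        v = queue.popleft()
        if v == t:
            break
        for w in tree[v]:
            if w not in parent:
                parent[w] = v
                queue.append(w)
    path = [t]
    while path[-1] != s:
        path.append(parent[path[-1]])
    return path[::-1]


def long_cycle_edge(n, tree):
    """Return (non-tree edge, length of the cycle it closes) via double orientation."""
    mask = (1 << n) - 1
    # Orient the first edge of the tree path from v to its antipode.
    first_step = {v: tree_path(tree, v, v ^ mask)[1] for v in tree}
    # Pigeonhole: some tree edge {u, v} is oriented both ways.
    for u, v in first_step.items():
        if first_step[v] == u:
            ubar, vbar = u ^ mask, v ^ mask
            # The tree path from ubar to vbar plus the edge (ubar, vbar) is the
            # cycle; its length equals the number of vertices on that path.
            return (ubar, vbar), len(tree_path(tree, ubar, vbar))
    raise AssertionError("unreachable for a spanning tree")


if __name__ == "__main__":
    n = 4
    edge, length = long_cycle_edge(n, random_spanning_tree(hypercube(n), seed=1))
    print(edge, length, length >= 2 * n)
```

The search over `first_step` is just the pigeonhole principle made explicit: there are 2^n oriented first edges but only 2^n - 1 tree edges.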


Theorem 1 is clearly best possible since adding any further edge to any 
breadth-first search tree T yields a cycle of length 2n or less. Theorem 1 also 
implies the result that any spanning tree T of Q, has a diameter which is at least 
2n — 1 [6]. 

After introducing some definitions we show that the proof technique for 
Theorem 1 applies to several tree properties. In a connected graph G we denote 
by d(x, y) the distance between vertices x and y. 


Definition. The eccentricity of $v \in V(G)$ is given by $\mathrm{ecc}(v) = \max_{u \in V(G)} d(u, v)$.


Definition. The graph G has an antipodal isomorphism if 

(i) ecc(v) is the diameter of G for every $v \in V(G)$,

(ii) for every $v \in V(G)$ there exists a unique $\bar v \in V(G)$ such that $d(v, \bar v)$ is the
diameter of G, and

(iii) the map $\phi: V(G) \to V(G)$ defined by $\phi(v) = \bar v$ is an isomorphism of G.

Theorem 1’. Assume the graph G has an antipodal isomorphism. Then, for every
spanning tree T of G, there exists an edge e in $E(G) - E(T)$ whose addition to T
forms a cycle of length at least two times the diameter of G. □


Some other examples of graphs admitting an antipodal automorphism are the 
one-dimensional skeletons of regular and semi-regular polytopes which admit a 
central symmetry [4] and toroidal grids (of which the hypercube is a special case). 


MORE THAN A TRICK. We soon realized that the method of the proof of 
Theorem 1 applies to a number of “little theorems” or exercises, yielding shorter 
proofs than the usual ones. The standard proofs for most of these exercises can be 
found in Lovász [7, Chapter 6, pp. 41-42].


Definition. The center of a tree T is the set of vertices v in V(T) with minimum 
eccentricity ecc(v). 


Theorem 2 (Jordan [8]). The center of a tree T contains either one vertex or two 
adjacent vertices. 


Proof: For every vertex v find a vertex v' such that d(v, v') = ecc(v). Orient the
first edge of the unique v-v' path towards v'. By the pigeonhole principle, some
edge (p, q) receives two orientations. Observe that any vertex $v \ne p$ in the
component with p in $T - (p, q)$ has ecc(v) > ecc(p) and any vertex $v \ne q$ in the
component with q in $T - (p, q)$ has ecc(v) > ecc(q). Assume, without loss of
generality, that $\mathrm{ecc}(p) \ge \mathrm{ecc}(q)$. If ecc(p) > ecc(q), then q alone realizes the
minimum eccentricity. Otherwise ecc(p) = ecc(q) and the second possibility in the
theorem holds. □
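
The proof of Theorem 2 translates directly into a procedure for locating the center. The sketch below is an illustration only (the function names are assumptions, and a tree is given as a dictionary of adjacency lists with at least two vertices): it orients one first edge per vertex, finds the doubly oriented edge (p, q), and compares ecc(p) with ecc(q).

```python
from collections import deque


def distances(tree, s):
    """Breadth-first distances from s to every vertex of the tree."""
    dist = {s: 0}
    queue = deque([s])
    while queue:
        v = queue.popleft()
        for w in tree[v]:
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return dist


def center_by_orientation(tree):
    """Center of a tree with at least two vertices, via the doubly oriented edge."""
    dist = {v: distances(tree, v) for v in tree}
    ecc = {v: max(dist[v].values()) for v in tree}
    first_step = {}
    for v in tree:
        # Some vertex v' with d(v, v') = ecc(v).
        far = max(tree, key=lambda x: dist[v][x])
        # The first edge of the v-v' path leads to the neighbor closer to v'.
        first_step[v] = next(w for w in tree[v] if dist[w][far] == dist[v][far] - 1)
    # Pigeonhole: some edge (p, q) is oriented in both directions.
    p, q = next((v, w) for v, w in first_step.items() if first_step[w] == v)
    if ecc[p] == ecc[q]:
        return {p, q}
    return {p} if ecc[p] < ecc[q] else {q}


if __name__ == "__main__":
    path5 = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
    path4 = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
    print(center_by_orientation(path5), center_by_orientation(path4))
```

Computing all pairwise distances is wasteful, but it keeps the sketch close to the wording of the proof.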


Definition. The maximal subtrees containing a vertex v of a tree T as an endvertex 
are called the branches of T at v. The weight of a branch B, denoted by bw(B), is 
the number of edges in it. The branch weight of a vertex v, bw(v), is the maximum 
weight of a branch at v. The centroid of a tree T is the set of vertices v of T with 
minimum branch weight bw(v). 


Theorem 3 (Jordan [8]). The centroid of a tree T contains either one vertex or two 
adjacent vertices. 


Proof: For every vertex v find a branch $B_v$ of maximum weight at the vertex v.
Orient the only edge of $B_v$ incident with v towards $B_v$. The rest of the proof is
identical with that of Theorem 2, changing ecc to bw. □


Definition. An endomorphism of a tree T is a map $\phi: V(T) \to V(T)$ with the
property that for any $(p, q) \in E(T)$ either $\phi(p) = \phi(q)$ or $(\phi(p), \phi(q)) \in E(T)$.


Theorem 4 (Lovász [7]). Any endomorphism $\phi$ of a tree T has either a fixed vertex
or a fixed edge.


Proof: Assume $\phi$ has no fixed vertex. Then, for every vertex v, there is a unique
non-trivial path in T from v to $\phi(v)$. Orient the first edge of this path towards
$\phi(v)$. Repeating this for every vertex v implies, by the pigeonhole principle, the
existence of an edge (p, q) with two orientations. Hence the edge (p, q) separates
p from $\phi(p)$ and q from $\phi(q)$. Since $\phi$ is an endomorphism, the only possibility is
$\phi(p) = q$ and $\phi(q) = p$, i.e., (p, q) is a fixed edge. □


Theorem 5 (Clarke [3, Theorem 3], Caro and Schönheim [5, Theorem 2]). A tree T
has a perfect matching if and only if deleting any vertex v yields exactly one odd 
component. 


Proof: The condition for the perfect matching is necessary since, deleting v, the
component containing the mate of v in the matching is odd and no other
component is odd. The condition is sufficient for the following reason. For any
vertex v, there is a unique edge e joining v and the odd component of T - v;
orient this edge e toward the odd component. Denote the other end vertex of e by
v'. Observe that in T - v' the component containing v is odd (since T has an even
number of vertices). Therefore, repeating the same procedure with v' instead of v,
we obtain e with the opposite orientation. Repeating the procedure for all
vertices, we obtain a set of edges with two orientations which cover every vertex
exactly once, i.e., a perfect matching. □
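
The sufficiency argument also yields a simple way to compute the matching: for every vertex v, look for the unique odd component of T - v and pair v with its neighbor in that component. Here is a minimal sketch under stated assumptions (the function name and the adjacency-dictionary representation are chosen for illustration; component sizes are obtained from subtree sizes with respect to an arbitrary root).

```python
def tree_perfect_matching(tree):
    """Return the perfect matching of a tree as a set of frozenset edges, or None."""
    n = len(tree)
    root = next(iter(tree))
    # Breadth-first order and parent pointers from an arbitrary root.
    parent = {root: None}
    order = [root]
    for v in order:  # the list grows while we iterate over it
        for w in tree[v]:
            if w not in parent:
                parent[w] = v
                order.append(w)
    # Subtree sizes, accumulated from the leaves upward.
    size = {v: 1 for v in tree}
    for v in reversed(order):
        if parent[v] is not None:
            size[parent[v]] += size[v]
    mate = {}
    for v in tree:
        # The component of T - v containing neighbor w has size[w] vertices
        # if w is a child of v, and n - size[v] vertices if w is v's parent.
        odd = [w for w in tree[v]
               if (size[w] if parent[w] == v else n - size[v]) % 2 == 1]
        if len(odd) != 1:
            return None  # the condition of Theorem 5 fails at v
        mate[v] = odd[0]
    return {frozenset((v, w)) for v, w in mate.items()}


if __name__ == "__main__":
    path4 = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
    print(tree_perfect_matching(path4))
```

Because each vertex has exactly one candidate mate, the sketch also reflects the uniqueness of the matching.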

We note that this proof also shows that if a tree has a perfect matching then the 
matching is unique. This fact is implicit in results of Beineke and Plummer 
[2; Theorem 2]. 


Theorem 6. For a simple graph G on $n \ge 2$ vertices, the following propositions
are equivalent: 


(i) G is connected and has n — 1 edges 

(ii) G is cycle-free and has n — 1 edges 
(iii) G is cycle-free and connected 
(iv) G is connected but deleting any edge disconnects G 

(v) any two vertices of G are connected by a unique path 
(vi) G is cycle-free and adding any new edge creates a cycle. 


It is not our intention here to give a complete proof. The cyclic proof (in this cyclic 
order) may use standard proofs except for the following two. 

(i) ⇒ (ii) Assume that G has a cycle C. Put a cyclic orientation on the edges of
C. For any vertex v not in C, there exists a shortest path connecting v to C since G
is connected. For each such vertex v choose a shortest path and orient toward C
the (unique) edge incident with v on this path. Observe that from each vertex,
whether on C or not, an oriented edge goes out and no edge is oriented in both
directions. Hence the number of edges is at least n, a contradiction.

(vi) ⇒ (i) G is connected; otherwise, adding an edge between two vertices in
different components would not create a cycle. Fix any vertex z of G and for each
$x \in V(G)$, $x \ne z$, orient toward z the edge incident with x in the unique x-z path
in G. Since G is acyclic, every edge is oriented exactly once; thus G has n - 1
edges. (Any edge e of G is the last edge of a path starting at z, since G is
connected. This is the unique path connecting its end vertices, since G is cycle-free.
Therefore e gets an orientation. No edge gets two orientations since this would
yield a cycle.) □


There is an interesting alternative proof that (vi) ⇒ (i). After noting, as before,
that G is connected, we assume G has m edges and prove m = n - 1 with the
following argument. Define the map $\tau: E(\bar G) \to [E(G)]^2$, where $\bar G$ is the
complement of G, as follows: adding any edge $e \in E(\bar G)$ creates a unique cycle; let $\tau(e)$
be the set of two edges adjacent to e in this cycle. Now the reader can show that $\tau$
is both an injection and a surjection and, so, a bijection. Thus

$$\binom{n}{2} - m = \binom{m}{2}$$

and, hence, m = n - 1.
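
The final step is a short computation, written out here as a convenience (it is a routine manipulation, not taken from the original note):

```latex
\binom{n}{2} - m = \binom{m}{2}
\iff n(n-1) = m(m-1) + 2m = m(m+1)
\iff m = n - 1,
```

where the last equivalence holds because $x \mapsto x(x+1)$ is strictly increasing for $x \ge 0$ and $m = n - 1$ is a solution.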
ACKNOWLEDGMENT. We thank a referee for providing important additional references. 
The constructs of the mathematical mind
are at the same time free and necessary. 
The individual mathematician feels free to 
define his notions and set up his axioms as 
he pleases. But the question is will he get 
his fellow mathematician interested in the 
constructs of his imagination. We cannot 
help the feeling that certain mathematical 
structures which have evolved through the 
combined efforts of the mathematical 
community bear the stamp of a necessity 
not affected by the accidents of their 
historical birth. Everybody who looks at 
the spectacle of modern algebra will be 
struck by this complementarity of freedom 
and necessity. 


—H. Weyl, 1951 
Do You Know the Way to Vertex A?


Jeffrey Ondich 


When I want to communicate with a member of my wife’s family, it is often most
efficient to send my message through my mother-in-law, Elinor. At any given
moment, Elinor knows where everyone is, what they are doing, and the easiest way
to reach them. Especially if I want to broadcast a message to the whole family,
Elinor provides me with a fast, reliable communication mechanism.

When I send electronic mail to a friend on the Internet, things are a bit 
different. My message gets divided into one or more packets (depending on how 
long-winded I am that day), and each packet is handed from machine to machine 
until it reaches my friend’s computer. There is no central, omniscient authority like 
Elinor on the Internet, so each machine along each packet’s path needs to make its 
own routing decisions. That is, each machine must decide which of its neighbor 
machines should receive my packet to ensure the most efficient delivery of my 
message. 

Elinor provides my family’s communication system with centralized control. Of 
course, if Elinor gets sick or heads out in the camper with no forwarding address, 
our system will be in trouble. To avoid similar problems, the Internet relies on 
distributed control of its communications. In this column, I will describe one class 
of distributed algorithms used by portions of the Internet to make routing 
decisions. 


THE SET-UP. Most large computer networks are divided into smaller sub-net- 
works, or autonomous systems. All communications between distinct autonomous 
systems pass through machines called gateways. A gateway is a computer (often 
but not always specially designed for communications duty) that is part of at least 
two autonomous systems. One gateway can pass a packet directly to another only if 
both gateways are a part of the same autonomous system. 

A graph is a natural way of representing a computer network. For our purposes, 
the vertices will be gateways, and two vertices will be connected by an edge if the 
two corresponding gateways are part of the same autonomous system. Each edge 
will have a positive length, representing the cost of passing a packet from one 
gateway to the next across the autonomous system they share. The path a packet 
follows through our network will correspond to a walk on this graph. 
And now, a bit of notation.

• $d_{vw}$ = the length of the edge connecting vertices v and w ($d_{vv} = 0$).

• $D_{vw}$ = the length of the shortest path between vertices v and w.

• $H^k_{vw}$ = the length of the shortest path from v to w involving no more than k
edges or hops.

• N(v) = the set of v’s neighbors (that is, v and the vertices adjacent to v).

• D(v, w)(t) = an estimate of $D_{vw}$ stored at v at time t. (When the time is not
important, we will use D(v, w).)

• F(v, w)(t) = the first vertex on what v believes to be the shortest route from
v to w at time t.


THE BELLMAN-FORD ALGORITHM. Suppose every gateway in our network 
contains a copy of the network’s gateway graph. When a packet addressed to 
gateway A arrives at gateway B, B must decide which of its neighbor machines 
should receive the packet next. If B has been using its time wisely, it has already
computed a shortest path to every other vertex on the graph, including A, and will 
know the proper neighbor to hand the packet to. 

One algorithm for computing the lengths of the shortest paths between vertices
is called the Bellman-Ford algorithm [1]. It is an iterative algorithm that computes
every $H^k_{vw}$ during its kth iteration, like so:

$$H^k_{ww} = 0$$
$$H^0_{vw} = +\infty$$
$$H^k_{vw} = \min_{u \in N(v)} \{ d_{vu} + H^{k-1}_{uw} \}, \qquad (1)$$


where v and w are distinct vertices, and k is any positive integer. The idea behind
(1) is this: vertex v can send a packet to w in at most k hops by handing the
packet to vertex u and instructing u to get the packet to w in at most k - 1 hops.
The length of the resulting trip will be $d_{vu} + H^{k-1}_{uw}$. By taking the minimum of
these distances over all of v’s neighbors, we find the shortest k-hop distance from
v to w.

The Bellman-Ford algorithm ends when $H^k_{vw} = H^{k-1}_{vw}$ for all v and w. At that
point, each $H^k_{vw}$ will be equal to the correct distance $D_{vw}$. These distances satisfy
Bellman’s Equation:

$$D_{vv} = 0$$
$$D_{vw} = \min_{u \in N(v)} \{ d_{vu} + D_{uw} \} \qquad (2)$$

A proof that $H^k_{vw} = D_{vw}$ for sufficiently large k depends upon the fact that (2) has
a unique solution, consisting of the shortest path distances $D_{vw}$. We will use this
uniqueness later.
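
Recursion (1) and its stopping rule are easy to run on a small example. The sketch below is an illustration under stated assumptions, not the author's code: the graph is hypothetical (its edge lengths were chosen to agree with the distance vector for B quoted later in the text, and the B-C length of 3 is simply assumed), and the minimum is taken over the vertices adjacent to v, excluding v itself.

```python
import math

# Hypothetical gateway graph; every edge length here is an assumption made
# for illustration, chosen to agree with the sample distance vector for B.
EDGES = {
    ("A", "B"): 10, ("A", "E"): 1, ("B", "E"): 1, ("B", "D"): 1,
    ("D", "E"): 1, ("C", "D"): 1, ("B", "C"): 3,
}


def adjacency(edges):
    """Symmetric adjacency map: adj[v][w] is the length of edge v-w."""
    adj = {}
    for (v, w), length in edges.items():
        adj.setdefault(v, {})[w] = length
        adj.setdefault(w, {})[v] = length
    return adj


def bellman_ford(adj):
    """Iterate recursion (1) until H^k = H^(k-1); return (H, final k)."""
    vertices = sorted(adj)
    # H^0: zero on the diagonal, +infinity everywhere else.
    H = {v: {w: 0 if v == w else math.inf for w in vertices} for v in vertices}
    k = 0
    while True:
        k += 1
        new = {
            v: {
                w: 0 if v == w else min(adj[v][u] + H[u][w] for u in adj[v])
                for w in vertices
            }
            for v in vertices
        }
        if new == H:  # stopping rule: H^k equals H^(k-1) for all v and w
            return H, k
        H = new


if __name__ == "__main__":
    H, k = bellman_ford(adjacency(EDGES))
    print(H["B"], k)
```

Each pass recomputes the whole table, which mirrors the centralized setting of this section, where one machine knows the entire graph.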


THE DISTRIBUTED BELLMAN-FORD ALGORITHM. To apply the Bellman- 
Ford algorithm to a graph, you need to know the whole graph. But to assume that 
every gateway in a network has a complete and accurate picture of the whole 
network is to assume too much. Computers crash and come back on line, and edge 
lengths between adjacent gateways change (if, for example, lengths are dependent 
on the amount of packet traffic). It takes time for word of these changes to make 
its way through the network. So gateways need to make their decisions based on
their most current information, which may or may not be correct. 

Routing decisions are really buck-passing decisions. The decider does not need 
to know the whole path to the destination—just the first step of that path. A 
distributed variant of the Bellman-Ford algorithm provides a mechanism for 
maintaining this first-step information at each vertex. 

Each gateway on a network using the distributed Bellman-Ford algorithm stores 
a distance vector for the network, a table whose entries consist of the identity of a 
destination, an estimate of the distance to that destination, and the first gateway 
along the shortest route to that destination. For example, the distance vector 
stored at B in Figure 1 would be 


Destination Distance First Step 
w D(B, w) F(B,w) 
A 2 E 
B 0 B 
C 2 D 
D 1 D 
E 1 E 


When a packet arrives at a gateway, the gateway consults its distance vector 
entry for the packet’s destination, hands the packet to the appropriate neighbor, 
and goes about its business. (If the gateway sees no valid route to the destination, 
it might return the packet to its sender, store the packet in a dead or delayed letter 
file, or just discard the packet.) If every gateway’s distance vector contains the 
correct distances (that is, D(v, w) = D,,,, for all v and w) and best first-steps, then 
this method delivers packets along the shortest paths. 


[Figure 1. Distance vector entries for destination A at each gateway, written as
(destination, distance, first step), in four panels: Start; After update from A;
After updates from B; After updates from D.]




Since networks change over time, we need a way of changing distance vectors. 
The way this is normally done is to insist that each gateway periodically send an 
update—that is, a copy of its distance vector—to each of its neighbors. 

Suppose A receives an update from B. Then for each destination C, A accepts
(instead of ignoring) the update by setting

$$D(A, C) = D(B, C) + d_{AB}$$
$$F(A, C) = B$$

if any of the following is true:

1. $D(B, C) + d_{AB} < D(A, C)$

2. $F(A, C) = B$

3. $F(A, C)$ has not sent an update to A for a long time, and is assumed to be
unreachable. (In [2], a “long time” is defined as 180 seconds.)
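
The acceptance rule can be sketched in a few lines. This is an illustration under stated assumptions rather than an implementation of any real protocol: the names are hypothetical, only rules 1 and 2 are coded, and rule 3 (the timeout) is modeled by marking a route invalid beforehand as distance infinity with no first step. The scenario follows the example discussed in the text after E has crashed; the A-B, B-D and C-D lengths are read off the entries quoted there, while the B-C length of 3 is an assumption.

```python
import math

# Edge lengths among the gateways still running after E has crashed.
# The B-C length is an assumption made for this sketch.
LENGTHS = {
    frozenset("AB"): 10,
    frozenset("BC"): 3,
    frozenset("BD"): 1,
    frozenset("CD"): 1,
}


def receive_update(vectors, receiver, sender):
    """Apply acceptance rules 1 and 2 to an update sent by `sender`."""
    d = LENGTHS[frozenset((receiver, sender))]
    for dest, (dist, _first) in vectors[sender].items():
        if dest == receiver:
            continue  # D(v, v) stays 0
        old_dist, old_first = vectors[receiver].get(dest, (math.inf, None))
        candidate = dist + d
        # Rule 1: strictly better route.  Rule 2: news from the current first step.
        if candidate < old_dist or old_first == sender:
            vectors[receiver][dest] = (candidate, sender)


# Entries for destination A only, stored as (distance, first step).
# B and D have already marked their routes through E as invalid.
vectors = {
    "A": {"A": (0, "A")},
    "B": {"A": (math.inf, None)},
    "C": {"A": (3, "D")},
    "D": {"A": (math.inf, None)},
}

# (1) A updates B, (2) B updates C and D, (3) D updates B and C.
for sender, receiver in [("A", "B"), ("B", "C"), ("B", "D"), ("D", "B"), ("D", "C")]:
    receive_update(vectors, receiver, sender)

print(vectors)
```

Note that C accepts D's worse distance only because of rule 2; without it, C would keep believing its stale route of length 3.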


Let’s see how this works in an example. Figure 1 shows distance vector entries
for destination A. These entries were correct before gateway E crashed. After E
has been out of commission for a while, B and D mark their routes to A invalid,
since these routes first pass through E. C’s route to A is also invalid, but C
doesn’t know it yet.

Suppose now that (1) A sends an update to B, then (2) B sends updates to C 
and D, and finally, (3) D sends updates to B and C. Figure 1 shows the modified 
distance vector entries after each update. 

The distance vector entries for destination A are all correct after the three 
updates have been sent. Of course, the order in which the updates are processed is 
important. Had D sent its update to C before receiving the update from B, then 
C’s distance vector would not be correct yet. But as long as every gateway keeps 
sending updates occasionally, the distance vectors will eventually converge to 
correct values. The following proposition formalizes this claim. 

You may find it odd that the hypotheses of this proposition require the network 
to remain static. The point of the updating scheme is, after all, to allow distance 
vectors to adapt to changes in the network. You should imagine a sequence of 
network changes separated by periods of no change. After a change, the distance 
estimates can have essentially arbitrary values. Hypothesis (a) allows arbitrary 
initial values, and so the proposition adequately models a changing network. 


Proposition. Let N be a finite, connected network that does not change after time $t_0$
(no gateways or lines crash or come up, and no edge lengths change). Suppose that
for any pair of distinct vertices v and w, and any time $t > t_0$: (a) $D(v, w)(t_0)$ is an
arbitrary element of $[0, +\infty]$; (b) either $F(v, w)(t_0)$ is a vertex in N, or the route from
v to w is marked as invalid; (c) $D(v, v)(t) = 0$; and (d) the sequence of times at
which v receives updates from each neighbor w is unbounded. Then there exists a
finite time after which $D(v, w)(t) = D_{vw}$ for all v, w, and t.


Proof: Start by defining $H^k_{vw}$ as above, and $L^k_{vw}$ by

$$L^k_{ww} = 0$$
$$L^0_{vw} = 0$$
$$L^k_{vw} = \min_{u \in N(v)} \{ d_{vu} + L^{k-1}_{uw} \}, \qquad (3)$$

where v and w are distinct vertices, and k is any positive integer.
$L^k_{vw}$ and $H^k_{vw}$ are estimates of $D_{vw}$ computed using the Bellman-Ford algorithm
with the lowest and highest possible initial conditions. By using (1), (2), and (3) to
make four simple and nearly identical induction arguments on k, we can show

$$L^k_{vw} \le L^{k+1}_{vw} \le D_{vw} \le H^{k+1}_{vw} \le H^k_{vw}$$

for every v, w, and k. Furthermore, both $H^k_{vw}$ and $L^k_{vw}$ converge to $D_{vw}$ in a finite
number of steps. To see this for $L^k_{vw}$, note that the number of possible values for
$L^k_{vw}$ is finite. Since $L^k_{vw}$ is non-decreasing and bounded above, $L^k_{vw} = L^{k+1}_{vw}$ for
sufficiently large k. Thus, for large enough k, $L^k_{vv} = 0$, and

$$L^k_{vw} = L^{k+1}_{vw} = \min_{u \in N(v)} \{ d_{vu} + L^k_{uw} \}$$

for each v and w. That is, the numbers $L^k_{vw}$ (for this sufficiently large fixed k)
satisfy (2). By the uniqueness of the solution of (2), we have $L^k_{vw} = D_{vw}$, as desired.
A similar argument shows that $H^k_{vw}$ converges to $D_{vw}$ (see [1]).

Now that we have two sequences squeezing in upon $D_{vw}$, we will fit our distance
vector estimates $D(v, w)(t)$ between $L^k_{vw}$ and $H^k_{vw}$, so the estimates will also
converge in finite time to $D_{vw}$.

More specifically, for every integer $k \ge 0$ there exists a $t_k \ge t_0$ such that
whenever $t \ge t_k$,

$$L^k_{vw} \le D(v, w)(t) \le H^k_{vw} \qquad (4)$$

for every pair of vertices v and w.

Our induction on k to prove the left inequality of (4) starts easily. Since
$L^0_{vw} = 0$, $D(v, w)(t_0) \ge 0$, and $d_{vw} \ge 0$ for every v and w, the left half of (4) holds
for k = 0.

Now suppose (4) holds for a fixed k. Hypothesis (d) guarantees that there is a
time $t_{k+1} > t_k$ such that each vertex receives at least one update from each of its
neighbors between times $t_k$ and $t_{k+1}$. During this time interval, hypothesis (b)
ensures that every vertex v must accept at least one update for each destination w.
Since

$$L^{k+1}_{vw} = \min_{u \in N(v)} \{ d_{vu} + L^k_{uw} \}$$

and $D(u, w)(t) \ge L^k_{uw}$ for every $t \ge t_k$, we have

$$D(v, w)(t) \ge L^{k+1}_{vw}$$

for any $t \ge t_{k+1}$. By induction then, the left half of (4) holds for all k. The proof of
the right half of (4) is a bit trickier than this, and can, again, be found in [1].

Assuming the claims for $H^k_{vw}$ are true, we now know that for sufficiently large k
and $t \ge t_k$, $L^k_{vw} = D_{vw} = H^k_{vw}$, and $L^k_{vw} \le D(v, w)(t) \le H^k_{vw}$. That is, there is a
finite time after which $D(v, w)(t) = D_{vw}$ for every v, w, and t. □


THE REAL WORLD. The preceding proposition guarantees that the distance 
vectors will adapt to network changes within a finite time, which should satisfy 
most mathematicians. In practice, of course, some finite times—the short 
ones—are better than others. 

Consider Figure 1 again, but suppose that just the AE edge goes down, while E
and its other connections continue to function. Vertices B, C, D and E will send
out about 10 updates each before their A-distances inch up above 10 (try it,
supposing updates go out in alphabetical order). If updates are sent from each
gateway every 30 seconds (the maximum allowed in [2]), service to A will be
disrupted for at least 5 minutes. In the meantime, packets bound for A will circle
around the network until the distance vectors are correct, further clogging the
system.

If A had crashed instead of just the AE connection, then the D(v, A)’s would 
climb as before, but this time without bound. Real systems use this counting to 
infinity behavior to identify unreachable gateways. 
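
The slow climb described above can be watched in a small simulation. The following is a sketch under loudly stated assumptions: the graph is hypothetical (the B-C length of 3 in particular is assumed, so the number of rounds need not match the count given in the text), only the entries for destination A are tracked, updates go out in alphabetical order, and the timeout rule is not modeled except that E starts with its route marked invalid.

```python
import math

# Hypothetical network after the A-E edge has gone down; all lengths are
# assumptions for illustration.
LENGTHS = {frozenset(edge): d for edge, d in {
    ("A", "B"): 10, ("B", "C"): 3, ("B", "D"): 1,
    ("B", "E"): 1, ("C", "D"): 1, ("D", "E"): 1,
}.items()}


def neighbors(lengths):
    """Sorted adjacency lists derived from the edge-length table."""
    adj = {}
    for edge in lengths:
        v, w = sorted(edge)
        adj.setdefault(v, []).append(w)
        adj.setdefault(w, []).append(v)
    return {v: sorted(ws) for v, ws in adj.items()}


def run_rounds(lengths, entries, max_rounds=100):
    """Send updates in alphabetical rounds until a full round changes nothing.

    `entries` maps each gateway to its (distance, first step) entry for
    destination A.  Returns the final entries and the number of rounds run.
    """
    adj = neighbors(lengths)
    for rounds in range(1, max_rounds + 1):
        changed = False
        for sender in sorted(adj):
            for receiver in adj[sender]:
                if receiver == "A":
                    continue  # A's distance to itself stays 0
                dist, first = entries[receiver]
                candidate = entries[sender][0] + lengths[frozenset((sender, receiver))]
                # Accept a strictly better route, or any news from the first step.
                if candidate < dist or first == sender:
                    if entries[receiver] != (candidate, sender):
                        changed = True
                    entries[receiver] = (candidate, sender)
        if not changed:
            return entries, rounds
    return entries, max_rounds


# Entries that were correct before the failure; E's route through A is invalid.
start = {"A": (0, "A"), "B": (2, "E"), "C": (3, "D"), "D": (2, "E"),
         "E": (math.inf, None)}
final, rounds = run_rounds(LENGTHS, dict(start))
print(final, rounds)
```

If the A-B edge were removed from `LENGTHS` as well, so that A became unreachable, the distances in this sketch would keep climbing until `max_rounds` is exhausted, which is the counting-to-infinity behavior mentioned above.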


FURTHER READING. You can find a detailed and more general discussion of 
the Bellman-Ford algorithm in [1], which contains a wealth of ideas about the 
analysis of computer networks. For a pragmatic approach, [2] discusses implemen- 
tation issues in distance vector algorithms, and defines the Routing Information 
Protocol (RIP), a formal specification of a distance vector routing scheme. 

For more on routing algorithms, see [3]. For the entertaining story of a major 
network crisis caused by some subtle routing algorithm flaws, see [5] and [4]. 

References [2] and [5] are Requests for Comments. You can get RFC’s by 
anonymous ftp at nic.merit.edu, in the documents/rfc directory. If you have a 
stout heart and a real interest in the workings of the Internet, RFC’s are 
indispensable reading.
Standard mathematics has recently been rendered
obsolete by the discovery that for years
we have been writing the numeral five back- 
ward. This has led to reevaluation of counting 
as a method of getting from one to ten. 
Students are taught advanced concepts of 


Boolean algebra, and formerly unsolvable equa- 
tions are dealt with by threats of reprisals. 


—Woody Allen 
Return to Mathematical Circles by Howard W. Eves, 


Boston: Prindle, Weber and Schmidt, 1988. 
THE EVOLUTION OF...
Edited by Abe Shenitzer 
Mathematics, York University, North York, Ontario M3J 1P3, Canada 


On the Calculus of Variations and Its 
Major Influences on the Mathematics of 
the First Half of Our Century.* Part I. 


Erwin Kreyszig 


1. JOHANN BERNOULLI’S BRACHYSTOCHRONE. CARATHEODORY’S 
METHOD. The calculus of variations evolved from the differential and integral 
calculus, “‘the calculus,” for short. An initial motivation of the latter was the 
determination of extrema of functions, as shown by the title of the earliest relevant 
published paper A new method for the determination of maxima and minima... 
(Leibniz, 1684). However, the calculus had to be extended to the calculus of 
variations in order to take care of more general problems involving the determina- 
tion of stationary values of functionals,’ given in the simplest case by a definite 
integral involving an unknown function and boundary conditions. The earliest (not 
very simple) solved problem of this kind was Newton’s determination of the shape 
of a gun shell of least air resistance (letter to Gregory of July 14, 1694). 

The earliest problem that received general publicity [due to a rather bombastic 
advertisement in Acta Eruditorum by Johann Bernoulli (1667—1748)] in 1696 was 
the problem of determining the brachystochrone, the curve along which a particle 
will fall from one given point to another in the shortest time. This problem was 
solved by Newton, Leibniz, and Johann Bernoulli as well as by his brother Jacob 
(1654—1705), the solution being a cycloid. Thus 1696 can be called the birthyear of 
the calculus of variations. Johann Bernoulli not only posed that problem but also 
gave a solution capable of extensive generalization worked out in 1908 by 
Carathéodory. The resulting general method was later named after Carathéodory. 


* Abbreviated version of a paper with the same title.

A functional is a function defined on a set of functions. A stationary value of a functional is its
value at a “point” (= function) that satisfies the necessary conditions for an extremum (see Section 3
below).


2. SIMPLEST GENERAL PROBLEMS. Although they arose from different geometric
and physical applications, many of the early problems led to functionals that
depended on real functions defined on an interval and satisfying boundary conditions,
and all functionals were of the same form (as needed to create a general
theory):

$$J[y] = \int_{x_0}^{x_1} L(x, y, y')\,dx, \qquad y(x_0) = y_0, \quad y(x_1) = y_1, \quad x_0 < x_1. \qquad (2.1)$$


The task was to determine a function y(x) that satisfied the boundary conditions 
in (2.1) and rendered J[ y] stationary, possibly yielding a minimum or a maximum 
of J[y]. At this early stage, the existence and uniqueness of solutions was 
“obvious” for physical reasons because a solution could be verified experimentally 
if desired. Also, with the concept of function not yet sharply defined, nobody made 
an attempt to characterize the set of functions in which such a y(x) was to be 
found. This was accomplished well over one hundred years later in the works of 
Jacobi (1804-51) around 1835 and, especially, of Weierstrass (1815-97) around 
1880. 


3. EULER AND LAGRANGE. As the birthyear of the theory of the calculus of 
variations one usually considers 1744, the year in which Euler published his famous 
book Methodus inveniendi lineas curvas maximi minimive proprietate gaudentes, sive 
solutio problematis isoperimetrici latissimo sensu accepti (A method for discovering 
curved lines that enjoy a maximum or minimum property, or the solution of the 
isoperimetric problem taken in its widest sense). Thus Euler replaced “art of 
invention” (ars inveniendi), a very popular term in the works of Tschirnhaus and in 
other works of Leibniz’s time, by ““method of invention,” a remarkable turn toward 
systematization. This book, a landmark in the development of the subject, con- 
tained the Euler equation 


$$\frac{d}{dx} L_{y'} - L_y = 0 \qquad (3.1)$$


(first published by Euler in 1736) as a necessary condition for y(x) satisfying (2.1) 
to yield a minimum of J[y]. In more explicit form it is the equation 


$$L_{y'y'}\, y'' + L_{y'y}\, y' + L_{y'x} - L_y = 0. \qquad (3.2)$$


This equation suggests calling (2.1) a regular problem when $L_{y'y'}$ is never zero, and
then assuming that $L_{y'y'} > 0$.

Euler’s book also contains a fascinating collection of 66 problems. Carathéodory, 
the editor of the book as a volume of Euler’s Works, said that it 


“is one of the most beautiful mathematical works ever written. We cannot emphasize enough the 
extent to which that Lehrbuch over and over again served later generations as a prototype in the 
endeavor of presenting special mathematical material in its [logical, intrinsic] connection.” 


Euler’s inspiration came from geometry and even more from the principle of 
least action, according to which nature realizes all motions in the most economical 
manner; more precisely, among all possible ways of reaching a given goal, nature 
chooses the one which minimizes the action integral $\int m v\,ds$ over the path (m =
mass, v = speed, s = arc length). The beginning of the principle is often dated 
back to Leibniz because of a (lost) letter he is supposed to have written on the 
principle in 1707, but the question is still an open one. The principle is usually 
named after de Maupertuis (1698-1759), president of the Berlin Academy under 
Frederick the Great. Actually, Euler most likely discovered it earlier, formulated it 
mathematically more rigorously, and applied it to a nontrivial problem (involving
central forces). In contrast to this, Maupertuis published the principle (in 1744 and 
1746) in a vague and almost theological form. He defended vigorously his (ques- 
tionable) priority, but failed to realize that a rigorization of the principle would call 
for specification of conditions to be satisfied by the motions with which the actual 
motion was to be compared. Accordingly, his main merit seems to be that he was 
searching for a minimum principle. 

The great significance of the calculus of variations in mathematical physics is 
due to the transparent and coordinate-free form that the laws of nature take in 
this calculus. That fact became apparent in the work of Euler, and even more 
impressively in that of Lagrange. In his path-breaking memoir Essai d’une nouvelle 
méthode pour déterminer les maxima et les minima des formules intégrales indéfinies 
(1760-61) Lagrange substantially overtook Euler (as Euler was well aware). His 
work was a milestone in the development of the field and in its application to 
geometry and analytical mechanics. In it he invented the “method of variations” 
together with the symbol 6. His new idea was to use “comparison functions,” 


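$$\tilde y = y + \epsilon\eta, \qquad \eta \in C^1([x_0, x_1]), \qquad \eta(x_0) = \eta(x_1) = 0, \qquad (3.3)$$

in (2.1) and to conclude from the vanishing of the first variation of (2.1),

$$\delta J = \epsilon \left. \frac{\partial J[\tilde y]}{\partial \epsilon} \right|_{\epsilon = 0} = \epsilon \int_{x_0}^{x_1} \left( L_y \eta + L_{y'} \eta' \right) dx = 0, \qquad (3.4)$$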


(and an integration by parts) that Euler’s equation (3.1) gave a necessary condition 
for y(x) to render J[ y] stationary. (For a detailed and transparent explanation of 
this vital point see pp. 505-508 of G. F. Simmons, Differential Equations, second 
ed., McGraw-Hill, 1991.) In that paper Lagrange also started working on problems 
with variable endpoints, with application to brachystochrone and other problems. 
As another important step forward, he explicitly formulated his multiplier rule 
(without proof). This rule became a basic tool in his Méchanique analitique, in 
which he also included his theory of the calculus of variations and derived from the 
principle of least action his equations of motion, equivalent to Newton’s second law 
and constituting analogues of Euler’s equation. They are: 


$$\frac{\partial V}{\partial x_\mu} + \frac{d}{dt}\left(\frac{\partial T}{\partial \dot x_\mu}\right) = 0, \qquad \mu = 1, 2, 3, \tag{3.5}$$


where V and T are the potential and kinetic energy, respectively. 

Whereas Euler’s interest in the calculus of variations centered around applica- 
tions, Lagrange’s emphasis was on algorithmic aspects of analysis. Lagrange’s 
entire work excels by its wealth of original discoveries as well as by the outstanding 
assimilation of the historical material. 


“By generalizing Euler’s method he arrived at his remarkable formulas which in one line contain
the solution of all problems of analytical mechanics.


[In his Memoir of 1760-61] he created the whole calculus of variations with one stroke. This is
one of the most beautiful articles that have ever been written. The ideas follow one another like 
lightning with the greatest rapidity...” 


These enthusiastic lines are taken from lecture notes by C. G. J. Jacobi 
(1804-51). 
4. MINIMAL SURFACES. Euler’s Methodus inveniendi of 1744 marked not only
the beginning of the theory of the calculus of variations but also of one of its most 
fascinating geometric applications related to the creation of a remarkable class of 
surfaces called minimal surfaces. These were originally obtained from the calculus 
of variations as (portions of) surfaces of least area among all surfaces bounded by a 
given space curve. Nowadays we define them as surfaces with vanishing mean
curvature H,

$$H = \tfrac{1}{2}(\kappa_1 + \kappa_2) = \frac{1}{2}\,\frac{GL - 2FM + EN}{EG - F^2} = 0, \tag{4.1}$$

using the discovery of Meusnier (1754-93) in 1776 that (4.1) is a necessary
condition for least area. Here $\kappa_1$ and $\kappa_2$ are the principal curvatures, and E, F, G
and L, M, N are the respective coefficients of the first and second fundamental
forms of the surface.

What is most important to us here is Euler’s discovery of the first non-trivial
minimal surface, the catenoid. Euler obtained it by minimizing area, as the surface
generated by rotating a catenary [a cosh curve, the curve of a hanging chain
(catena) or cable], say,

$$r = A \cosh x, \tag{4.2}$$


where r is the distance, in 3-dimensional space, from the x-axis. 

Euler’s discovery of the catenoid was a major accomplishment in his geometric 
work and marked the beginning of the study of minimal surfaces. It was followed 
by Lagrange’s systematic theory developed in his Memoir of 1760-61. In this paper 
and in a subsequent one he extended his method to double integrals for functions 
of two variables,

$$J[z] = \iint_\Omega L(x, y, z, p, q)\,dx\,dy \qquad (p = z_x,\ q = z_y) \tag{4.3}$$

over a domain $\Omega$ in the xy-plane subject to given boundary conditions; the
corresponding Euler-Lagrange equation [taking the place of (3.1)] is

$$L_z - \frac{\partial}{\partial x} L_p - \frac{\partial}{\partial y} L_q = 0. \tag{4.4}$$


5. LEGENDRE, JACOBI, WEIERSTRASS. In the calculus, y’ = 0 is only a neces- 
sary condition for a minimum of a function y(x), and for a decision one must also 
consider y”. Similarly, in the calculus of variations Euler’s equation is only a 
necessary condition for a minimum, and for a decision one must also consider the 
second variation of (2.1), 


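$$\delta^2 J = \frac{\varepsilon^2}{2} \left.\frac{\partial^2 J[\tilde y]}{\partial \varepsilon^2}\right|_{\varepsilon = 0} = \frac{\varepsilon^2}{2} \int_{x_0}^{x_1} (L_{yy}\eta^2 + 2L_{yy'}\eta\eta' + L_{y'y'}\eta'^2)\,dx, \tag{5.1}$$

introduced by Legendre (1752-1833) in 1786. It was formally motivated by Taylor’s
theorem

$$J[y + \varepsilon\eta] = J[y] + \delta J + \delta^2 \tilde J, \tag{5.2}$$

where the tilde means that the arguments are $y + \tilde\varepsilon\eta$, $y' + \tilde\varepsilon\eta'$ with $\tilde\varepsilon \in (0, \varepsilon]$.
Legendre obtained the condition $L_{y'y'} > 0$ along a minimizing curve and $L_{y'y'} < 0$
along a maximizing curve (very similar to the calculus!) but he did not justify his
analysis completely.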
In fact, it took another fifty years before Jacobi succeeded in rigorously
demonstrating that $L_{y'y'} > 0$ and the so-called Jacobi condition, which asserts that
$x_1$ should be closer to $x_0$ than the so-called conjugate point* of $x_0$, suffice for a
local minimum, that is, a minimizing $\hat y$ among $y \in C^1[x_0, x_1]$ satisfying the
boundary conditions in (2.1) and lying close to $\hat y$ in the $C^1$ sense, that is, satisfying

$$\text{(a)}\ |y - \hat y| < \rho, \qquad \text{(b)}\ |y' - \hat y'| < \rho \qquad \text{for small positive } \rho. \tag{5.3}$$


Jacobi’s discovery of the conjugate point and of its significance closed a substantial 
gap. However, condition (5.3b) seemed to be too restrictive and in no way 
suggested by the nature of the problem. Weierstrass emphasized that one should 
extend the domain of (2.1) and consider strong minima, that is, one should drop 
(5.3b). 

For this program of obtaining a sufficient condition for a strong minimum, 
Weierstrass set up entirely new machinery centering around two ingenious con- 
cepts. The first was a field of extremals of (2.1), which he defined as a domain 2, in 
the xy-plane such that through each of its points there passes precisely one 
extremal of a one-parameter family of extremals of (2.1) (solution curves of Euler’s 
equation) depending continuously on the parameter, and the second was the 
so-called E-function, a turning point in the history of the calculus of variations. 

To define the E-function, Weierstrass started from the slope function p = 
p(x, y), the slope at (x, y) of the extremal of a field of extremals y = h(x, a); thus 


$$p(x, y) = h'(x, a)\big|_{a = a(x, y)}, \tag{5.4}$$


where ’ refers to differentiation with respect to x. Then he defined the E-function 
by 


$$E(x, y, p, y') = L(x, y, y') - L(x, y, p) - (y' - p)\,L_{y'}(x, y, p), \tag{5.5}$$


where y = y(x) is any $C^1$-curve in the region covered by the field of extremals.
Now he could prove that if, for an extremal y = y(x) of the field, the above
sufficient conditions for a local minimum are satisfied, and if $E \ge 0$ at every point
in the field and for every y’, then y(x) gives a strong minimum of (2.1).

In addition to his path-breaking new method, Weierstrass also revolutionized 
the calculus of variations by stressing—practically for the first time—the impor- 
tance of a precise definition of the domain D(J) of the functional J[y] and of 
admissible functions, the functions y € D(J) satisfying the side conditions. 
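
To make the E-function (5.5) concrete, the following minimal sketch evaluates it for one illustrative integrand that is not taken from the article: the arc-length Lagrangian $L(x, y, y') = \sqrt{1 + y'^2}$. The function names are hypothetical. Because this L is convex in y', the excess should be nonnegative for every pair of slopes and should vanish when y' = p, which is the behaviour Weierstrass's sufficient condition asks for.

```python
import math

def lagrangian(x, y, slope):
    # Illustrative choice (an assumption): arc length, L = sqrt(1 + y'^2).
    return math.sqrt(1.0 + slope * slope)

def lagrangian_dslope(x, y, slope):
    # Partial derivative of L with respect to y' for the choice above.
    return slope / math.sqrt(1.0 + slope * slope)

def excess(x, y, p, slope):
    # Weierstrass E-function as in (5.5):
    # E = L(x, y, y') - L(x, y, p) - (y' - p) * L_{y'}(x, y, p).
    return (lagrangian(x, y, slope)
            - lagrangian(x, y, p)
            - (slope - p) * lagrangian_dslope(x, y, p))

if __name__ == "__main__":
    for p in (-1.0, 0.0, 2.0):
        for slope in (-3.0, 0.0, 1.0):
            print(p, slope, excess(0.0, 0.0, p, slope))
```

For arc length the field slope p and the comparison slope y' enter only through a convex function, so the sign of E can be read off from the tangent-line inequality for convex functions.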
*The conjugate point of $x_0$ is the first value $x > x_0$ where a nonzero solution of

$$\frac{d}{dx}\left(L_{y'y'}\frac{dw}{dx}\right) - \left(L_{yy} - \frac{d}{dx}L_{yy'}\right)w = 0, \qquad w(x_0) = 0$$

vanishes. Here $w(x) = \partial y/\partial a\,|_{a=0}$ and a = 0 corresponds to $\hat y$ in the family of extremals y = y(x, a).
Hofstadter’s Law: It always takes longer than
you expect, even when you take into account 
Hofstadter’s Law. 


—Douglas R. Hofstadter 
Gödel, Escher, Bach, New York: Basic Books, 1979.
NOTES


(10401) The winding number of a contour C about a point $z_0 \notin C$ is the net number of
times C encircles $z_0$ in the counterclockwise direction. It can be defined using the complex
variable z = x + iy as $(2\pi i)^{-1} \int_C dz/(z - z_0)$. The oldest known knight’s tour (due to al-Adli
in the ninth century) is illustrated below, together with the 49 winding numbers $w_{ij}$ at the
corner points between cells. In this example, $\sum w_{ij} = 49$ and $\sum (x_k y_{k+1} - x_{k+1} y_k) = 98$.
(10402) We use the convention that N denotes non-negative integers. A good reference 
for products of countably many measure spaces is E. Hewitt and K. Stromberg, Real and 
Abstract Analysis, section 22. 
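
The relation quoted in note 10401 between the sum of the winding numbers and the sum $\sum (x_k y_{k+1} - x_{k+1} y_k)$ can be tried out on a small example. The sketch below is only an illustration under stated assumptions: the closed lattice path is a made-up one (a 2 by 2 square followed by a 1 by 1 square, both counterclockwise), not al-Adli's tour, and the function names are hypothetical. Winding numbers are sampled at the centres of the unit cells, so their sum approximates the area integral of the winding number.

```python
import math

def shoelace_sum(vertices):
    # Sum of x_k * y_{k+1} - x_{k+1} * y_k around the closed polygon.
    n = len(vertices)
    total = 0
    for k in range(n):
        x1, y1 = vertices[k]
        x2, y2 = vertices[(k + 1) % n]
        total += x1 * y2 - x2 * y1
    return total

def winding_number(vertices, point):
    # Net number of counterclockwise turns of the polygon about `point`,
    # obtained by adding the signed angles subtended by the edges.
    px, py = point
    total = 0.0
    n = len(vertices)
    for k in range(n):
        ax, ay = vertices[k][0] - px, vertices[k][1] - py
        bx, by = vertices[(k + 1) % n][0] - px, vertices[(k + 1) % n][1] - py
        total += math.atan2(ax * by - ay * bx, ax * bx + ay * by)
    return round(total / (2.0 * math.pi))

# Hypothetical closed path: big square, then small square, sharing (0, 0).
PATH = [(0, 0), (2, 0), (2, 2), (0, 2), (0, 0), (1, 0), (1, 1), (0, 1)]
CELL_CENTRES = [(i + 0.5, j + 0.5) for i in range(2) for j in range(2)]

if __name__ == "__main__":
    print(shoelace_sum(PATH))
    print(sum(winding_number(PATH, c) for c in CELL_CENTRES))
```

In this example the cell containing (0.5, 0.5) is encircled twice and the other three cells once, so the winding numbers add up to half of the shoelace sum, the same factor of two as in 49 versus 98.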


[Figure for 10401: al-Adli’s knight’s tour with the winding numbers $w_{ij}$ marked at the 49 corner points between cells.]


SOLUTIONS 


Polygonal Polynomial Identities 


6639 [1990, 622]. Proposed by Paul Monsky, Brandeis University, Waltham, MA. 


Let N be a convex centrally symmetric plane n-gon of area 2 divided into n triangles
by rays from an interior point P to the vertices. Let $a_1, a_2, \ldots, a_n$ be the areas of these
triangles, taken in cyclic order.

(i) If N is affinely equivalent to the regular polygon of 2k sides, prove that

$$a_1 + a_3 + \cdots + a_{n-1} = a_2 + a_4 + \cdots + a_n = 1.$$

(ii) In general prove that $a_1, a_2, \ldots, a_n$ satisfy a polynomial identity with integer
coefficients and constant term 1.


Solution by John H. Lindsey II, Fort Myers, FL. Evidently n = 2k is even and at least 4. We
take all subscripts from the set [n] of integers modulo n. It is convenient to regard N as lying
in the xy-plane in cartesian 3-space with the origin O as its center. Let $v_i = (x_i, y_i, 0)$, for
i = 1, 2, ..., n, be the position vectors of the vertices of N taken in clockwise order. Since
N is centrally symmetric we have

$$v_{i+k} = -v_i \tag{1}$$

for all $i \in [n]$. Let p be the position vector of P and $l$ be the vector (0, 0, 1). Then

$$2a_i = (v_{i+1} - p) \times (v_i - p) \cdot l = v_{i+1} \times v_i \cdot l + (v_i - v_{i+1}) \times p \cdot l \tag{2}$$

for all $i \in [n]$.

In part (i) there is an affine transformation of N onto a regular n-gon N’ in the xy-plane
such that N’ is also centered on O with area 2. This transformation multiplies all areas by
a constant, which must be 1 since N and N’ have the same area. Thus the problem is left
invariant by this transformation. So we may assume that N is N’. Let S be the set of odd
integers modulo n. Then summing either $v_i$ or $v_{i+1}$ over all $i \in S$ is like summing the kth
roots of unity in the complex plane, and so yields 0 since $k \ge 2$. Since the $v_{i+1} \times v_i \cdot l$ are
all equal to 4/n, part (i) follows from summing (2) over all $i \in S$.

When k = 2, part (ii) follows from part (i), since every convex centrally symmetric plane 
quadrilateral is affinely equivalent to a square. So in giving the proof of (ii) we may assume 
k > 2. 

Summing (2) over $i, i+1, \ldots, i+k-1$ yields

$$2(a_i + a_{i+1} + \cdots + a_{i+k-1}) = (v_{i+1} \times v_i + v_{i+2} \times v_{i+1} + \cdots + v_{i+k} \times v_{i+k-1}) \cdot l + (v_i - v_{i+1} + v_{i+1} - v_{i+2} + \cdots + v_{i+k-1} - v_{i+k}) \times p \cdot l.$$

Because N is symmetric about O, the first sum on the right hand side of this equality
is precisely the area 2 of N. The second sum collapses to $(v_i - v_{i+k}) \times p \cdot l$, which is
$2 v_i \times p \cdot l$ by (1). Hence

$$v_i \times p \cdot l = a_i + a_{i+1} + \cdots + a_{i+k-1} - 1 \tag{3}$$

for all $i \in [n]$.
It follows from (1) and (2) that

$$a_i + a_{i+k} = v_{i+1} \times v_i \cdot l = x_{i+1} y_i - x_i y_{i+1}$$

for all $i \in [n]$. Multiplying this equation by $(-1)^i x_{i+2} x_{i+3} \cdots x_{i+k-1}$, summing over
$i \in [k]$, and using (1) to give $x_i = -x_{i+k}$, we obtain

$$\sum_{i=1}^{k} (-1)^i x_{i+2} x_{i+3} \cdots x_{i+k-1} (a_i + a_{i+k}) = \sum_{i=1}^{k} \left( (-1)^i y_i x_{i+1} x_{i+2} \cdots x_{i+k-1} - (-1)^{i+1} y_{i+1} x_{i+2} x_{i+3} \cdots x_{i+k-1} x_{i+k} \right).$$

Evidently this last sum collapses to $-y_1 x_2 x_3 \cdots x_k + (-1)^k y_{k+1} x_{k+2} x_{k+3} \cdots x_{2k}$, which is
equal to zero by (1). Thus

$$\sum_{i=1}^{k} (-1)^i x_{i+2} x_{i+3} \cdots x_{i+k-1} (a_i + a_{i+k}) = 0. \tag{4}$$


Suppose that $P \ne O$. Rotate the configuration to put P on the positive y-axis. Apply
the area-preserving linear transformation $x \mapsto rx$, $y \mapsto r^{-1}y$ for appropriate r so that p
becomes (0, 1, 0). Then (3) gives

$$x_i = a_i + a_{i+1} + \cdots + a_{i+k-1} - 1 \tag{5}$$

for any $i \in [n]$. These values of the $x_i$ may be substituted in (4) to obtain an identity satisfied
by the $a_i$. More specifically, let R be the polynomial ring $\mathbb{Z}[A_1, A_2, \ldots, A_n]$ in n variables
$A_1, A_2, \ldots, A_n$ over the integers $\mathbb{Z}$. For each $i \in [n]$ we define

$$X_i = X_i(A_1, A_2, \ldots, A_n) = A_i + A_{i+1} + \cdots + A_{i+k-1} - 1 \in R \tag{6}$$
and let e be the polynomial

$$e(A_1, A_2, \ldots, A_n) = \sum_{i=1}^{k} (-1)^i X_{i+2} X_{i+3} \cdots X_{i+k-1} (A_i + A_{i+k}) \in R. \tag{7}$$

The above argument tells us that

$$e(a_1, a_2, \ldots, a_n) = 0 \tag{8}$$

whenever $P \ne O$.

If P = O, then $X_i(a_1, a_2, \ldots, a_n) = a_i + a_{i+1} + \cdots + a_{i+k-1} - 1 = 0$ for all $i \in [n]$,
since N is symmetric about O with area 2. In view of (7) this implies that (8) holds when
P = O. Thus (8) holds in all cases. Of course, the fact that the area of N is 2 says that

$$f(a_1, a_2, \ldots, a_n) = 0, \tag{9}$$

where f is the polynomial

$$f(A_1, A_2, \ldots, A_n) = A_1 + A_2 + \cdots + A_n - 2. \tag{10}$$


It follows from (6) that $X_i + X_{i+1} \equiv A_i + A_{i+k} \pmod{2R}$ and $X_i + X_{i+k} = f$ for any
$i \in [n]$. We conclude that $X_i \equiv -X_{i+k} \equiv X_{i+k} \pmod{2R + fR}$ for all such i, and hence
that

$$\begin{aligned} e &\equiv \sum_{i=1}^{k} X_{i+2} X_{i+3} \cdots X_{i+k-1} (X_i + X_{i+1}) \\ &\equiv \sum_{i=1}^{k} X_{i+1} X_{i+2} \cdots X_{i+k-1} + \sum_{i=1}^{k} X_{i+2} X_{i+3} \cdots X_{i+k} \\ &\equiv X_2 X_3 \cdots X_k + X_{k+2} X_{k+3} \cdots X_{2k} \equiv 0, \end{aligned}$$

where all equivalences are modulo the ideal 2R + fR of R. Thus there is some polynomial
$g \in R$ such that $h = e - fg$ has all coefficients even.

Let e(0), f(0), g(0) and h(0) be the constant terms of e, f, g and h, respectively. Since
$f \equiv A_1 + A_2 + \cdots + A_n \pmod{2R}$, the terms of degree one in fg are congruent to
$g(0)(A_1 + A_2 + \cdots + A_n)$ modulo 2R. The terms of degree one in e are

$$\sum_{i=1}^{k} (-1)^{i+k} (A_i + A_{i+k}) \equiv A_1 + A_2 + \cdots + A_n \pmod{2R},$$

while those of h are all even. Since e = fg + h, we conclude that g(0) is odd. It follows
that $h(0) = e(0) - g(0) f(0) = 2g(0)$ is exactly divisible by 2. Thus $h/2 - jf \in R$ has
constant term 1 for some $j \in \mathbb{Z}$. Because e, f and h all vanish at $(a_1, a_2, \ldots, a_n)$, the
polynomial $h/2 - jf$ provides a solution to part (ii) of the problem.

For example, in the case k = 3 we get

$$e = (1 - A_3 - A_4 - A_5)(A_1 + A_4) + (A_4 + A_5 + A_6 - 1)(A_2 + A_5) + (1 - A_5 - A_6 - A_1)(A_3 + A_6)$$

and we may take $g = 1 + A_4 + A_5 + A_6$ and j = 0. This gives the polynomial

$$\begin{aligned} \tfrac{1}{2}h - jf = \tfrac{1}{2}(e - fg) = {} & 1 - A_2 + A_4 + A_6 - A_4^2 - A_6^2 \\ & - A_1(A_3 + A_4 + A_5 + A_6) \\ & - A_3(A_4 + A_5 + A_6) - A_4(A_5 + A_6) - A_5 A_6, \end{aligned}$$

which provides a solution of part (ii) for k = 3.


Solved also by the proposer. Part (i) only was solved by Weiqi Gao. 
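
The k = 3 identity from the solution can be checked on a concrete hexagon. The sketch below is an illustration only: the hexagon, the sample points P, and the function names are hypothetical choices, not part of the published solution. Exact rational arithmetic is used, and the triangle areas are rescaled so that they sum to 2, which matches the normalization in the problem. Since this particular hexagon is an affine image of the regular hexagon, part (i) can be checked on it as well.

```python
from fractions import Fraction

# Hypothetical test polygon: a centrally symmetric convex hexagon in cyclic
# order. It is an affine image of the regular hexagon (v2 = v1 + v3).
HEXAGON = [(1, 0), (1, 1), (0, 1), (-1, 0), (-1, -1), (0, -1)]

def triangle_areas(vertices, point):
    """Areas a_1..a_n of the triangles P v_i v_{i+1}, rescaled to sum to 2.

    The rescaling assumes that `point` lies inside the convex polygon.
    """
    px, py = Fraction(point[0]), Fraction(point[1])
    n = len(vertices)
    doubled = []
    for i in range(n):
        x1, y1 = vertices[i]
        x2, y2 = vertices[(i + 1) % n]
        cross = (x1 - px) * (y2 - py) - (x2 - px) * (y1 - py)
        doubled.append(abs(cross))
    total = sum(doubled)
    return [2 * d / total for d in doubled]

def hexagon_identity(a):
    # The polynomial h/2 - j*f obtained in the solution for k = 3.
    a1, a2, a3, a4, a5, a6 = a
    return (1 - a2 + a4 + a6 - a4 ** 2 - a6 ** 2
            - a1 * (a3 + a4 + a5 + a6)
            - a3 * (a4 + a5 + a6) - a4 * (a5 + a6) - a5 * a6)

if __name__ == "__main__":
    for p in [(0, 0), (Fraction(1, 2), 0), (Fraction(1, 4), Fraction(1, 3))]:
        areas = triangle_areas(HEXAGON, p)
        print(areas, hexagon_identity(areas), areas[0] + areas[2] + areas[4])
```

Because the areas are invariant under area-preserving affine maps and reflections, the orientation and the starting vertex of the labelling do not affect the check.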
The Difference between Graphs of Even and Odd Size


6673 [1991, 965]. Proposed by Paresh J. Malde and Allen J. Schwenk, Western Michigan 
University, Kalamazoo, MI. 


Let $c_n = e_n - d_n$, where $e_n$ is the number of connected labeled n-vertex graphs with an
even number of edges and $d_n$ is the number of connected labeled n-vertex graphs with an
odd number of edges. For example, $c_2 = 0 - 1 = -1$ and $c_3 = 3 - 1 = 2$. Find a general
formula for $c_n$.


Composite solution I by Stephen C. Locke, Florida Atlantic University, Boca Raton, FL,
and Herbert S. Wilf, University of Pennsylvania, Philadelphia, PA. The answer is
$c_n = (-1)^{n-1}(n-1)!$. More generally, for a fixed positive integer k, let $c_{n,k}$ be the same excess,
but for graphs with exactly k components. We claim that $c_{n,k}$ is the signed Stirling number
of the first kind.

Let f(n, e, k) be the number of labeled graphs with n vertices, e edges, and k components,
and form the exponential generating functions

$$F_k(x, y) = \sum_{n, e} f(n, e, k)\, y^e \frac{x^n}{n!}.$$

The computation also uses the corresponding generating function for all labeled graphs,

$$F(x, y) = \sum_{n, e} f(n, e)\, y^e \frac{x^n}{n!},$$

where f(n, e) is the number of labeled graphs with n vertices and e edges, so that
$f(n, e) = \binom{C(n,2)}{e}$ with $C(n, 2) = \binom{n}{2}$. Then by the Exponential Formula we have

$$F_k(x, y) = \frac{F_1(x, y)^k}{k!} \quad\text{and}\quad F(x, y) = \exp(F_1(x, y))$$

as formal series. For fixed k, the exponential generating function of $c_{n,k}$ is $F_k(x, -1)$.
The formula for f(n, e) yields $F(x, y) = \sum_{n \ge 0} (1 + y)^{C(n,2)} x^n/n!$, so that $F(x, -1) = 1 + x$.
Thus, $F_1(x, -1) = \ln(1 + x)$ and $F_k(x, -1) = \ln(1 + x)^k/k!$. These are known
exponential generating functions of the claimed values.


Composite solution II by Bruce E. Sagan, Michigan State University, East Lansing, MI,
and Richard Holzsager, The American University, Washington, DC. We construct a bijective
proof of the formula $c_n = (-1)^{n-1}(n-1)!$ by associating graphs in pairs that are identical
everywhere except on the pair $\{v_1, v_2\}$; one has this pair as an edge and the other does not.
Note that every n-vertex graph belongs to such a pair. If both graphs in a pair are connected
or both are disconnected, then the pair contributes zero to $c_n$.

In computing $c_n$, we may therefore restrict our attention to connected graphs containing
the edge $v_1 v_2$ as a cut-edge. Let $z = v_1$. Since $zv_2$ is a cut-edge, z and $v_2$ have no common
neighbor. Hence contracting $v_2$ into z by contracting the edge $zv_2$ reduces the number of
edges in every such graph by exactly one.

Among these graphs, we consider those that do or do not contain the edge $zv_3$. All are
paired off with zero contribution to $c_n$ except those in which $zv_3$ is a cut-edge. Again we
can contract this cut-edge in the remaining graphs, absorbing $v_3$ into z. We continue this
process until the graphs that remain have been contracted to a single vertex.

We claim a graph survives the pairing at each stage if and only if for each i > 1 the vertex
$v_i$ has exactly one edge to a vertex with lower index. If some $v_i$ has two lower neighbors,
then at the moment when the higher of these would be absorbed by contraction, it would
have a common neighbor with z. If some $v_i$ has no lower neighbor, let P be a shortest path
from $v_i$ to a vertex with lower index; any vertex along P whose index is a local maximum
now has two lower neighbors.

The number of graphs satisfying this structural property is (n — 1)!, and they all have 
n — 1 edges, so the desired formula holds. 
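
For small n the formula $c_n = (-1)^{n-1}(n-1)!$ can be confirmed by brute force. The following sketch is not part of either published solution; it simply enumerates all labeled graphs on n vertices, keeps the connected ones, and adds +1 or -1 according to the parity of the number of edges. The function names are hypothetical, and the enumeration is only practical for very small n, since there are $2^{n(n-1)/2}$ graphs.

```python
from itertools import combinations
from math import factorial

def is_connected(n, edges):
    # Depth-first search from vertex 0 over an adjacency map.
    adjacent = {v: set() for v in range(n)}
    for u, v in edges:
        adjacent[u].add(v)
        adjacent[v].add(u)
    seen = {0}
    stack = [0]
    while stack:
        u = stack.pop()
        for w in adjacent[u]:
            if w not in seen:
                seen.add(w)
                stack.append(w)
    return len(seen) == n

def excess(n):
    # c_n = (#connected graphs with evenly many edges) - (#with oddly many).
    pairs = list(combinations(range(n), 2))
    total = 0
    for mask in range(1 << len(pairs)):
        edges = [pairs[i] for i in range(len(pairs)) if (mask >> i) & 1]
        if is_connected(n, edges):
            total += 1 if len(edges) % 2 == 0 else -1
    return total

if __name__ == "__main__":
    for n in range(1, 6):
        print(n, excess(n), (-1) ** (n - 1) * factorial(n - 1))
```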


Editorial comment. Use of the generating function, as in Solution I, is the more straightfor- 
ward approach. Indeed, these generating functions are well known in the literature of graph 
theory. The Exponential Formula may be found in H. S. Wilf, Generatingfunctionology, Aca- 
demic Press, 1990. Also see I. P. Goulden and D. M. Jackson, Combinatorial Enumeration, 
Wiley, 1983 (where the problem of finding the relation between the generating functions 
$F_1(x, y)$ and $F(x, y)$ is given as Problem 3.4.7 on p. 210), and the review paper, E. M.
Palmer, “The enumeration of graphs”, pp. 385-415 in L. W. Beineke and R. J. Wilson, eds., 
Selected Topics in Graph Theory, Academic Press, 1978 (where this result may be found on 
p. 391). In all, eight solutions to the present problem of this type were submitted. 

V. A. Liskovets notes that similar results hold for different classes of graphs, such as 
digraphs allowing edges in both directions, digraphs not allowing such edges, graphs with 
loops, and even strongly connected digraphs. He supplied a reference to V. A. Liskovets, 
Doklady AN BSSR 17 (1973), 1079. 

Bijective proofs, as in Solution II, were submitted by four solvers.


Solved also by G. Calinescu (Student, Romania), R. J. Chapman (U. K.), F. Galvin, V. A. Liskovets (Belarus), 
R. Martin (student), J. Mobek, F. Schmidt, and the proposers. 


Squarish Numbers 


10220 [1992, 461]. Proposed by Solomon W. Golomb, University of Southern California, Los 
Angeles, CA. 


Suppose $\varepsilon$ is a given positive number. A positive integer n will be called $\varepsilon$-squarish if
and only if it has a factorization n = ab with $1 < a \le b < (1 + \varepsilon)a$. Prove that there are
infinitely many occurrences of six consecutive $\varepsilon$-squarish numbers.


Solution by Robert B. Israel, University of British Columbia, Vancouver, B.C., Canada. If x is
sufficiently large, then $(x-1)(x+1) = x^2 - 1$ and $(x-2)(x+2) = x^2 - 4$ are both $\varepsilon$-squarish,
as is $x^2$ itself. Also, if $x = t^2 + t - 2$ for some integer t, then $(x - t)(x + t + 1) = x^2 - 2$
and $(x - 2t + 1)(x + 2t + 3) = x^2 - 5$ are both $\varepsilon$-squarish for t sufficiently large. Finally,
if $x = 2s^2 - 2$ for some integer s, then $(x - 2s + 1)(x + 2s + 1) = x^2 - 3$ is $\varepsilon$-squarish
for s sufficiently large.

This gives 6 consecutive $\varepsilon$-squarish integers $x^2 - 5, x^2 - 4, \ldots, x^2$ if $x = t^2 + t - 2 = 2s^2 - 2$
with s and t sufficiently large. It suffices to show that $u^2 = 8s^2 + 1$ has solutions
with arbitrarily large s and u = 2t + 1. One could use the theory of Pell’s equation here or
simply take

$$\begin{pmatrix} s \\ u \end{pmatrix} = \begin{pmatrix} 3 & 1 \\ 8 & 3 \end{pmatrix}^{n} \begin{pmatrix} 0 \\ 1 \end{pmatrix}$$

for large positive integer n.


Editorial comment. All solutions used essentially the same construction, and none of
them considered longer sequences of consecutive $\varepsilon$-squarish numbers. To illustrate the
construction, both Richard Holzsager and the proposer mentioned the numerical example

$$55 \cdot 89\ (= 4895), \quad 68 \cdot 72, \quad 59 \cdot 83, \quad 62 \cdot 79, \quad 69 \cdot 71, \quad 70 \cdot 70\ (= 4900)$$

for $\varepsilon > 34/55$. The proposer declined to conjecture arbitrarily long runs, but does not
expect that this problem gives the best possible result.


Solved also by C. P. Grant, R. Holzsager, I. Kastanas, and the proposer. 
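
The construction in the solution is easy to run. The sketch below, with hypothetical function names, iterates the map $(s, u) \mapsto (3s + u,\ 8s + 3u)$ starting from (0, 1), recovers t from u = 2t + 1, and lists the six factorizations of $x^2 - 5, \ldots, x^2$. The pair (s, u) = (6, 17) gives t = 8 and x = 70, which reproduces the numerical example in the editorial comment.

```python
def pell_pairs(count):
    # Solutions of u^2 = 8 s^2 + 1 generated from (s, u) = (0, 1).
    s, u = 0, 1
    for _ in range(count):
        s, u = 3 * s + u, 8 * s + 3 * u
        yield s, u

def six_factorizations(s, u):
    # Factor pairs (a, b) with a * b = x^2 - 5, ..., x^2, in that order.
    t = (u - 1) // 2
    x = 2 * s * s - 2
    assert x == t * t + t - 2  # both parametrizations of x agree
    return x, [
        (x - 2 * t + 1, x + 2 * t + 3),  # x^2 - 5
        (x - 2, x + 2),                  # x^2 - 4
        (x - 2 * s + 1, x + 2 * s + 1),  # x^2 - 3
        (x - t, x + t + 1),              # x^2 - 2
        (x - 1, x + 1),                  # x^2 - 1
        (x, x),                          # x^2
    ]

if __name__ == "__main__":
    for s, u in list(pell_pairs(5))[1:]:
        x, factors = six_factorizations(s, u)
        worst_ratio = max(b / a for a, b in factors)
        print(x, factors, worst_ratio)
```

The largest ratio b/a among the six pairs tends to 1 as the solutions grow, which is why the run is eventually $\varepsilon$-squarish for any fixed $\varepsilon > 0$.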
The Perimeter of an Oval


10227 [1992, 463]. Proposed by Antonio Montes, Universitat Politécnica de Catalunya, 
Barcelona, Spain. 


Suppose that $\alpha$ is a convex simple closed curve in the plane which is piecewise $C^1$ and
suppose that the origin lies inside $\alpha$.
(a) Show that the length of $\alpha$ is given by

$$-\oint_\alpha \vec\rho \cdot d\vec\tau,$$

where $\vec\tau$ is the unit tangent vector in a counterclockwise direction and $\vec\rho$ is the vector from
the origin to the curve.

(b) If, in addition, $\alpha$ is $C^1$ and piecewise $C^2$, show that the length of $\alpha$ is given by

$$\int_0^{2\pi} r(\theta)^2 \kappa(\theta)\,d\theta,$$

where $r(\theta)$ indicates the distance from the origin to the point on $\alpha$ with polar angle $\theta$ and
$\kappa(\theta)$ denotes the curvature of $\alpha$ at the point $(\theta, r(\theta))$.


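Solution I by Richard Holzsager, The American University, Washington, DC. (a) One
representation of the standard integral for length is $\int d\vec\rho \cdot \vec\tau$. Suppose $\alpha$ is broken by points
$p_0, p_1, \ldots, p_n = p_0$ into $C^1$ segments $\alpha_1, \alpha_2, \ldots, \alpha_n$. Integrating by parts on the segment
$\alpha_j$ gives

$$\text{length}_j = \int_{\alpha_j} d(\vec\rho \cdot \vec\tau) - \int_{\alpha_j} \vec\rho \cdot d\vec\tau = \vec p_j \cdot \vec\tau_{j,1} - \vec p_{j-1} \cdot \vec\tau_{j,0} - \int_{\alpha_j} \vec\rho \cdot d\vec\tau,$$

where the subscripts 0 and 1 on $\vec\tau$ refer to the values at the two ends of the segment. If we
sum over j, and let $d\vec\tau_j$ represent the difference between the outgoing tangent $\vec\tau_{j+1,0}$ and
the incoming tangent $\vec\tau_{j,1}$, then summation gives

$$\text{length} = -\sum_j \int_{\alpha_j} \vec\rho \cdot d\vec\tau - \sum_j \vec p_j \cdot d\vec\tau_j = -\oint_\alpha \vec\rho \cdot d\vec\tau,$$

provided we interpret the last integral in the Riemann-Stieltjes sense, taking the gaps $d\vec\tau_j$
into account.

(b) Let $\rho(\theta) = r(\theta)e^{i\theta}$ in complex notation. Since the curve is convex and surrounds
the origin the normal $\nu$ equals $i\tau$. Then

$$\tau(s) = \frac{d\rho}{ds} = \frac{d\rho}{d\theta}\frac{d\theta}{ds} = \frac{d\theta}{ds}\left(r'(\theta)e^{i\theta} + i r(\theta)e^{i\theta}\right)$$

and

$$\nu(s) = i\tau(s) = \frac{d\theta}{ds}\left(i r'(\theta)e^{i\theta} - r(\theta)e^{i\theta}\right),$$

where s is the arclength parameter. But $\tau'(s) = \kappa(s)\nu(s)$ which implies

$$\rho(s) \cdot \frac{d\tau}{ds} = \kappa(s)\rho(s) \cdot \nu(s) = -\frac{d\theta}{ds}\, r(\theta(s))^2 \kappa(\theta(s)),$$

and (b) then follows from (a) by the transformation of integral formula.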


Solution II by G. D. Chakerian, University of California, Davis, CA, and W. J. Firey, Oregon
State University, Corvallis, OR. Let K denote a plane convex body with interior points; write
$\alpha$ for the boundary of K. Choose rectangular coordinates (x, y) such that (0, 0) is an interior
point of K. Assign the counter-clockwise orientation to $\alpha$.

For each $\varphi$, $0 \le \varphi < 2\pi$, there is a unique half-plane of points (x, y) such that

$$x\cos\varphi + y\sin\varphi \le h(\varphi) \tag{1}$$

contains K and its bounding line $L(\varphi)$ meets $\alpha$ but not the interior of K. $L(\varphi)$ is the support
line to K which corresponds to the outer normal direction $\nu(\varphi) = (\cos\varphi, \sin\varphi)$ to (1) and
to K and $\alpha$; $h(\varphi)$ is the positive distance from (0, 0) to $L(\varphi)$. Orient $L(\varphi)$ consistently
with the orientation of $\alpha$. Thus $\tau(\varphi) = (-\sin\varphi, \cos\varphi)$ is a generalized forward tangent
direction corresponding to $\nu(\varphi)$.


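$K \cap L(\varphi)$ is either a point $(x(\varphi), y(\varphi))$ or a closed segment of points

$$(x(\varphi) - \lambda\sin\varphi,\ y(\varphi) + \lambda\cos\varphi), \qquad 0 \le \lambda \le l,$$

where $l$ is the length of $K \cap L(\varphi)$. Define $\rho(\varphi) = (x(\varphi), y(\varphi))$, that is either the point
$K \cap L(\varphi)$, or the initial point of $K \cap L(\varphi)$. Note that

$$(x(\varphi) - \lambda\sin\varphi)\cos\varphi + (y(\varphi) + \lambda\cos\varphi)\sin\varphi = \rho(\varphi) \cdot \nu(\varphi) = h(\varphi) \tag{2}$$

for all allowed $\lambda$. This shows that, when $K \cap L(\varphi)$ is a segment, any systematic definition
of $\rho(\varphi)$ as a point on that segment will produce the same result. Also

$$d\tau(\varphi) = (-\cos\varphi, -\sin\varphi)\,d\varphi = -\nu(\varphi)\,d\varphi. \tag{3}$$

The convexity of K ensures that all discontinuities of $\rho(\varphi)$ are finite jumps of the form

$$\lim_{\varphi \to \varphi_0 +} (\rho(\varphi) - \rho(\varphi_0)) = (x(\varphi_0) - l\sin\varphi_0,\ y(\varphi_0) + l\cos\varphi_0) - (x(\varphi_0), y(\varphi_0)) = l\,\tau(\varphi_0)$$

at $\varphi_0$. Each corresponds to a segment of length $l$ in $\alpha$; the set of these discontinuities is
countable and the corresponding sum of lengths $l$ is finite. This is all due to the convexity of
K. Hence any product $\rho(\varphi) \cdot u(\varphi)$, $u(\varphi)$ continuous, is integrable over $0 \le \varphi \le 2\pi$.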

Part (a) of the problem requires us to prove that the length of $\alpha$ is

$$-\oint \rho \cdot d\tau. \tag{4}$$

Here $\alpha$, $\rho(\varphi)$, $\tau(\varphi)$ have the meanings given above. Any one of the ways of defining the
length of $\alpha$ (e.g. by Hausdorff measure, integral-geometric procedures, or as a limit of
lengths of approximating polygons) leads to the representation of the length of $\alpha$ as

$$\int_0^{2\pi} h(\varphi)\,d\varphi \tag{5}$$

valid for any $\alpha$, since the convexity of K entails the continuity of $h(\varphi)$. Thus it suffices to
prove that (4) equals (5). This is immediate from (2) and (3). Note that the restrictions in
the statement of part (a) of the problem are not needed.

Part (b) is obtained from part (a) by transforming the integral as in Solution I. Here the
assumption that $\alpha$ is piecewise $C^2$ is used to provide integrable expressions in the formulas.


Editorial comment. Solution II shows that one can obtain a stronger result by exploiting 
the convexity. This approach was also taken by H. W. Guggenheimer. The other solvers 


followed the approach of Solution I. 
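
Both length formulas can be tried numerically on an ellipse. The sketch below is an illustration under stated assumptions: the semi-axes are arbitrary, the function names are hypothetical, and the ellipse is parametrized by $t \mapsto (A\cos t, B\sin t)$, for which the support function is $h(\varphi) = \sqrt{A^2\cos^2\varphi + B^2\sin^2\varphi}$. The integral in part (b) is evaluated in the parameter t, using $d\theta/dt = (x\,y' - y\,x')/r^2$, and both integrals are compared with the length of a fine inscribed polygon.

```python
import math

A, B = 3.0, 2.0  # semi-axes of the test ellipse (arbitrary choice)

def polygon_length(n):
    # Length of the inscribed n-gon with vertices at equally spaced t.
    pts = [(A * math.cos(2 * math.pi * k / n), B * math.sin(2 * math.pi * k / n))
           for k in range(n)]
    return sum(math.hypot(pts[(k + 1) % n][0] - pts[k][0],
                          pts[(k + 1) % n][1] - pts[k][1]) for k in range(n))

def support_integral(n):
    # Formula (5) of Solution II: integral of the support function h.
    step = 2 * math.pi / n
    return step * sum(math.hypot(A * math.cos(k * step), B * math.sin(k * step))
                      for k in range(n))

def curvature_integral(n):
    # Part (b): integral of r^2 * kappa d(theta), written in the parameter t.
    step = 2 * math.pi / n
    total = 0.0
    for k in range(n):
        t = k * step
        x, y = A * math.cos(t), B * math.sin(t)
        dx, dy = -A * math.sin(t), B * math.cos(t)
        ddx, ddy = -x, -y
        kappa = (dx * ddy - dy * ddx) / (dx * dx + dy * dy) ** 1.5
        r2 = x * x + y * y
        dtheta_dt = (x * dy - y * dx) / r2
        total += r2 * kappa * dtheta_dt
    return step * total

if __name__ == "__main__":
    print(polygon_length(200000), support_integral(2000), curvature_integral(2000))
```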


Solved also by J. Anglesio (France), R. J. Chapman (U. K.), H. W. Guggenheimer, A. Nijenhuis, and the proposer. 
Powerful Inequalities


10261 [1992, 872]. Proposed by Wu Wei Chao, He Nan Normal University, Xin Xiang City, 
He Nan Province, China. 


Let x be a real number such that $0 < x < \pi/4$. Prove that

$$(\sin x)^{\sin x} < (\cos x)^{\cos x}.$$


Solution I by Alain Salinier, Université de Limoges, Limoges, France. We produce a positive
lower bound on $\ln(\cos x) - \tan x \ln(\sin x)$ for $0 < x < \pi/4$. For all real numbers a, b, and
$\lambda$ with a > 0, b > 0, $a \ne b$ and $\lambda \in (0, 1)$, the strict concavity of the natural logarithm gives

$$\ln(\lambda a + (1 - \lambda)b) > \lambda \ln a + (1 - \lambda)\ln b. \tag{1}$$

Let $x \in \mathbb{R}$ with $0 < x < \pi/4$ and define $a = \sin x$, $b = \sin x + \cos x$, $\lambda = \tan x$. The
above conditions are satisfied, and $\lambda a + (1 - \lambda)b = \cos x$, so (1) gives

$$\ln(\cos x) > \tan x \ln(\sin x) + (1 - \tan x)\ln(\sin x + \cos x).$$

The last term is positive since $\tan x < 1$ and $\sin x + \cos x = \sqrt{2}\cos(\pi/4 - x) > 1$.


Solution II by Ilias Kastanas, California State University, Los Angeles, CA. Let $u = \sin^2 x$
and $v = \cos^2 x$. Then u + v = 1 and 0 < u < 1/2 < v < 1. The desired inequality is
equivalent to the following chain of inequalities:

$$\sqrt{u}\,\ln u < \sqrt{v}\,\ln v$$

$$\sqrt{u}\left(v + \frac{v^2}{2} + \frac{v^3}{3} + \cdots\right) > \sqrt{v}\left(u + \frac{u^2}{2} + \frac{u^3}{3} + \cdots\right)$$

$$v^{1/2} + \frac{v^{3/2}}{2} + \frac{v^{5/2}}{3} + \cdots > u^{1/2} + \frac{u^{3/2}}{2} + \frac{u^{5/2}}{3} + \cdots$$

Since all coefficients and exponents are positive, the last series represents an increasing
function, and the result follows.


Solution III by Heinz-Jürgen Seiffert, Berlin, Germany. Since $0 < \tan x < 1$, the weighted
arithmetic-geometric mean inequality gives

$$(1 + \tan^2 x)^{1 - \tan x} (\tan^2 x)^{\tan x} < (1 - \tan x)(1 + \tan^2 x) + \tan x \tan^2 x. \tag{2}$$

The left side of (2) is $(\sin^2 x)^{\tan x} / \cos^2 x$, and the right side is $1 + \tan^2 x - \tan x < 1$.
Hence $(\sin^2 x)^{\tan x} < \cos^2 x$, from which the desired result follows by raising both sides to
the power $(\cos x)/2$.


Solution IV by Mark A. Pinsky, Northwestern University, Evanston, IL. Write $a = \sin x$ and
$b = \cos x$. We must show $a \ln a < b \ln b$ for $0 < x < \pi/4 \approx .7853981635$. The function
$f(u) = u \ln u$ is convex for 0 < u with a minimum at $u = e^{-1}$ and $f(e^{-1}) = -e^{-1}$. If
$x > \arcsin(e^{-1}) \approx .3767275081$, then $e^{-1} < a < b$ implies f(a) < f(b) as required.
Otherwise, (a, f(a)) is below the chord joining (0, 0) to $(e^{-1}, -e^{-1})$, and (b, f(b))
is above the tangent to y = f(x) at (1, 0). Thus $a \ln a < -a$, $b \ln b > b - 1$, and
$b \ln b - a \ln a > b - 1 + a$. Since $(a + b)^2 = 1 + 2ab > 1$, the result follows.

Note that we have shown the more general result that 0 < a < b and a + b > 1 implies
$a^a < b^b$.


Solved by 81 readers (including those cited) and the proposer. Two incorrect solutions were also received. 
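
A quick numerical spot check of the inequality, together with the lower bound from Solution I, can be sketched as follows. This is only a sanity check on a grid, not a proof, and the function names are hypothetical. The first function is the difference $\cos x \ln\cos x - \sin x \ln\sin x$, which is positive exactly when $(\sin x)^{\sin x} < (\cos x)^{\cos x}$; the second is the positive term $(1 - \tan x)\ln(\sin x + \cos x)$ from Solution I.

```python
import math

def log_gap(x):
    # cos(x) ln cos(x) - sin(x) ln sin(x); positive iff the inequality holds.
    return math.cos(x) * math.log(math.cos(x)) - math.sin(x) * math.log(math.sin(x))

def salinier_bound(x):
    # Lower bound from Solution I for ln(cos x) - tan(x) ln(sin x).
    return (1.0 - math.tan(x)) * math.log(math.sin(x) + math.cos(x))

def salinier_left_side(x):
    return math.log(math.cos(x)) - math.tan(x) * math.log(math.sin(x))

if __name__ == "__main__":
    for k in (1, 250, 500, 750, 999):
        x = (math.pi / 4) * k / 1000
        print(x, log_gap(x), salinier_left_side(x), salinier_bound(x))
```

Multiplying the quantity in Solution I by cos x gives the first function, so the two checks agree in sign by construction.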
Collaborating editors: David F. Appleyard, Paul T. Bateman, Bruce C. Berndt, Duane
M. Broline, Barry W. Brunson, Frank S. Cater, Gulbank D. Chakerian, Underwood Dudley, 
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. Griggs, 
Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, Daniel J. Kleitman, 
Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer, Stephen L. Portnoy, J. O. Shallit, John 
Henry Steelman, Kenneth B. Stolarsky, David E. Tepper, Douglas B. Tyler, Daniel Ullman, and 
William E. Watkins. 


Two EZ Proofs of $\sin^2 z + \cos^2 z = 1$


Leon EHRENPREIS and Doron ZEILBERGER 


The two proofs contrast two origins of the sine function: (a) the zeros of 
sin z, and (b) the power series for sin z. 

For the proof based on (a), we study the zeros of $f(z) := \sin^2 z + \cos^2 z - 1$.
Observe that f vanishes when $z = n\pi$ or $(n + \tfrac{1}{2})\pi$, for any integer n. This
is the Pythagorean theorem for degenerate triangles. But, even more, f has a
second order zero at these points, because f is even (so it must have a double
zero at z = 0), and periodic of period $\pi/2$.

Now we appeal to a theorem on entire functions. f(z) is an entire function
of exponential type 2. In fact $|f(z)| \le Ce^{2|z|}$. It is a standard consequence of
the argument principle that such a function cannot have more than $(2 + \varepsilon)r/\pi$
zeros (counting multiplicities) in $|z| \le r$, unless it vanishes identically. But we
have produced $4r/\pi$ zeros there. Thus $f(z) \equiv 0$. □

The same method of proof can be used to prove many identities for elliptic 
functions. 

For the proof based on (b), observe that sin z (resp. cos z) is the exponential
generating function (henceforth e.g.f.)¹ for increasing sequences of
integers of odd (resp. even) size, weighted by $(-1)^{\lfloor \text{size}/2 \rfloor}$. Hence $\sin^2 z + \cos^2 z$
is the e.g.f. for ordered pairs of increasing sequences of integers
$(a_1 < \cdots < a_r;\ b_1 < \cdots < b_s)$ such that r + s is even, $\{a_1, \ldots, a_r, b_1, \ldots, b_s\} = \{1, 2, \ldots, r+s\}$,
and the weight is $(-1)^{\lfloor r/2 \rfloor + \lfloor s/2 \rfloor}$. Let the mate of such a pair
be $(a_1, \ldots, a_r, b_s;\ b_1, \ldots, b_{s-1})$ if $a_r < b_s$, and $(a_1, \ldots, a_{r-1};\ b_1, \ldots, b_s, a_r)$ if
$a_r > b_s$. Since every pair has opposite sign from its mate, the total weight
of each couple is 0. The only left-over is the pair (empty, empty) that has no
mate; its weight is 1, and its size is 0, hence its e.g.f. is 1. □

A similar (but not identical) proof was found independently by Ed Schein- 
erman and will appear in Math Magazine. 
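
The sign-reversing pairing used in the proof based on (b) can be checked mechanically for small sizes. The sketch below uses hypothetical function names and treats an empty sequence as having last element 0 (an implementation convention, so that the largest label always decides which way it moves). It enumerates all ordered pairs of increasing sequences that partition {1, ..., n} for even n, and confirms that the mate map is an involution that flips the weight, so the weights cancel.

```python
from itertools import combinations

def weight(a, b):
    # Weight (-1)^(floor(r/2) + floor(s/2)) of the pair (a; b).
    return (-1) ** (len(a) // 2 + len(b) // 2)

def mate(a, b):
    # Move the largest label to the end of the other sequence.
    # Only meant for nonempty label sets (n >= 1).
    last_a = a[-1] if a else 0
    last_b = b[-1] if b else 0
    if last_a < last_b:
        return a + (last_b,), b[:-1]
    return a[:-1], b + (last_a,)

def pairs(n):
    # All ordered pairs of increasing sequences partitioning {1, ..., n}.
    labels = range(1, n + 1)
    for r in range(n + 1):
        for a in combinations(labels, r):
            b = tuple(x for x in labels if x not in a)
            yield a, b

if __name__ == "__main__":
    for n in (2, 4, 6):
        print(n, sum(weight(a, b) for a, b in pairs(n)))
```

For even n every subset a qualifies, because r + s = n forces r and s to have the same parity; the total weight printed for each such n is the coefficient of $z^n/n!$ in $\sin^2 z + \cos^2 z$.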

Temple University 
Philadelphia, PA 19122 

[leon,zeilberg]@math.temple.edu
Nisan 12, 5754. 


¹ The e.g.f. of a combinatorial family of labelled objects according to a weight w is
$\sum_{n=0}^{\infty} a_n z^n / n!$, where $a_n$ is the sum of the weights of all the objects of size n. The product
A × B of two such combinatorial families is the set of ordered pairs (a, b), $a \in A$, $b \in B$, with
the labels of a and b disjoint, size(a, b) := size(a) + size(b), and weight(a, b) = weight(a) weight(b).
It is easy to see that the e.g.f. of A × B is the product of the e.g.f.s of A and B. See D. Foata,
M. Schützenberger, LNM #138 (Springer), and H. Wilf, “generatingfunctionology”, Academic
Press.
How to Teach Mathematics. By Steven G. Krantz. American Mathematical


Society, 1993, v + 76 pp., $21.00. 


Reviewed by Meyer Jerison 


Addressing teachers of college mathematics, Steven G. Krantz proposes in How to 
Teach Mathematics “to set down the traditional principles of good teaching in
mathematics” as he sees them. “While perhaps most experienced mathematics
instructors would agree with much of what is in this booklet,” he says, “in the final
analysis this tract must be viewed as a personal polemic on how to teach.” 

His central message is that good teaching is important not only for the 
administration and the students but also for the morale of the teacher and that 
thought and effort devoted to preparation will pay off handsomely in time, effort, 
and satisfaction in performance. In 47 short sections, he makes concrete sugges- 
tions, aimed primarily at the inexperienced instructor, on how to be a good 
teacher. He encourages instructors to choose methods for themselves: “If you do 
not agree in every detail with what I say, then I hope that at least my remarks will 
give you pause for thought. In the end, you must decide for yourself what will take 
place in your classroom.” 

First and above all he emphasizes instructors’ respect for themselves as well as 
for their audience, noting that “exercising patience requires no more effort than 
exercising your vocal chords with an insulting remark.” He predicts that “things
will go smoothly if the attitude in your class is that you and the students are 
working together to conquer the material ... make it clear from the outset that 
you are on the student’s side.” Again and again he urges the instructor to be 
thoroughly prepared and to speak and write clearly. He strongly favors encourag- 
ing questions from students and answering them with respect, and he gives 
examples of good and bad ways to answer questions. 

As an example of his ideas on presentation I cite his criticism of the formulation 
in many calculus books of Green’s theorem with special hypotheses on the domain. 
“This is because the authors are looking ahead to the proof, and want to state the 
theorem in precisely the form in which it will be proved. The entire approach is 
silly. Why not state Green’s theorem in complete generality? Then it is simple, 
sweet, and students can see what the principal idea is. When it is time for the 
proof, just say ‘to keep the proof simple, and to avoid technical details, we restrict 
attention to a special class of domains’.”

He is a skeptic on innovations in teaching. “This author, and this booklet,” he 
says, “has a built in bias toward traditional methods, such as lectures ... There are 
those who will criticize this book for being reactionary. I welcome their remarks.” 
He is clearly not an admirer of professional education. He does, nevertheless, 
endorse the periodical UME Trends, saying “It is important that we explore new
methods to teach mathematics.” 

He has evidently thought seriously about computers and programmable calcula- 
tors in mathematics courses and claims that we do not yet know how they can best 
contribute to learning. He is convinced that they are being oversold, and asserts: 
“Using the quadratic formula is easy. Analyzing word problems is hard. A person 
who cannot do the first will also probably not be able to do the second—with or 
without the aid of a machine.” 

I think that virtually everyone will agree with what has been reported so far as 
the author’s advice or will grant him the right to his personal opinion. Some 
helpful advice might be added to his. 

In preparation for a lecture, for example, an inexperienced teacher should be 
warned that choice of notation can be critical. It is altogether too easy, when doing 
a problem, to get stuck in a corner with inconsistent notation. The students should 
be told how and why some of the choices are made. 

Krantz objects vigorously and properly to an examination where the average 
score is 32%. He says that students lack the perspective to estimate their standing, 
for example, by doubling their scores. But I would go even further and say that no 
reasonable student will equate a grade of 45% with one of 90%. The message of a 
grade of 45% is clearly that there is a huge gap in the student’s grasp of the 
material, and no amount of rationalization can change that. 

To the section entitled Why Do We Need Mathematics Teachers? I would add
two reasons. One is that the teacher provides a pace for the student. Most people 
need help in setting a reasonable pace for themselves when they try to master 
difficult material. The second is that the teacher aims to improve the student’s 
ability to read. The notion that some students cannot read while others can read 
obscures the reality that everybody (professors included) can read some material 
and cannot read other material. If I were to set a single criterion for the aim of a 
college education it would be to expand every student’s ability to read. Ideally, 
education does not end with college. The broader the range of material that a 
person can read by the time of graduation from college the more likely that that 
education will continue for the rest of her/his life. 

I do have some serious objections to the book. A major one is that the author 
does not apply his own “principles of good teaching” to the book itself. “The first 
few lectures ... set the tone for the entire semester” he says, and “respect ... the 
audience.” But the longest paragraph on the very first page of the preface is 
devoted to a caricature of the unenthusiastic professor who gives “a lecture 
ranging from dreary to arrogant to boring to calamitous.” That paragraph made 
me uncomfortable just as I was beginning to read the book, and I shuddered at the 
thought of seeing it quoted (partially) in some publication that trashes college 
teaching and attributes it to the publisher, the prestigious American Mathematical 
Society. 

In a similar vein, quite properly he warns against sarcasm or saying anything 
that will diminish a student’s self respect. But he says to the reader: “If that is not 
your role as teacher then what, pray tell, could it possibly be?” He warns:
“sometimes the professor just doesn’t realize that he/she is behaving in a manner 
that some students take amiss,” but there are statements in the booklet that some 
readers will take amiss. His remarks about education departments and about math 
anxiety are examples. 

The publisher likes the following quote from the preface so much that it is 
repeated on the outside cover: “The good news is that it requires no more effort,
no more preparation, and no more time to be a good teacher than to be a bad
teacher. The proof is in this booklet.” However, the author is realistic enough to 
contradict it often. On page 2 he says: “Throughout this booklet, I will repeatedly 
exhort you to prepare your lectures.” An entire section is entitled Prepare. It 
begins with a quotation from the bad teacher: “My time is too valuable. I am not
going to spend it preparing.” Then the author says: “I cannot over-emphasize the 
fact that preparation is of utmost importance if you are going to deliver an 
effective lecture or give a stimulating class.” (He also warns against over-prepara- 
tion.) In the section Lectures, he says: “you really have to work at making your 
lectures reach your students.” He then criticizes the tendency “not to put a lot of
effort into our teaching.” I believe that a case can be made for the proposition that 
preparation, time, and effort will pay off in the long run, and the good habits that 
the author urged teachers to develop will eventually lead to more effective 
teaching with less effort and more satisfaction than the minimal attention to 
teaching that he criticizes. But the booklet is “primarily for the graduate student 
or novice instructor” who is being urged to devote more effort, more preparation,
and more time than is needed just to get by in the classroom. 

As the subtitle of the booklet suggests, the author does not refrain from talking 
about himself, and that imparts a liveliness to the text, but it also has some curious 
consequences. At the beginning of the preface he writes of deterioration with age 
of the quality of teachers and says: “and I speak here of myself as much as
anyone.” But three pages later he writes: “I find that my teaching gets better and 
better.” 

In spite of the reservations that I have expressed, I can recommend that 
beginners read this book. They will find many useful suggestions for improving 
their effectiveness and consequently their comfort in the classroom. They will also 
be encouraged to think seriously about what they are doing. 


Department of Mathematics 
Purdue University 
West Lafayette, IN 47907-1395 


One of the big misapprehensions about mathematics
that we perpetrate in our classrooms is
that the teacher always seems to know the
answer to any problem that is discussed. This
gives students the idea that there is a book
somewhere with all the right answers to all of
the interesting questions, and that teachers
know those answers. And if one could get hold
of the book, one would have everything settled.
That’s so unlike the true nature of mathematics.


—Leon Henkin 


Teaching Teachers, Teaching Students, by L. A. Steen 
and D. J. Albers, eds., Boston: Birkhäuser, 1981, p. 89.
TELEGRAPHIC REVIEWS


Edited by Arnold Ostebee and Paul Zorn 


with the assistance of the Mathematics Departments of 
Carleton, Macalester, and St. Olaf Colleges 


Telegraphic Reviews are designed to alert readers in a timely manner to new books 
and computer software appropriate to mathematics teaching and research. Special 
codes classify reviews by subject area and appropriate use: 


T : Textbook               P : Professional Reading    1-4 : Semester
C : Computer Software      L : Undergraduate Library   ** : Special Emphasis
S : Supplementary Reading  13 : Grade Level            ?? : Questionable


Readers are advised that price information is subject to change. Selected books and 
software packages receive a second, more extensive review in the Monthly. 


Books and software submitted for review should be sent to Book Reviews Editor,
American Mathematical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield,
MN 55057-1098.


General, S(15-16). Der Goldene Schnitt. 
Hans Walser. BG Teubner Leipzig, 1993, 
140 pp, DM 16,80 (P). [ISBN 3-8154-2070-9] 
The golden mean as it appears in geometry and 
other areas (e.g., fractals, number sequences, 
probability). JD-B 


General, S*(13-16), L**. Newton’s Clock:
Chaos in the Solar System. Ivars Peterson. WH 
Freeman, 1993, xiii + 317 pp, $22.95. [ISBN 
0-7167-2396-4] An inviting history of celes- 
tial mechanics from antiquity (e.g., navigational 
star charts) to present (e.g., chaos vs. stability
in the solar system). Informal but informative, 
learned but readable. PZ 


General, S(15). Mathematik für Wirtschaftswissenschaftler.
Volker Nollau. BG Teubner
Leipzig, 1993, 260 pp, DM 29,80 (P). [ISBN 3-
8154-2046-6] An introduction to mathematics
germane to economics. Set theory, propositional
calculus, number systems, linear algebra
and optimization, sequences and series, calcu- 
lus, linear difference and differential equations, 
probability, random variables, probability dis- 
tributions. JD-B 


General, T(13-16), S***, P, L**. The USSR 
Olympiad Problem Book: Selected Problems 
and Theorems of Elementary Mathematics. 
D.O. Shklarsky, N.N. Chentzov, I.M. Yaglom. 
Transl: John Maykovich. Dover, 1993, xvi 
+ 452 pp, $10.95 (P). [ISBN 0-486-27709-
7] Unabridged republication of the 1962 W.H.
Freeman edition—among best problem collec- 
tions ever. 320 challenging problems with un- 
usually complete and instructive solutions. A 
real bargain. LCL 
Finite Mathematics, T(13: 1). Finite Mathematics,
Second Edition. Stanley I. Grossman.
Wm C Brown, 1993, xviii + 685 pp, $60.63. 
[ISBN 0-697-11351-5] Standard topics: sets, 
linear algebra, linear programming, counting, 
probability, statistics, Markov chains, finance. 
Many carefully worked examples; applications 
to business, biology. Uses real data. (1983 
Wadsworth text, TR, October 1983.) TH 


Education, P. Advances in Instructional 
Psychology, Volume 4. Ed: Robert Glaser.
Lawrence Erlbaum Assoc, 1993, xi + 347 pp, 
$79.95. [ISBN 0-8058-0709-8] Reports on 
research on reasoning and problem solving as 
fundamental components of learning and teach- 
ing. Two chapters focus on mathematics. MW 


Education, P. Handbook of Individual Dif- 
ferences, Learning, and Instruction. David H. 
Jonassen, Barbara L. Grabowski. Lawrence 
Erlbaum Assoc, 1993, xvii + 488 pp, $34.50 (P); 
$99.95. [ISBN 0-8058-1413-2; 0-8058-1412-4] 


Education, P. Schools for Thought: A Science 
of Learning in the Classroom. John T. Bruer. 
MIT Pr, 1993, x + 324 pp, $29.95. [ISBN 0- 
262-02352-0] Clear introduction to cognitive 
science, with persuasive arguments for its im- 
portance in educational reform. Summaries of 
representative research programs, examples of 
specific applications, implications for teacher 
education. Not specifically mathematical, but 
good general overview. MW 

Education, P. How to Start an Industrial 
Mathematics Program in the University. Avner 
Friedman, John Lavery. SIAM, 1993, v + 37 pp, 
free (P). [ISBN 0-89871-327-7] For departments
considering creating a graduate industrial
mathematics program; based on experiences of 
the Institute for Mathematics and its Applica- 
tions at the University of Minnesota. SM 


History, L. A History of Mathematical No- 
tations: Two Volumes Bound as One. Florian 
Cajori. Dover, 1993, $19.95 (P). [ISBN 0-
486-67766-4] V. I: Notations in Elementary
Mathematics, xvi + 451 pp; V. II: Notations
Mainly in Higher Mathematics, xii + 367 pp. 
For each symbol, details first appearance, ori- 
gins, spread, and competition. Entertaining, 
interesting, good browsing. LC 


Logic, P. Complexity of Proofs and Their 
Transformations in Axiomatic Theories. V.P. 
Orevkov. Transl. of Math. Mono., V. 128.
AMS, 1993, vi + 153 pp, $86. [ISBN 0-8218- 
4576-4] 


Logic, P. The Reconstruction of Trees from 
Their Automorphism Groups. Matatyahu Ru- 
bin. Contemp. Math., V. 151. AMS, 1993, viii 
+ 274 pp, $56 (P). [ISBN 0-8218-5187-X] 


Combinatorics, P. Surveys in Combinatorics, 
1993. Ed: Keith Walker. London Math. Soc. 
Lect. Note Ser., V. 187. Cambridge Univ Pr, 
1993, vii + 287 pp, $39.95 (P). [ISBN 0- 
521-44857-3] 9 survey papers from the 14th 
British Combinatorial Conference (July 1993). 


Discrete Mathematics, T(14-15: 1, 2). Discrete
Mathematics with Applications. H.F.
Mattson, Jr. Wiley, 1993, xxv + 637 pp, $61.95. 
[ISBN 0-471-60672-3] Opening chapters on 
sets, logic, induction, equivalence relations and 
partitions, functions. Then a tree of possi- 
bilities: integers, congruences, binomial theo- 
rem, counting, probability, recurrence, matri- 
ces, trees, graphs. Conversational style, pleas- 
ing design. Many examples, problems of all 
kinds. Less algorithmic than other texts. LCL 


Discrete Mathematics, T(16-17: 1), P*, L.
generatingfunctionology, Second Edition. Her- 
bert S. Wilf. Academic Pr, 1994, ix + 228 pp, 
$44.95. [ISBN 0-12-751956-4] Excellent in- 
troduction to theory and techniques of gen- 
erating functions. Second Edition has more 
problems, new applications, appendix on using 
Mathematica and Maple. (First Edition, TR 
and Extended Review, November 1990.) TH 


Number Theory, T(16). Primes and Program- 
ming: An Introduction to Number Theory with 
Computing. Peter Giblin. Cambridge Univ Pr, 
1993, x + 239 pp, $44.95; $19.95 (P). [ISBN 0- 
521-40182-8; 0-521-40988-8] An interesting, 
sophisticated introduction to number theory. 
Several Pascal programs, and many valuable 
computing exercises. Covers primes, congru- 
ences, pseudoprimes, cryptography, primitive 
roots, divisors, continued fractions, quadratic
residues. Have a look. SG 


Number Theory, P. Arithmetic of Quad- 
ratic Forms. Yoshiyuki Kitaoka. Tracts in 
Math., V. 106. Cambridge Univ Pr, 1993, x 
+ 268 pp, $54.95. [ISBN 0-521-40475-4] A
basic overview. After a general introduction 
to quadratic forms, systematically treats forms 
over the reals, rational local fields, p-adic inte- 
gers, and the integers. A few exercises. SG 


Linear Algebra, T(13-15: 1), L. Linear Algebra:
An Introduction to the Theory and Use of
Vectors and Matrices. Alan Tucker. Macmil- 
lan, 1993, xiii + 440 pp. [ISBN 0-02-421581- 
3] A new, refreshing look at elementary lin- 
ear algebra. Reflects recent ideas on curricu- 
lar reform and applications. Standard topics 
are enhanced by attractive examples, presen- 
tation. Scalar product, orthogonality are used 
systematically for matrix operations; eigenval- 
ues, eigenvectors are introduced early. Well- 
selected exercises. JS 


Algebra, T(13-14: 1), L. Classical Algebra, 
Third Edition. William J. Gilbert, Scott A. Van- 
stone. Waterloo Math Found (U. of Waterloo, 
Waterloo, Ontario, Canada N2L 3G1), 1993, 
vii + 248 pp, (P). [ISBN 0-921418-92-2] A 
curious, interesting, unusual book. Prepara- 
tion for college-level modern algebra: elemen- 
tary number theory, congruences, binomial the- 
orem, permutation groups, cryptography, com- 
plex numbers, fundamental theorem of algebra 
(with intuitive topological proof), polynomial 
equations over finite and infinite fields. JS 
Algebra, S(18), P. Basic Structures of Modern 
Algebra. Yuri Bahturin. Math. & Its Applic., 
V. 265. Kluwer Academic, 1993, ix + 419 pp,
$182. [ISBN 0-7923-2459-5] Best suited for 
expert reference; too terse, fast-paced for use 
as a beginning text. Topics include Galois the- 
ory, normed fields, simple groups, topological 
groups, Noetherian rings, central simple alge- 
bras, Lie algebras, homological algebras, alge- 
braic groups, varieties of algebras. No exam- 
ples or exercises. JS 


Algebra, T(18). Squares. A.R. Rajwade. Lon- 
don Math. Soc. Lect. Note Ser., V. 171. Cam- 
bridge Univ Pr, 1993, xii + 286 pp, $39.95 (P). 
[ISBN 0-521-42668-5] A well-written, un- 
pretentious introduction to squares and sums of 
squares in fields. Among topics considered: the 
state of a field, Hilbert’s 17th problem, Pfister 
forms, formally real fields. Some exercises. SG 
Algebra, T**(17: 2), P, L. Algebra: A Grad- 
uate Course. I. Martin Isaacs. Brooks/Cole, 
1994, xii + 516 pp, $67.50. [ISBN 0-534- 
19002-2] A delicious, beautiful, infectiously 
enthusiastic treatment of noncommutative and
commutative algebra. Classical approach (no 
tensor products, categories), but includes mod- 
ern topics. Nice examples, problems. RM 


Algebra, T(16-17). Finite Fields: Structure
and Arithmetics. Dieter Jungnickel. Bibli- 
ographisches Institut & FA Brockhaus, 1993, 
339 pp, DM 78. [ISBN 3-411-16111-6] Ex- 
plicit constructions, normal and dual bases, shift 
register sequences, characters and Gauss sums. 
Interesting commentaries; no exercises. SG 


Algebra, P. Computational Algebraic Geom- 
etry and Commutative Algebra. Eds: David 
Eisenbud, Lorenzo Robbiano. Symp. Mathe- 
matica, V. XXXIV. Cambridge Univ Pr, 1993, x 
+ 298 pp, $49.95. [ISBN 0-521-44218-4] Papers
from 1991 conference in Cortona on computational
aspects of Gröbner bases. Includes
tutorials, research surveys, open problems. RM 


Algebra, T(18: 1, 2), P. Quadratic Algebras, 
Clifford Algebras, and Arithmetic Witt Groups. 
Alexander J. Hahn. Universitext. Springer- 
Verlag, 1994, xi + 286 pp, $34 (P). [ISBN 
0-387-94110-X] Main themes: algebras and 
forms over a commutative ring, involutions on 
algebras, gradings and tensor products, separa- 
ble algebras. New results on representations of 
Clifford algebras, structure of the Arf algebra 
and the quadratic Witt group, connections be- 
tween the group of quadratic algebras and the 
discriminant group. TH 


Algebra, T**(18: 1), P*. Hopf Algebras and 
Their Actions on Rings. Susan Montgomery. 
CBMS Reg. Conf. Ser. in Math., No. 82. AMS, 
1993, xiv + 238 pp, $32 (P). [ISBN 0-8218- 
0738-2] Recent results on algebraic structure 
of Hopf algebras, their actions and coactions. 
Unifies theories of group, Lie algebra, and 
graded algebra actions. Provides an accessible 
introduction to quantum groups. TH 


Calculus, T(13). Calculus with Analytic Ge- 
ometry. Joe Repka. Wm C Brown, 1994, xxii + 
1321 pp, $75.40. [ISBN 0-697-06918-4] Not 
a lean book. Similar in coverage and emphasis 
to standard texts of the last 15 years. Readers 
may find the pages busy rather than lively: 4- 
color displays showing calculator keystrokes; 
historical asides; lots of colored graphics; 3- 
level problem sets with starred problems; cal- 
culator problems; and sometimes a “Point to 
Ponder.” AWR 


Calculus, P. Differentialrechnung für Funktionen
mit mehreren Variablen. Klaus Harbarth,
Thomas Riedrich, Winfried Schirotzek. BG 
Teubner Leipzig, 1993, 198 pp, DM 22,80 (P). 
[ISBN 3-8154-2041-5] 


Calculus, P. Integralrechnung für Funktionen
mit mehreren Variablen. Karl-Heinz Körber,
Ernst-Adam Pforr. BG Teubner Leipzig, 1993, 
199 pp, DM 22,80 (P). [ISBN 3-8154-2042-3] 


Calculus, T(13: 3, 4). Applied Calculus, Sec- 
ond Edition. Stanley I. Grossman. Wm C 
Brown, 1993, xix + 856 pp, $61.26. [ISBN 0- 
697-11350-7] Introduction to calculus stress- 
ing applications to business, economics, social 
sciences, biology. Many solved examples, re- 
alistic applications, historical notes. KB 


Calculus, C. Insight: Demonstration Software 
for Calculus and Analytic Geometry, Eighth 
Edition, by Thomas and Finney.  IntelliPro. 
Addison-Wesley, 1992, ii + 6 pp, (P) with 
PC disk. [ISBN 0-201-90897-2] Graphics 
mostly well done. Good animation, but resolution
is limited; some ε-δ pictures are hard to decipher.
Many examples prompt user for parameter
values, allowing experiments. Best used in
careful coordination with classroom work. SM 


Calculus, P. Differential- und Integralrechnung
für Funktionen mit einer Variablen. Ernst-
Adam Pforr, Winfried Schirotzek. BG Teubner 
Leipzig, 1993, 302 pp, DM 28,80 (P). [ISBN 
3-8154-2040-7] 

Real Analysis, T(18). Measure Theory. J.L. 
Doob. Grad. Texts in Math., V. 143. Springer- 
Verlag, 1994, xii + 210 pp, $49. [ISBN 0- 
387-94055-3] A concise exposition. Mea- 
sure spaces and measurable functions, inte- 
gration and Hilbert space, measure sequences 
and signed measures, functions of bounded 
variation and Martingale theory. Probabil- 
ity notions are kept consistently before reader. 
No exercises (except to supply some missing 
proofs). AWR 


Differential Equations, P. Asymptotic Anal- 
ysis: Linear Ordinary Differential Equations. 
Mikhail V. Fedoryuk. Transl: Andrew Rod- 
ick. Springer-Verlag, 1993, viii + 363 pp, $129. 
[ISBN 0-387-54810-6] Translation of 1983 
Russian original. A comprehensive reference 
for asymptotic methods. SK 


Differential Equations, T(18), P. Introduction 
to Functional Differential Equations. Jack K. 
Hale, Sjoerd M. Verduyn Lunel. Appl. Math. 
Sci., V. 99. Springer-Verlag, 1993, x + 447 pp, 
$49. [ISBN 0-387-94076-6] Lively, thought- 
ful introduction. Motivates theory with applica- 
tions (historical and recent), and the promise of 
a rich mathematical structure. Emphasizes dy- 
namics, equations that depend on “past history.” 
Well-written; extensive bibliography. MLR 

Dynamical Systems, T(15-18: 1), S, L. Fractals
Everywhere, Second Edition. Michael F.
Barnsley. Academic Pr, 1993, xiv + 531 pp, 
$49.95. [ISBN 0-12-079061-0] New features 
include extensive exercise solutions, some new
problems. A heady mix of analysis, geometry, 
dynamics, and computing, forcefully exposited 
and vividly illustrated. (First Edition, TR, April 
1989; Extended Review, March 1990.) PZ 


Numerical Analysis, T(16-17: 1), L. Iterative
Solution of Large Sparse Systems of Equations.
Wolfgang Hackbusch. Appl. Math. Sci.,
V. 95. Springer-Verlag, 1994, xxi + 429 pp, 
$59. [ISBN 0-387-94064-2] Slightly revised 
translation from German. Treats Gauss-Seidel, 
conjugate gradient, multigrid, and domain de- 
composition methods. Examples from numeri- 
cal PDE’s. SM 


Numerical Analysis, P. Moving-grid Methods 
for Time-dependent Partial Differential Equa- 
tions. P.A. Zegeling. CWI Tract, V. 94. Centrum 
voor Wiskunde en Informatica, 1993, 168 pp, 
Dfl. 50 (P). [ISBN 90-6196-424-5] 


Numerical Analysis, P, L. Templates for the 
Solution of Linear Systems: Building Blocks 
for Iterative Methods. Richard Barrett, et 
al. SIAM, 1994, xiii + 112 pp, $18 (P). 
[ISBN 0-89871-328-5] Provides templates 
(pseudocode descriptions of algorithms) to help 
with choice and implementation of methods ef- 
fective on high-performance computers. AO 


Numerical Analysis, P. Numerical Methods 
for the Three-dimensional Shallow Water Equa- 
tions on Supercomputers. E.D. de Goede. CWI 
Tract, V. 88. Centrum voor Wiskunde en In- 
formatica, 1993, 124 pp, Dfl. 40 (P). [ISBN 
90-6196-417-2] 


Numerical Analysis, P. Numerical Linear Al- 
gebra. Eds: Lothar Reichel, Arden Ruttan, 
Richard S. Varga. Walter de Gruyter, 1993, 
ix + 199 pp, DM 168. [ISBN 3-11-013784-4] 
Proceedings of a 1992 conference at Kent State 
University. 


Operator Theory, T?(16-17: 1, 2), S, P. Theory
of Linear Operators in Hilbert Space: Two
Volumes Bound As One. N.I. Akhiezer, I.M. 
Glazman. Transl: Merlynd Nestell. Dover, 
1993, xi + 218 pp, $9.95 (P). [ISBN 0-486-
67748-6] Unaltered republication of 1961 and
1963 editions. Bounded and unbounded opera- 
tors; spectral analysis of completely continuous, 
unitary, and self-adjoint operators; extensions 
of symmetric operators. Appendices on differ- 
ential operators and Naimark-Krein theory of 
generalized extensions of symmetric operators. 
No exercises. HD 


Functional Analysis, P. Evolutionary Integral
Equations and Applications. Jan Prüss.
Mono. in Math., V. 87. Birkhäuser, 1993, xxvi
+ 366 pp, $139. [ISBN 0-8176-2876-2] 
Functional Analysis, P. A Topological Introduction
to Nonlinear Analysis. Robert F.
Brown. Birkhäuser, 1993, viii + 146 pp,
$24.50 (P). [ISBN 0-8176-3706-0] Topolog- 
ical approach to Krasnoselski-Rabinowitz bi- 
furcation theorem and its application to Euler 
buckling of columns. SK 


Functional Analysis, P. Problèmes d’Analyse
Fonctionnelle et d’Analyse Harmonique. M.
Samuelides, L. Touzillier. Cépaduès-Éditions,
1993, vi + 391 pp. [ISBN 2-85428-257-4] 


Analysis, P. Recent Advances in Wavelet Anal- 
ysis. Eds: Larry L. Schumaker, Glenn Webb. 
Wavelet Analysis & Its Applic., V.3. Academic 
Pr, 1994, xi + 364 pp. [ISBN 0-12-632370-4] 
Ten papers on theoretical and practical applica- 
tions of wavelet analysis. 


Analysis, T**(15-17: 2), S**, L**. Advanced
Calculus: A Differential Forms Approach.
Harold M. Edwards. Birkhäuser, 1994,
xv + 508 pp, $49.50. [ISBN 0-8176-3707-9] 
An inviting, unusual, high-level introduction 
to vector calculus, based solidly on differen- 
tial forms. Superb exposition: informal but 
sophisticated, down-to-earth but general, ge- 
ometrically and physically intuitive but math- 
ematically rigorous, entertaining but serious. 
Remarkably diverse applications, physical and 
mathematical. Still fresh and novel at age 25. 
(1969 original edition, TR, October 1969; 1980 
edition, TR, March 1981; Extended Review, 
March and December 1982.) PZ 


Analysis, S(15-18), P, L. Exercises for Fourier 
Analysis. T.W. Körner. Cambridge Univ Pr,
1993, x + 385 pp, $54.95; $22.95 (P). [ISBN
0-521-43276-6; 0-521-43849-7] Chapter by
chapter exercises for the author’s splendid 
Fourier Analysis (TR, May 1990). KS 


Analysis, T(17), S, P, L. Wavelets: An Ele- 
mentary Treatment of Theory and Applications. 
Ed: Tom H. Koornwinder. Ser. in Approx. & 
Decompositions, V. 1. World Scientific, 1993, 
xii + 225 pp, $48. [ISBN 981-02-1388-3] 12 
papers from a 4-day course at CWI, Amster- 
dam. First 6 form nice introduction to (affine) 
wavelet theory; others treat applications, spe- 
cial aspects. KS 


Algebraic Geometry, P. Computational Alge- 
braic Geometry. Eds: Frédéric Eyssette, André 
Galligo. Progress in Math., V. 109. Birkhäuser,
1993, ix + 328 pp, $64.50. [ISBN 0-8176-3678- 
1] 21 papers from the MEGA-92 symposium 
held in Nice, France. 


Algebraic Geometry, P.  Cohomological 
Methods in Transformation Groups. C. All- 
day, V. Puppe. Stud. in Adv. Math., V. 32. 
Cambridge Univ Pr, 1993, xi + 470 pp, $69.95.
[ISBN 0-521-35022-0] 


Algebraic Geometry, P. Queen’s Lectures on 
Arithmetical Composition of Quadratic Forms. 
Anatolii N. Andrianov. Papers in Pure & Appl. 
Math., V. 92. Queen Univ, 1992, ii + 62 pp, (P).
[ISBN 0-88911-634-2] 


Geometry, T(15-16: 1, 2), L. Modern Ge- 
ometries, Fourth Edition. James R. Smart. 
Brooks/Cole, 1993, xiii + 410 pp, $51.50. 
[ISBN 0-534-21198-4] New edition has more 
examples and figures, short section on frac- 
tals, stronger bibliography, and more historical 
notes. (First Edition, TR, November 1973; Ex- 
tended Review, August-September 1974.) DP 


Algebraic Topology, P. Morse Homology. 
Matthias Schwarz. Progress in Math., V. 111. 
Birkhäuser, 1993, ix + 235 pp, $49.50. [ISBN
0-8176-2904-1] 

Algebraic Topology, P. Nielsen Theory and 
Dynamical Systems. Ed: Christopher K. Mc- 
Cord. Contemp. Math., V. 152. AMS, 1993, 
xii + 350 pp, $52 (P). [ISBN 0-8218-5181- 
0] Proceedings of the 1992 AMS-IMS-SIAM 
Joint Summer Research Conference held at 
Mt. Holyoke College. 

Topology, T(15-16: 1). Topology of Sur- 
faces. L. Christine Kinsey. Undergrad. Texts 
in Math. Springer-Verlag, 1993, viii + 262 pp,
$39. [ISBN 0-387-94102-9] Choice of topics 
is “fairly random:” about equal parts point-set, 
combinatorial, and algebraic topology. SK 


Topology, P*, L. Knot Theory. Charles Liv- 
ingston. Carus Math. Mono., V. 24. MAA, 
1993, xviii + 240 pp, $31.50. [ISBN 0-88385- 
027-3] A lively exposition; requires only ba- 
sic linear algebra. Covers (1) history and foun- 
dations of knot theory; (2) advanced topics: 
symmetry, Alexander polynomials, numerical 
invariants; (3) recent advances: the Conway, 
Jones, and Kauffman polynomials. TH 


Operations Research, S(15-17), C, L. Ex- 
ploring Interior-Point Linear Programming: 
Algorithms and Software. Ami Arbel. Found. 
of Comp. Ser. MIT Pr, 1993, xxiv + 211 pp, 
$35 (P), with disk. [ISBN 0-262-51073- 
1] Treats 3 interior-point linear programming
methods: primal affine scaling, dual affine scaling,
and primal-dual methods. With excellent
demonstration software (DOS) for problems 
with up to 100 constraints. No exercises. SM 

Optimization, P, L. Optimization Software 
Guide. Jorge J. Moré, Stephen J. Wright. Fron- 
tiers in Appl. Math., V. 14. SIAM, 1993, 
xii + 154 pp, $24.50 (P). [ISBN 0-89871- 
322-6] Notes for 1992 SIAM annual meeting
short course. First part overviews algorithms
for different classes of optimization
problems. Second part contains brief product 
descriptions from software vendors, individual 
researchers. AO 


Optimization, T(18: 2), L. Optima and Equi- 
libria: An Introduction to Nonlinear Analy- 
sis. Jean-Pierre Aubin. Grad. Texts in Math., 
V. 140. Springer-Verlag, 1993, xvi + 417 pp, 
$59. [ISBN 0-387-52121-6] Convex op- 
timization, 2- and n-person noncooperative 
games, cooperative games, nonlinear equations, 
variational inequalities, economic equilibria. 
Many exercises. Fully rigorous. SM 


Mathematical Modeling, S(15-16). Heureka
heute: Kostproben praxiswirksamer Mathematik.
Günter Dewess, et al. BG Teubner
Leipzig, 1993, 150 pp, DM 16,80 (P). [ISBN 
3-8154-2071-7] Intended for a lay audience, 
but assumes a fair amount of mathematics. Each 
chapter treats a problem from real life (e.g., pro- 
ducing a chemical, routing coal trains, boring 
non-round holes). JD-B 


Mathematical Modeling, T(13: 2), L. Game 
Theory and Strategy. Philip D. Straffin. New 
Math. Lib., V. 36. MAA, 1993, x + 244 pp, 
$27.50 (P). [ISBN 0-88385-637-9] In-depth 
treatment of game theory. Assumes only high 
school algebra, but builds mathematical sophis- 
tication and familiarity with mathematical mod- 
eling process. Chapters on theory alternate with 
chapters on applications to anthropology, social 
psychology, economics, politics, business, bi- 
ology, philosophy. Many exercises. KB 


Control Theory, T?(17: 1), P. Direct Adaptive 
Control Algorithms: Theory and Applications. 
Howard Kaufman, Izhak Bar-Kana, Kenneth 
Sobel. Communications & Control Eng. Ser. 
Springer-Verlag, 1994, xxiii + 370 pp, $69. 
[ISBN 0-387-94155-X] Theory and practice 
of adaptive control algorithms for multiple in- 
put/output control systems with parameter un- 
certainty. Applications include flexible struc- 
ture control, drug infusion, robotics. RM 


Control Theory, P. Representation and Con- 
trol of Infinite Dimensional Systems, Volume II. 
Alain Bensoussan, et al. Systems & Control: 
Found. & Applic. Birkhäuser, 1993, xviii +
343 pp, $79.50. [ISBN 0-8176-3642-0] The- 
ory of quadratic cost optimal control for a large 
class of infinite-dimensional systems. JO

Control Theory, P. Lecture Notes in Control
and Information Sciences-189: Non-Identifier-
Based High-Gain Adaptive Control. Achim
Ilchmann. Springer-Verlag, 1993, x + 204 pp,
$54 (P). [ISBN 0-387-19845-8] 

Probability, P. Ten Lectures on the Proba-
bilistic Method, Second Edition. Joel Spencer.
CBMS-NSF Reg. Conf. Ser. in Appl. Math.,
V. 64. SIAM, 1994, vi + 88 pp, $18.50 (P).
[ISBN 0-89871-325-0] Based on lectures
given at 1986 CBMS-NSF conference held at 
Fort Lewis College. This edition contains new 
results, additional material. (First Edition, TR, 
December 1988.) LC 


Mathematical Statistics, P. Efficient and 
Adaptive Estimation for Semiparametric Mod- 
els. Peter J. Bickel, et al. Ser. in Math. Sci. 
Johns Hopkins Univ Pr, 1993, xix + 560 pp, $95. 
[ISBN 0-8018-4541-6] Asymptotic inference 
for finite-dimensional parametric models, with 
extensions to semiparametric models. Infor- 
mation bounds for infinite-dimensional param- 
eters, construction of estimates. Limited results 
on general methods for constructing asymptot- 
ically efficient estimates. RWJ 


Statistical Methods, P. Statistical Uncer- 
tainties in Posterior Probabilities. A.W. Am- 
bergen. CWI Tract, V. 93. Centrum voor 
Wiskunde en Informatica, 1993, 129 pp, Dfl. 
40 (P). [ISBN 90-6196-422-9] 

Programming, P. Mastering C Pointers: 
Tools for Programming Power, Second Edition. 
Robert J. Traister. Academic Pr, 1993, xii + 
163 pp, (P), with disk. [ISBN 0-12-697409-8] 


Languages, S(17), P, L. The High Perfor- 
mance Fortran Handbook. Charles H. Koelbel, 
et al. Scientific & Eng. Comput. MIT Pr, 1994,
xiv + 329 pp, $24.95 (P); $45. [ISBN 0-262- 
61094-9; 0-262-11185-3] High Performance 
Fortran, an extension of Fortran 90, is a portable 
programming language that allows easy access 
to vector and massively parallel multiproces- 
sors. Book presents HPF in tutorial form. SM 


Computer Systems, P. Learning Perl. Randal
L. Schwartz. O’Reilly & Assoc, 1993, xxv +
246 pp, $24.95 (P). [ISBN 1-56592-042-2] 


Computer Systems, S**(15), C, L. Solving
Problems in Scientific Computing Using Maple
and MATLAB. Walter Gander, Jiří Hřebíček.
Springer-Verlag, 1993, xiii + 268 pp, $39 (P). 
[ISBN 0-387-57329-1] 19 nifty examples of 
high-level classroom applications of Maple and 
MATLAB: trajectories of tennis balls, orbits in 
the three-body problem, smoothing filters, con- 
formal mappings, heat flow, and the compres- 
sion of a metal disk. An excellent reference on 
undergraduate mathematical computing. MPR 


Computer Science, P. Statistical Analysis of 
Software Reliability Models. M.C.J. van Pul. 
CWI Tract, V. 95. Centrum voor Wiskunde en 
Informatica, 1993, 186 pp, Dfl. 50 (P). [ISBN 
90-6196-425-3] 

Applications (Biological Science), P. Moment 
Problems in Hilbert Space with Applications 
to Magnetic Resonance Imaging. M. Zwaan.
CWI Tracts, V. 89. Centrum voor Wiskunde en 
Informatica, 1993, xv + 136 pp, Dfl. 40 (P). 
[ISBN 90-6196-418-0] 

Applications (Physics), T(15-16: 1, 2), S, L.
Spacetime Without Reference Frames. Tamás
Matolcsi. Akadémiai Kiadó, 1993, 411 pp,
$50. [ISBN 963-05-6433-5] This ambitious 
book proceeds quickly from easy introduc- 
tory generalities to sophisticated mathematical 
physics. Underlying mathematical concepts 
(tensor analysis, manifolds, and Lie groups) oc- 
cupy last 100 pages. MU 


Applications (Physics), S, P. Quantum Inverse
Scattering Method and Correlation Functions. 
V.E. Korepin, N.M. Bogoliubov, A.G. Izergin. 
Mono. on Math. Physics. Cambridge Univ Pr, 
1993, xix + 555 pp, $100. [ISBN 0-521-37320-
4] An important, carefully-crafted text, in 4
parts: examination of the Bethe ansatz and cal- 
culation of physical quantities; theory of the 
quantum inverse scattering; third and fourth 
sections apply preceding work to calculation 
of correlation functions. MU 

Applications (Physics), S(15-17), P, L. The 
Philosophy Behind Physics. Thomas Brody.
Eds: Luis de la Peña, Peter E. Hodg-
son. Springer-Verlag, 1993, xii + 355 pp,
$59. [ISBN 0-387-55914-0] This master-
piece should be much appreciated by physicists,
but of less value to philosophers. Author’s tech-
nical expertise is obvious throughout; thought-
provoking text is virtually devoid of esoteric
philosophical vocabulary, but rich in precise ex- 
amples, lucid discussion, and analysis. MU 


Applications (Simulation), P. Algebraic Spec- 
ification of Communication Protocols. Eds: S. 
Mauw, G.J. Veltink. Tracts in Theoret. Comp. 
Sci., V. 36. Cambridge Univ Pr, 1993, xi + 
197 pp, $39.95. [ISBN 0-521-41883-6] Sur- 
veys Process Specification Formalism (PSF), a 
formal method for specifying and testing un- 
ambiguous mathematical models. Chapters 
range from tutorial on PSF through research 
topics. RM 


Reviewers 


KB: Karla Ballman, Macalester; LC: Laura Chihara, 
St. Olaf; HD: Hung Dinh, Macalester; JD-B: John Dyer- 
Bennet, Carleton; SG: Steven Galovich, Carleton; TH: 
Tom Halverson, Macalester; RWJ: Roger W. Johnson, Car- 
leton; SK: Steve Kennedy, St. Olaf; LCL: Loren C. Larson, 
St. Olaf; SM: Steve McKelvey, St. Olaf; RM: Richard Mol- 
nar, Macalester; JO: Jeff Ondich, Carleton; AO: Arnold 
Ostebee, St. Olaf; DP: David Peifer, St. Olaf; MLR: Mar- 
garet L. Reese, St. Olaf; MPR: Matthew P. Richey, St. Olaf; 
AWR: A. Wayne Roberts, Macalester; KS: Karen Saxe, 
Macalester; JS: John Schue, Macalester; MU: Milton Ul-
mer, Carleton; MW: Martha Wallace, St. Olaf; PZ: Paul
Zorn, St. Olaf. 
No matter how you
express it, it still means
DERIVE is half price.


The DERIVE A Mathema- 
tical Assistant program lets 
you express yourself symbol- 
ically, numerically and 
graphically, from algebra 
through calculus, with vec- 
tors and matrices too—all 
displayed with accepted 
math notation, or 2D and 3D 
plotting. DERIVE is also easy 
to use and easy to read, 
thanks to a friendly, menu-
driven interface and split or
overlay windows that can
display both algebra and
plotting simultaneously.
Better still, DERIVE has been
praised for the accuracy and
exactness of its solutions.
But, best of all, the suggested
retail price is now only $125.
Which means DERIVE is now
half price, no matter how you
express it.


DERIVE is a
registered trademark of
Soft Warehouse, Inc.


Soft Warehouse


HONOLULU • HAWAII


System requirements 


DERIVE: MS-DOS 2.1 or 
later, 512K RAM, and one 3½"
disk drive. Suggested retail 
price now $125 (Half off!). 


DERIVE ROM card: 

Hewlett Packard 95LX & 
100LX Palmtop, or other PC 
compatible ROM card 
computer. Suggested retail 
price now $125! 


DERIVE XM 

(eXtended Memory): 

386 or 486 PC compatible 
with at least 2MB of extend-
ed memory. Suggested list
price now $250! 


Soft Warehouse, Inc. • 3660 Waialae Ave.
Ste. 304 • Honolulu, HI, USA 96816-3236
Ph: (808) 734-5801 • Fax: (808) 735-1105


Research Issues in
Undergraduate
Mathematics Learning
Preliminary Analyses and Reports 


James J. Kaput and 
Ed Dubinsky, Editors 


Research in undergraduate mathematics edu- 
cation is important for all college and university 
mathematicians. If our students are to be more 
successful in understanding mathematics, then 
college faculty need to understand how math- 
ematics is learned. This knowledge can guide us 
in curriculum reform and in improving our own 
teaching. It can help us make mathematics 
accessible to all students and it can increase the 
number of graduate students in mathematics. 

This volume of research in undergraduate 
mathematics education informs us about the 
nature of student learning in some of the most 
important topics in the undergraduate curricu- 
lum: sets, functions, calculus, statistics, abstract 
algebra and problem solving. Paying careful 
attention to the trouble students have in learn- 
ing mathematics will help us to work with stu- 
dents so they can deal with those difficulties. 

A survey of the literature begins the volume. 
Becker and Pence have brought together an 
unusually complete list of references on research 
in collegiate mathematics. Their comments will 
guide those attempting to begin or to continue 
a program of research in student learning. 

The sad fact that even good calculus students 
stumble over nonroutine problems is the theme 
of Selden, Selden, and Mason. Their conclu- 
sions point to significant shortcomings in the 
curriculum. This study of student difficulties is 
continued by Ferrini-Mundy and Graham who
investigate a single student’s interactions with 
the fundamental concepts of the calculus. 
Baxter studies a group of students to learn how 
they acquire the concept of set, while Cuoco 
does the same for the concept of function. 

Cooperative learning does help the student. 
That is the conclusion of Bonsangue, who inves- 
tigates how two carefully matched classes of 
students in a statistics course perform on exams.
How students learn to write proofs in group 
theory is the subject considered by Hart. 
Rosamond breaks new ground by comparing 
how emotions vary in their effect on the prob- 
lem solving ability of novices and experts. 

All college faculty should read this book to 
find how they can help their students learn 
mathematics. 


150 pp., Paperbound, 1994
ISBN 0-88385-090-7 

List: $24.00 

Catalog Number NTE-33 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-(800) 331-1622 Fax (202) 265-2384 


Total $
Payment ☐ Check ☐ VISA ☐ MASTERCARD
Credit Card No.
Signature
Exp. Date


Essential Mathematics from Cambridge 


Probability Theory, 
an Analytic View 
Daniel W. Stroock 


Presents useful material for students who, 
even if they do not intend to devote a 
major portion of their careers to the study 
of probability theory, want to know what 
they are missing if they do not. 

1994 527 pp. 43123-9 Hardback $49.95 


How To Prove It 

A Structured Approach 

Daniel J. Velleman 

Prepares students to make the transition 
from solving problems to proving theorems 
by teaching the techniques needed to read 
and write proofs. 


1994 304pp. 44116-1 Hardback $49.95 
44663-5 Paperback $19.95 


Elementary Probability 
David Stirzaker 

Illustrates the wide range and power of the 
subject by including conditional probability, 
independence, random variables, generating 
functions, and an introduction to Markov 
chains. 


1994 416pp. 42028-8 Hardback $64.95 
42183-7 Paperback $24.95 


Representations and 


Characters of Groups 
Gordon James 
and Martin Liebeck 


Provides a modern introduction to the 

representation theory of finite groups. 

1993 429pp. 44024-6 Hardback $69.95
44590-6 Paperback $29.95 


Basic Abstract Algebra 
Second Edition 
P. B. Bhattacharya, S. K. Jain, 
and S. R. Nagpaul 
“...a thorough and surprisingly clean-
cut survey of the group/ring/field troika
which manages to convey the idea of 
algebra as a unified enterprise.” 

—Ian Stewart, New Scientist


1994 416pp. 46081-6 Hardback $69.95 
46629-6 Paperback $29.95 


Categories for Types 

Roy L. Crole 

Explains the basic principles of categorical 

type theory and the techniques used to 

derive categorical semantics for specific 

type theories. 

1994 352pp. 45092-6 Hardback $59.95 
45701-7 Paperback $27.95 


Computational 
Geometry in C 
Joseph O'Rourke 


Covers the basic techniques used in compu- 

tational geometry — polygon triangulations, 

convex hulls, Voronoi diagrams, and 

arrangements. 

1994 327pp. 44034-3 Hardback $59.95 
44592-2 Paperback $24.95 


Now in paperback... 


Numbers and Functions 
Steps to Analysis 
R. P. Burn


“... invaluable as a teaching resource,

or for independent study with gifted 

students.” —The American 
Mathematical Monthly

346 pp. 45773-4 Paperback $24.95 


The Banach-Tarski Paradox 
Stan Wagon 


“... packed with fascinating and beauti-
ful results.” —R. J. Gardner, Bulletin of
the London Mathematical Society 


“... this beautiful book is written with 
care and is certainly worth reading.” 
—Wlodzimierz Bzyl, 
Mathematical Reviews 


271 pp. 45704-1 Paperback $24.95 


Available in bookstores or from 


CAMBRIDGE 


UNIVERSITY PRESS 


40 West 20th St., N.Y., NY 10011-4211 
Call toll-free 800-872-7423 
MasterCard/VISA accepted. 

Prices subject to change. 


NEW IN THE SPECTRUM SERIES 


Complex Numbers and Geometry 


Liang-shin Hahn 


The purpose of this book is to demonstrate that 
complex numbers and geometry can be blended 
together beautifully, resulting in easy proofs and 
natural generalizations of many theorems in plane 
geometry—such as the Napoleon theorem, the 
Ptolemy—Euler theorem, the Simson theorem, 
and the Morley theorem. 


Beginning with a construction of complex num- 
bers, readers are taken on a 140-page guided tour 
that includes something for everyone, even those 
with advanced degrees in mathematics. Yet, the 
entire book is accessible to a talented high-school 
student. 


The book is self-contained—no background in 
complex numbers is assumed—and can be cov- 
ered at a leisurely pace in a one-semester Course. 
Many of the chapters can be read independently. 
Over 100 exercises are included. The book would 
be suitable as a text for a geometry course, or for 
a problem solving seminar, or as enrichment for 
the student who wants to know more. 


200 pp., Paperbound, 1994 
ISBN 0-88385-510-0 

List: $25.50 MAA Member: $19.50 
Catalog Number CNGE 


Table of Contents
1. Complex Numbers 


Introduction to Imaginary Numbers; Definition 
of Complex Numbers; Quadratic Equations; Sig- 
nificance of the Complex Numbers; Order Rela- 
tion in the Complex Field; The Triangle Inequal- 
ity; The Complex Plane; Polar Representation of 
Complex Numbers; The nth Roots of 1; The 
Exponential Functions; Exercises 


2. Applications to Geometry 


Triangles; The Ptolemy—Euler Theorem; The 
Clifford Theorems; The Nine-Point Circle; The 
Simson Line; Generalizations of the Simson 
Theorem; The Cantor Theorems; The Feuerbach 
Theorem; The Morley Theorem; Exercises 


3. Möbius Transformations


Stereographic Projection; Möbius Transforma-
tions; Cross Ratios; The Symmetry Principle; A
Pair of Circles; Pencils of Circles; Fixed Points
and the Classification of Möbius Transforma-
tions; Inversions; The Poincaré Model of a Non-
Euclidean Geometry; Exercises
Knot Theory


Charles Livingston 


I learned more about knots after an hour with the book
than I thought I could, and I am glad that it is here on
my desk so that I may spend more time with it and, I 
hope, learn more. —Paul Halmos 


Knot Theory, a lively exposition of the math- 
ematics of knotting, will appeal to a diverse 
audience from the undergraduate seeking ex- 
perience outside the traditional range of studies 
to mathematicians wanting a leisurely introduc- 
tion to the subject. Graduate students begin- 
ning a program of advanced study will find a 
worthwhile overview, and the reader will need 
no training beyond linear algebra to under- 
stand the mathematics presented. 

Over the last century, knot theory has pro- 
gressed from a study based largely on intuition 
and conjecture into one of the most active areas 
of mathematical investigation. Knot Theory 
illustrates the foundations of knotting as well as 
the remarkable breadth of techniques it em- 
ploys—combinatorial, algebraic, and geomet- 
ric. 

The interplay between topology and algebra, 
known as algebraic topology, arises early in the 
book, when tools from linear algebra and from 
basic group theory are introduced to study the 
properties of knots, including the unknotting 
number, the braid index, and the bridge num- 
ber. Livingston guides you through a general 
survey of the topic showing how to use the 
techniques of linear algebra to address some 
sophisticated problems, including one of math- 
ematics’ most beautiful topics, symmetry. The 
book closes with a discussion of high-dimen- 
sional knot theory and a presentation of some 
of the recent advances in the subject—the
Conway, Jones and Kauffman polynomials. A 
supplementary section presents the fundamen- 
tal group, which is a centerpiece of algebraic 
topology. 

An extensive collection of exercises is included. 
Some problems focus on details of the subject 
matter; others introduce new examples and 
topics illustrating both the wide range of knot 
theory and the present borders of our under- 
standing of knotting. All are designed to offer 
the reader the experience and pleasure of work-
ing in this fascinating area.


264 pp., Hardbound, 1993 
ISBN 0-88385-027-3 


List: $31.50 MAA Member: $25.00 
Catalog Number CAM-24 


ORDER FROM: 


The Mathematical Association of America 
In Eves’ Circles


Joby Milo Anthony, Editor 


Howard Eves celebrated his eightieth birth- 
day in 1991. To honor that occasion, the 
University of Central Florida sponsored a con-
ference that focused on the lifelong interests
of this prominent American mathematician,
namely, the history of mathematics, the teach-
ing of mathematics, and geometry. Eves is
well-known for his contributions to all three 
areas. 


The conference was unique. Conference 
participants included pre-college mathemat- 
ics teachers, community college and univer- 
sity teachers, and research mathematicians. 
Papers were delivered in sessions devoted to 
the classroom teacher, to the history of math- 
ematics, and to pedagogical and research 
interests in geometry. Many lectures com- 
bined these subjects. This book presents 
some of those lectures. Anyone involved with 
teaching or producing mathematics can find 
something in this volume that will be interest- 
ing to them. 


Some of these papers are specifically for the 
classroom teacher. They discuss a use of 
technology, or the organization of a class for 
some specific purpose. Other articles will 
provide teachers with examples of 
mathematical problems or historical episodes 
that can be used as part of a mathematics 
class. Still other papers deal with the 
philosophy of mathematics education. There
are articles in this volume that present new 
insights into some of the history of 
mathematics, and there are other articles that 
deal with some new results in geometry. This 
is really the legacy of Howard Eves. He has 
been both a mentor for teachers at every 
level, and a colleague of research 
mathematicians. And so some part of this 
volume will be appropriate for anyone 
interested in mathematics. 


Also included in this volume is a penetrating 
interview with Eves. 


220 pp., 1994, Paperbound 
The Search for E.T. Bell


also known as John Taine 


Constance Reid 


No one today writes about mathematics and mathemati-
cians with more grace, knowledge, skill, and clarity, and
no one is going to produce a more delightful, informative,
accurate account of Eric Temple Bell and his work, and
that of his alter-ego, the prolific pioneer of science fiction,
John Taine. This is a fine book. —Martin Gardner


Eric Temple Bell has been one of my heroes for 60 years...I
congratulate Constance Reid on a remarkable achieve-
ment. I hope it is greeted with the success it deserves, and
revives interest in an extraordinary and multi-talented 
man. —A. C. Clarke 


Eric Temple Bell (1883-1960) was a distinguished 
mathematician and a best selling popularizer of 
mathematics. His Men of Mathematics, still in print 
after almost sixty years, inspired scores of young 
readers to become mathematicians. Under the 
name “John Taine,” he also published science 
fiction novels (among them The Time Stream, Before 
the Dawn, and The Crystal Horde) that served to 
broaden the subject matter of that genre during its 
early years. 

In The Search for E.T. Bell, Constance Reid has 
given us a compelling account of this complicated,
difficult man who never divulged to anyone, not 
even to his wife and son, the story of his early life 
and family background. Her book is thus more of 
a mystery than a traditional biography. It begins 
with the discovery of an unexpected inscription in 
an English churchyard and a series of cryptic 
notations in a boy’s schoolbook. Then comes an 
inadvertent revelation, by Bell himself, in a re- 
spected mathematical journal...You will have to 
read the book to learn the rest. 
Originally agreeing to write only a profile of
Bell, Mrs. Reid soon found herself involved in a 
full-length biography. The discoveries she made in 
the course of her five years of research will neces- 
sitate a fresh evaluation of his extensive math- 
ematical work and his science fiction novels as well 
as the revision of almost every statement currently 
in print about his family background and early life. 
Mrs. Reid is already well known as the author of 
acclaimed biographies of David Hilbert, Richard 
Courant, and Jerzy Neyman. 


Includes a collection of over 75 photographs. 
EXCURSIONS IN CALCULUS:
an Interplay of the Continuous
and the Discrete


Robert M. Young 


An excellent source of projects for well motivated 
students. This list of 463 references is a valuable 
aid for those who wish to dig deeper. —CHOICE


The presentation is clear and the topics very 
interesting...fully accessible to students for whom 
the book is intended. The book will be influential in 
awakening students' awareness for good classi- 
cal mathematics. —Paulo Ribenboim 


Printed with eight full-color plates. 


The purpose of this book is to explore, within the 
context of elementary calculus, the rich and el- 
egant interplay that exists between the two main 
currents of mathematics, the continuous and the 
discrete. Such fundamental notions in discrete 
mathematics as induction, recursion, combinato- 
rics, number theory, discrete probability, and the 
algorithmic point of view as a unifying principle are 
continually explored as they interact with tradi- 
tional calculus. The interaction enriches both. 


The book is addressed primarily to well-trained 
calculus students and their teachers, but it can 
serve as a supplement in a traditional calculus 
course for anyone who wants to see more. 
Dolciani Mathematical Expositions Number 1


THE MATHEMATICAL ASSOCIATION OF AMERICA


CONTENTS: 


• Infinite Ascent, Infinite Descent: The Principle
of Mathematical Induction

• Patterns, Polynomials, and Primes: Three
Applications of the Binomial Theorem

• Fibonacci Numbers: Function and Form

• On the Average

• Approximation: from Pi to the Prime Number
Theorem

• Infinite Sums: A Potpourri


The problems, taken for the most part from 
probability, analysis and number theory, are an 
integral part of the text. Many point the reader 
toward further excursions. There are over 400 
problems presented in this book. 
Visualization in Teaching and
Learning Mathematics 


Walter Zimmermann and 
Steve Cunningham, Editors 


Buy this book. If you can't buy it, have the library order
it. If the library won't order it, ask to borrow a copy from
a friend. But do read this book.

—The Mathematics Teacher 


High school, community college, and university teach- 
ers who use or are interested in using graphics to teach 
calculus, deductive reasoning, functions, geometry, or 
statistics will find valuable ideas for teaching... A 
must for every college or university library with a
mathematics department. —CHOICE


The twenty papers in this book give an overview 
of research, analysis, practical experience, and 
informed opinion about the role of visualization 
in teaching and learning mathematics, espe- 
cially at the undergraduate level. Visualization 
in its broadest sense is as old as mathematics, but
progress in computer graphics has generated a
renaissance of interest in visual representations 
and visual thinking in mathematics. 
Two New Derive Books Now Available


Numerical Analysis Via Derive 
Steven Schonefeld 


This book is written as a primary text for an 
introductory course in numerical analysis. For 
convenience, a disk is included with a number 
of the functions that are defined. The text is in- 
tended to permit the exploration of many differ- 
ent numerical algorithms with only a minimum 
of technical effort. In order to use this book, all 
you need is a computer and a copy of Derive 
(version 2.09, or higher). The computer disk that 
comes with this book contains all the Derive 
procedures used in the book. You do not need to 
be a computer programmer to work the exer- 
cises. Simply transfer the appropriate files into 
Derive and start using it. (529 pages, Paperbound)
List $44.95 MAA member ($40.45) 


Differential Equations With Derive 
David C. Arney 


(for Derive Version 2.5 and above) 

This book is designed to show how to use 
Derive to help solve problems in Differential 
Equations and related subjects. It is a compan- 
ion to any of the textbooks used in differential 
equations courses or related subject courses 
(i.e., engineering mathematics, applied math- 
ematics, dynamical systems). Topics include: 
First Order Differential Equations, Numerical 
Methods and Difference Equations, Second and 
Higher Order Equations, Matrix Algebra and 
Systems of Equations, and Partial Differential 
Equations. (284 pages, Paperbound)

List Price $22.95 MAA member ($20.65) 


Order From 
MathWare, 604 E. Mumford Dr., Urbana, IL 61801
(800) 255-2468 / (217) 384-3196 or Fax (217) 384-7043


Game Theory and Strategy 


Philip D. Straffin, Jr. 


This valuable addition to the New Mathematical 
Library series pays careful attention to applica- 
tions of game theory in a wide variety of disci- 
plines. The applications are treated in consider- 
able depth. The book assumes only high school 
algebra, yet gently builds to mathematical think- 
ing of some sophistication. Game Theory and 
Strategy might serve as an introduction to both 
axiomatic mathematical thinking and the funda- 
mental process of mathematical modelling. It 
gives insight into both the nature of pure math- 
ematics, and the way in which mathematics can be 
applied to real problems. 

Since its creation by John von Neumann and 
Oskar Morgenstern in 1944, game theory has 
contributed new insights to business, politics, eco- 
nomics, social psychology, philosophy, and evolu- 
tionary biology. In this book, the fundamental 
ideas of game theory share the stage with applica- 
tions of the theory. How might strategic business 
decisions depend on information about a rival 
company, and how much would such information 
be worth? When is it advantageous to vote for a 
candidate who is not your favorite? What are the 
optimal strategies for teams in the football draft, 
and what paradoxes can result from following 
those strategies? What is a fair way to share the
costs of a development project? What can we learn
about the problem of “free will” by imagining 
playing a game with an omnipotent Being? How 
might natural selection lead to altruistic behavior 
in animal species? Game theory gives insight into 
all of these questions. 

The book includes many exercises, with answers, 
which allow the reader to try out calculations, and 
explore alternative formulations of game-theo- 
retic ideas. 
A Century of Mathematics
Through the Eyes of the Monthly 


John Ewing, Editor 


This is the story of American mathematics dur- 
ing the past century. It contains articles and 
excerpts from a century of the Monthly, giving 
the reader an opportunity to skim all one
hundred volumes without actually opening 
them. It samples mathematics year by year and 
decade by decade. Along the way, readers can 
glimpse the mathematical community at the 
turn of the century, and the divisions between 
the mathematical communities of teachers and 
researchers. They read about the struggle to 
prevent colleges from eliminating mathematics 
requirements in the 1920's, the controversy 
about Einstein and relativity, the debates about 
formalism in logic, the immigration of math- 
ematicians from Europe, and the frantic effort 
to organize as the war began. At the end of the 
war, they hear about new divisions between pure 
and applied mathematics, heroic efforts to deal 
with large numbers of new students in the uni- 
versities, and the rise of federal funding for 
mathematics. In more recent times, they see
the advent of computers and computer science, 
the problems faced by women and minorities, 
and some of the triumphs of modern research. 


This is a book about mathematics—about teach-
ing and research, applied and pure, elite univer-
sities and community colleges. Browsing through
its pages, readers see what has changed (the 
kinds of mathematics in fashion, for example) 
and what has stayed the same (our concern 
about teaching and our complaints about the 
deplorable state of our students). 
This is a book about history—a sampling of
history, meant to be savored rather than stud- 
ied. For one hundred years, the Monthly has 
contained articles by some of the greatest math- 
ematicians in the world, as well as articles by 
students and faculty from small midwestern col- 
leges where those great names were barely 
known. This book gives a glimpse of both 
worlds. It tells a story rather than the details of 
history. 


This is the story of a century of mathematics in 
America. 


335 pp., Hardbound, 1994 
NEW IN THE SPECTRUM SERIES


Cryptology 


Albrecht Beutelspacher 


FR BRX XQFHUVWCQG WKLV? If you 
can’t decipher this coded message, you must 
read this book!


How can messages be transmitted secretly? 
How can one guarantee that the message ar-
rives safely in the right hands exactly as it was
transmitted? Cryptology—the art and science 
of “secret writing” —provides ideal methods to 
solve these problems of data security. 


Technology advances have stimulated interest 
in the study of cryptology. Of course, comput- 
ers can break cryptosystems much more effi- 
ciently than humans can. Computers allow 
complex and sophisticated mathematical tech- 
niques which achieve a degree of security un- 
dreamt of by previous generations. Today the 
applications of cryptology range from the en- 
cryption of television programs sent via satel- 
lite, to user authentication of computers, to new 
forms of electronic payment systems using 
smart cards. 


The first half of the book studies and analyzes 
classical cryptosystems. Here we find Caesar’s 
cipher, the Spartan scytale, the Vigenère cipher, 
and more. The theory of cipher systems 
is presented, including a description of the best 
possible cipher, the one-time pad. An intro- 
duction to linear shift registers, which serve as 
building blocks for most presently used ciphers, 
is also given. 


The second half of the book looks at the exciting 
new directions of public-key cryptology, which, 
since its invention in 1976, has revolutionized 
data security. The author also looks at 
the famous RSA-algorithm, algorithms based 
on “discrete logarithms,” the so-called zero- 
knowledge algorithms, and the smart cards that 
bring cryptographic services to the man-on- 
the-street. 


Although the mathematics covered is nontrivial, 
the book is fun to read, and the author presents 
the material clearly and simply. Many exer- 
cises and references accompany each chapter. 
The book will appeal to a wide audience includ- 
ing teachers, students, and the interested lay- 
man. 


Cryptology was originally published in Ger- 
man by Vieweg. This edition has been exten- 
sively revised. 


176 pp., Paperbound, 1994 
FROM MARTIN GARDNER 


Mathematical Magic Show 


...We visit most of the prime sites of recreational 
mathematics: game theory, factorials, puzzles, 
playing cards, finger arithmetic, Möbius bands, 
polyominoes, perfect numbers, the knight’s tour, 
trees, and dice. Gardner always has new facts 
and ideas to add interest to even the most well- 
trodden areas. —Times Literary Supplement 


312 pp., Paperbound, 1990 

ISBN 0-88385-449-X 

List: $19.50 MAA Member: $16.50 
Catalog Number MAGIC 


Mathematical Carnival 


His startling gift for bringing the sublime to the 
people is unabated. Once again, hard math- 
ematical ideas are conveyed with fluency, charm, 
and utter clarity. As a philosopher, I warmly salute 
his judgement of when to leave a metaphysical 
question enticingly open. His craftsmanship re- 
mains exquisite. — New Scientist 


320 pp., Paperbound, 1988 

ISBN 0-88385-448-1 

List: $18.00 MAA Member: $15.00 
Catalog Number MCR 
Riddles of the Sphinx and Other 
Mathematical Puzzle Tales 


This book charms, informs, inspires, puzzles, 
and delights, and the reader can dip in almost 
anywhere and get hooked by the natural lucidity 
of style and the friendly tone which are so charac- 
teristic of Martin Gardner. 

— Mathematical Spectrum 


184 pp., Paperbound, 1987 

ISBN 0-88385-632-8 

List: $16.00 MAA Member: $13.00 
Catalog Number NML-32 


Mathematical Circus 


A circus suggests fun and enjoyment and there is 
plenty of both to be found here. The book should 
certainly be in the school library. It will also be 
a valuable resource for the teacher. 

—The Mathematical Gazette 
NOTICE TO AUTHORS 


The Monthly publishes articles, notes, and other fea- 
tures about mathematics and the profession. The 
readership of the Monthly is intended to include ev- 
erybody who is mathematically inclined, including of 
course professional mathematicians and students of 
mathematics at all collegiate levels. While no single 
article or feature is likely to appeal to everyone, mate- 
rial should interest and be accessible to a large num- 
ber of readers. This is the most important criterion for 
acceptance. 


Articles may be expositions of old results or presentations 
of new ones. They may concern all of mathematics 
or one small area, a broad development or a 
single application, historical reminiscences or one 
important event. While some articles may contain the 
author's new research, the novelty of material and 
generality of the results is far less important than the 
clarity of exposition and general interest. Discussing 
one illuminating case of a well known result is far 
better than providing all the details of an obscure but 
new proposition. Articles in the Monthly are sup- 
posed to inform and to entertain; they are meant to 
be read rather than archived. 


Notes are short and possibly informal articles. A note 
may concern a clever new proof of an old theorem, a 
novel way to present tired material, or a lively discus- 
sion of a philosophical (but still mathematical) issue. 
Also, any topic is suitable, so long as it is related to 
mathematics. Because a note is short, the first few 
sentences are the most important part: They should 
explain the purpose and invite the reader in. Pho- 
tographs or diagrams often will attract the reader's 
attention. 


All articles and notes should be sent to the editor: 


JOHN EWING 

Department of Mathematics 
Indiana University 
Bloomington, IN 47405 


Please send 3 copies, typewritten on only one side of 
the paper. Illustrations should be carefully drawn on 
separate sheets of paper in black ink; the original 
should be without lettering and two copies should 
have appropriate captions and lettering indicated. 


Proposed problems or solutions should be sent to: 


RICHARD BUMBY, 
P.O. Box 10971 
New Brunswick, NJ 08906-0971. 


Please send 2 copies of all material, typewritten if 
possible. 


Letters to the Editor, both for publication and for 
private reading, should be sent to the Editor at the 
address given above. Comments, including criti- 
cisms, are welcome, as are all suggestions for mak- 
ing the Monthly a lively, entertaining, and informative 
journal. 
Comment 


See Response on page 809 


The recent report Recognition and rewards in the mathematical sciences is a strange and 
pedantic document. It comes from a committee of the JPBM (The Joint Policy Board for 
Mathematics). It is gussied up in slick paper and shaded bar graphs. (Evidently, readers were 
not expected to read numbers or percents.) It ignores the long standing standards of the 
American mathematical community. These standards read simply: Do good mathematics. 
This has meant good teaching (tell the students when you use the Mean Value Theorem), 
good research (proofs that are complete), good guidance (help troubled students to 
understand math), and good judgment, both of colleagues and of research results. 

The report ignores these powerful traditions and instead follows insignificant ideas 
from Ernie Boyer. Boyer wants to "reconsider" scholarship, so this report has an elaborate 
and pedantic discussion of a definition (sic) of "mathematical scholarship". Each department 
is urged to provide its own definition, in keeping with its so-called "mission" (a presently 
popular buzz-word). 

The JPBM committee conducted a survey. The report does not tell us what 
questions were asked; since the exact form of the questions is vital, this omission is 
irresponsible. For example (p. 4) "with regard to research, we asked ... how it was evaluated" 
— not whether it was good mathematics or accurate evaluation. 

Questions were asked separately of chairs (sic) and faculty. One question may have 
been "How important is classroom teaching in determining merit salary raises" (p. 10), but 
figure 13 has "percent responding that salary reflects differences between excellent and 
average teaching". What question did they really ask? The committee professes surprise at 
"The consistent difference in the perception of the chairs and the faculty. Figure 13 shows 
that a much higher percentage of chairs than faculty believed differences in teaching 
effectiveness were truly reflected in salary. Why the surprise? Chair people often help set 
salary, and faculty can be jealous. And did the question really use the word "Truly"? 

The committee discussion of mathematical research (p. 28) is wholly inadequate; 
no Fermat, no quantum fields, no motives. One original member of the committee resigned. 

The committee claims (p. 7) a "consensus" that "faculty members who change 
emphasis during their career (e.g., from research to education)... should be allowed to do so 
without being penalized by the rewards system." This is hopelessly vague. What education; 
a poor text book? Was it good research? What type of university? Is there really a "rewards 
system"? 

The report favors, with little or no evidence, the institution of "periodic review of 
faculty members". Does the background of the report involve some "Hidden Agenda"? 
There is reference to a 1990 book by Boyer and to publications by Robert M. Diamond. He 
is Assistant Vice-Chancellor at Syracuse University, and he has organized in the Adirondacks 
an interdisciplinary meeting on "faculty rewards" (cf Chronicle of Higher Ed., May 11, 
1994). I recently heard Dr. Diamond talk at a symposium. He opposed tenure and "wished 
to define priorities" and considered what faculty members are doing in their field. Do we 
wish Chancellors, vice or not, to direct our research? 

This JPBM report is a striking example of the current practice of selected pundits 
telling the rest of us what we should do. It was prepared by a small and sometimes secretive 
group (as in an incomplete presentation at Cincinnati). It is an assault on the standards of our 
community. 

Saunders Mac Lane 
Response 


See Comment on page 714 


The commentary of Saunders Mac Lane on page 714 raises a number of points 
related to the contents of the Recognition and Rewards report and, as we read it, to the 
appropriateness of engaging in any discussion about how our community has been able to 
encourage its members to engage in the broad spectrum of professional duties that are part 
of the responsibilities of a department or the university in which it is situated. 

Mac Lane has raised some questions as to the specifics of the survey and its 
interpretation. The full data and survey instruments were too extensive to include in the 
report but, as stated on page four of the report, the American Mathematical Society will 
provide them on request. Indeed, we would be pleased to have them examined and discussed 
by many members of the community. 

Why the report and why now? Colleges and universities today teach the daughters 
and sons of a mass society, a far different population than was the case when most of this 
structure was put in place. As educators we are being called upon to do a better job. These 
calls are coming from students, parents, legislators, from academic leaders such as Don 
Kennedy and from members (most of them academics) of a recent President's Council of 
Advisors on Science and Technology (cf. Renewing the Promise: Research Intensive 
Universities and the Nation). As researchers we are being called upon to address issues of 
national importance and perhaps, as Roland Schmitt and the members of PCAST cited above 
suggest, scientists can and should take that responsibility, and that it is possible to do so whilst 
preserving our core values. It is against just such a background that the JPBM charged a 
committee to initiate a dialogue encompassing "differing views about the need for change and 
about how well the system works now". 

The report finds that the work of professors involves much more than research and 
that in all too many cases the selection, evaluation and promotion of faculty relies too much 
on the one dimensional criterion of research, and, in so doing, fails to recognize or encourage 
other contributions necessary in order to meet the mission of the department or the university 
in which it is situated. Mac Lane's apparent scorn for departmental "mission" misses the 
point. The Committee clearly recognizes the diversity of institutions and attendant faculty 
responsibilities. As is clearly stated, the report is mainly addressed to individual departments 
encouraging the faculty to discuss among themselves what it is they are about, what their 
goals are and what they want to achieve. It is simply false that the report suggests or 
recommends that faculty be rewarded for work that lacks quality. Indeed this is a principal 
reason for seeking a statement of scholarship and for the attention paid by the Committee to 
evaluation. As to the work of Boyer and Diamond, it is accepted practice to place work in 
context by referencing previous work. 

Rather than "an assault on the standards of the community" the report is a call to 
engage in discussion of issues that are upon us. It is only through such discussion that we can 
ensure the health and viability of our community. 


Richard H. Herman, Chair JPBM 
Calvin C. Moore, Chair, JPBM Committee on Professional Recognition and Rewards 
Behind the Scenes of a Random 
Dot Stereogram 


Maria S. Terrell and Robert E. Terrell 


What are all those people staring at? What do they hope to see in those finely 
textured posters, beyond the almost random display of little flowers or paint 
splotches? “Look deeper into the picture.” “Look beyond the surface.” “Cross 
your eyes.” “Blur your vision.” Not since Rubik’s Cube has a puzzle or game 
engaged and fascinated the public as have the currently faddish 3D illusions on 
display in bookstores and shopping malls across the country. For many the exercise 
ends in frustration, seeing nothing beyond the surface images. Could it be a case of 
mass hypnosis or “The Emperor’s New Clothes”? For those who see the three 
dimensional images pop into view, there is surprise, pleasure, and wonder. Having 
had experiences of both types, we sympathize with those who have decided they 
have better things to do than stare off into space. But in fact knowing how the 
pictures are created, understanding the elementary geometry and optics of stereo- 
graphic vision, makes it much easier to see the images. 

We begin with the simplest example. Stare at the two dots below. If you relax 
your eyes you will probably see four dots. Try to merge the two middle images so 
that you see three dots. Once you’ve done that, you’ve seen your first, not so 
random, random dot stereogram. If after a few minutes you find that you cannot 
make yourself see three dots, you may be one of the 2% of the population which is 
stereoblind. 


Random dot stereograms are the offspring of an unlikely marriage of aerial 
reconnaissance and brain research. Late in the 1950’s the experiences of psycholo- 
gist and former radar engineer, Bela Julesz, led him to challenge prevailing 
theories of depth perception [1]. Julesz knew that when aerial photographs of 
camouflaged areas, taken from two different positions, were viewed through a 
stereoscope, tanks and equipment appeared to pop out of the almost random 
patterns. This convinced him that depth perception could occur without monocular 
visual cues. It was a revolutionary idea. He proved that he was right by devising the 
random dot stereogram, a pair of random arrays, one for each eye, which, when 
viewed simultaneously, created a three dimensional image. The pair of random dot 
arrays below are examples of Julesz’s ground breaking tool. Without special 
Random dot stereogram from Julesz, 1971. 


1994] RANDOM DOT STEREOGRAMS 715 


equipment it is fairly difficult to see the 3D image of a diamond hovering above a 
background plane. If you’d like to try, relax your eyes and make three arrays from 
two like you did with the dots. 

In addition to the separate stereo images, Julesz [2] combined the left and right eye 
views by coloring one green and the other red. When viewed through glasses with 
one red and one green lens, the 3D images popped out of the array. It is hard to 
overstate the importance of random dot stereograms as a tool to study one aspect 
of depth perception without having to sort out and account for multiple effects of 
various visual cues. 

In 1983 C. W. Tyler [3] described a way of combining the left and right eye 
information into one picture, called the auto random dot stereogram, which did 
not require the use of special equipment. Rather than superimposing the left and 
right eye views, as Julesz did, Tyler found a way to make one picture which the 
brain can interpret as a 3D image. Although known by various commercial names 
in the marketplace, we refer to these pictures simply as stereograms. We have 
dropped the term “random” because the dots do not have to be random. Also the 
dots do not have to be dots, but can be small circles or other figures. 

Before discussing how the more interesting stereograms are made, consider the 
case of creating a stereogram of a single point. Think of the drawing below as a 
view from above of the eyes, the two dots on the picture, and the single point 
behind the picture. 


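[Figure: view from above of the eyes, the two dots on the picture, and the single point behind the picture.] 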


Since each of the dots is seen by both eyes, there are additional lines of sight 
through them. Each eye has a retinal image of the two dots which the brain can 
interpret as three or four dots at various depths behind the picture plane. While 
there are many places at which the two dots look like four, there’s only one at 
which they look like three. Merging the images of the two dots completely 
determines the position of the perceived dot in the space behind the picture. 


[Figure: lines of sight from the eyes through the two dots in the picture plane.] 
There is another point of intersection, but it determines an image which 
appears to be in front of the stereogram. Since most of the commercial stereograms 
seem to be designed to depict the image behind the plane, we will 
concentrate on that image. However, results like those we derive for the image 
behind the plane can be applied to the image in front as well. 

The next sequence of figures gives an example of how to construct one row of a 
stereogram which will represent the curve shown, starting from an interval of dots. 


[Figure: the curve, the initial interval of dots in the picture plane, and the two eyes.] 


Draw a line from the left eye through one of the dots in the interval and mark 
the point at which that line intersects the curve. Next draw the line of sight from 
this point on the curve to the right eye. Put a new dot at the point at which this 
line intersects the picture plane. 


Repeat this process with the other dots in the original interval. The resulting 
eight dots in the stereogram represent four points on the curve. 
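
The construction step just described can be tried numerically. The following Python sketch is not from the article; it assumes the eyes sit at x = -e/2 and x = e/2, a distance p in front of the picture plane, and that the curve is the graph z = f(x) with f positive (that is, behind the plane). The function name next_dot, the bisection search, and all numbers are illustrative choices.

```python
def next_dot(x0, f, e, p):
    """Given a dot at x0 on the picture plane (z = 0), return the position of
    the new dot: follow the left eye's line of sight through x0 to the curve
    z = f(x), then follow the right eye's line of sight back to the plane.
    Assumes f(x) > 0 everywhere (the curve lies behind the picture plane)."""
    left_eye_x, right_eye_x = -e / 2, e / 2

    def gap(t):
        # Point on the left-eye ray at parameter t (t = 1 is the picture plane).
        x = left_eye_x + t * (x0 - left_eye_x)
        depth = p * (t - 1)          # how far the ray is behind the plane
        return depth - f(x)          # negative while the ray is in front of the curve

    # Bracket the intersection, then bisect.
    lo, hi = 1.0, 2.0
    for _ in range(60):
        if gap(hi) >= 0:
            break
        hi *= 2
    else:
        raise ValueError("line of sight never reaches the curve")
    for _ in range(200):
        mid = (lo + hi) / 2
        if gap(mid) < 0:
            lo = mid
        else:
            hi = mid
    t = (lo + hi) / 2

    # Point on the curve, then the right eye's line of sight back to z = 0.
    curve_x = left_eye_x + t * (x0 - left_eye_x)
    curve_z = p * (t - 1)
    return curve_x + (right_eye_x - curve_x) * curve_z / (curve_z + p)


# Usage with made-up numbers: a flat "curve" 10 units behind the plane.
print(next_dot(0.5, lambda x: 10.0, e=6.0, p=20.0))
```

For a flat curve the new dot is simply the old one shifted by a constant, which is a convenient sanity check on the geometry.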


Since the original four dots also represent four points on the curve as seen by 
the right eye, they determine four new left eye images. 




If the curve were longer, we would continue working to the left and right 
making more intervals of dots. Most stereograms have at least five such intervals in 
each row. The outermost intervals represent points seen by only one eye. 


To create a two dimensional representation of a three dimensional scene, we 
stack rows of dots which were generated as above. If the intervals of initial dots are 
all from the center, they form a vertical strip, from which the rest of the display is 
generated. The initial dots do not have to lie in a vertical strip, but it is a 
convenience. 


Now that you have some idea of the principles involved in generating these 
pictures, try seeing the “Mexican Hat”. We have included a set of guide dots 
which you may or may not find useful. To use them, make the two dots look like 
three. This may help you fixate at a point behind the picture and is more precise 
than the commercial advice to look at your reflection. When you begin to sense 
some depth, relax and look for a surface similar to the one in the figure above. A 
word of caution: don’t expect to see anything if you rotate the stereogram. The 
underlying principle is that your eyes are separated horizontally and that your 
brain is quite accustomed to fusing horizontally disparate retinal images. 
Sometimes when you look at a stereogram, a scene comes into view which is not 
at all what you thought it should be. This does not mean there is something wrong 
with your eyes! You may be seeing the front image that we mentioned earlier. It is 
similar to a reflection of the one behind the plane. To see it, hold a pencil a few 
inches in front of the picture, so that when you’re looking at the stereogram, two 
somewhat transparent pencils have their tips directly under the two guide dots. 
Now fixate on the tip of the pencil, forcing its two images into one. The two dots 
will now look like three, with one directly over the tip of the pencil. As you fix your 
attention on the pencil point, you’ll notice that a 3D image is beginning to emerge 
nearby. It should look roughly like the image below. Once you’ve got it, you can try 
poking the pencil through it! 


In addition to the primary images above, it is possible to perceive something like 
the shapes shown in the next figures at depths farther behind (or reversed and in 
front) of the picture plane. 


The perception of these deeper images is easier to describe after looking at the 
perception of a single image more carefully. In the plane through the eyes and a 
point in the scene, we use similar triangles to find a relation among the distances 
e = eye spacing, s = dot spacing, p = distance from your eyes to the line through the 
dots, and d = distance from the point to the line. 


s/e = d/(p + d) 
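
Solving the similar-triangles relation for d gives d = ps/(e - s), which makes the dependence of perceived depth on dot spacing explicit. The short Python sketch below is an illustration only; the function names and the numbers are not from the article, and it assumes 0 < s < e so that the lines of sight meet behind the picture.

```python
def dot_spacing(d, e, p):
    """Dot spacing s for a point at depth d behind the picture: s/e = d/(p + d)."""
    return e * d / (p + d)


def perceived_depth(s, e, p):
    """Depth d behind the picture implied by dot spacing s: d = p*s/(e - s)."""
    if not 0 < s < e:
        raise ValueError("need 0 < s < e for an image behind the picture")
    return p * s / (e - s)


# Made-up numbers: eyes 6 units apart, picture 20 units away, dots 2 units apart.
print(perceived_depth(2.0, e=6.0, p=20.0))
# Holding s and e fixed and moving the picture farther away increases d.
print(perceived_depth(2.0, e=6.0, p=30.0))
```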




A number of consequences can be predicted from this geometric model. First, 
the perceived depth of a point in the 3D scene is determined by the distance 
between its image dots. As a result, stereograms of shallow scenes have fairly 
strong repeated patterns. Conversely, perfectly regular repeated spacing of figures 
can cause the illusion of a recessed plane, called the wallpaper effect. 

Moving the stereogram closer to or farther from your eyes may help you see the 
image, because it allows you to fixate at different depths. Since this does not 
change s or e, similar triangles predict that d increases if p does. Once you have 
seen an image, try slowly moving the stereogram back and forth to experience 
changes in the depth of the perceived image. 

This simple relationship should also be considered when choosing the strip 
width for a particular scene. If the strip is too narrow, relative to the depth of the 
scene, it will lead to gaps in the stereogram, vertical spaces with no dots at all. If it 
is too wide, it may lead to new dots falling in the initial interval, perhaps making 
solid black regions. 

Under what circumstances might you see a completely different second image 
behind the picture? Suppose the strip width is less than half the distance between 
your eyes. If the stereogram was generated by producing eight strips of that width, 
then there are four strips of about twice that width. 
To see a third image it is necessary that 3g < e, where g is the strip width, and so forth. If you have had 
success so far, you might try to see other images in the Mexican Hat stereogram to 
which we have added double wide guide dots. 
If you are looking at a stereogram with wide dot spacing and you can see only 
one or two images in it, take it to a copy machine and reduce the size. Then you 
might see a third and fourth image in it. Try it. We have never seen more than five 
deep in any one picture. Now that you are experienced, try these stereograms, 
which we have labeled to suggest their 3D images. If you find that you need guide 
dots, find a repeated pattern within one row and color any two of the corresponding 
dots. We have included pictures of the underlying surfaces and information 
about their sets of initial dots at the end of this article. 
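
The condition for seeing deeper images can be turned into a rough count. The Python sketch below is an illustration, not the authors' computation. It assumes a nearly flat scene, so that corresponding dots repeat with spacing close to the strip width g, and it assumes the k-th image arises from fusing dots k strips apart, so that the similar-triangles relation applies with s = kg. The function name and the numbers are made up.

```python
def image_depths(g, e, p):
    """Depths behind the picture of the successive images of a (nearly flat)
    stereogram with strip width g, eye spacing e, and viewing distance p.
    The k-th image is taken to need k*g < e and to lie at d = p*k*g/(e - k*g)."""
    if g <= 0:
        raise ValueError("strip width must be positive")
    depths = []
    k = 1
    while k * g < e:
        s = k * g
        depths.append(p * s / (e - s))
        k += 1
    return depths


# Made-up numbers: strips 2 units wide, eyes 6 units apart, picture 20 units away.
print(image_depths(2.0, e=6.0, p=20.0))
# Reducing the picture on a copier shrinks g and lengthens the list.
print(len(image_depths(1.0, e=6.0, p=20.0)))
```

Under these assumptions, shrinking the strip width is exactly what makes additional images available, which matches the copy-machine suggestion above.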
The algorithm uses intersections of the scene with horizontal, rather than 
sloping, planes through the eyes. This is one source of error. 


At the planar level we approximate the spacing for the new dot by 


s = e f(x + g/2, y) / (p + f(x + g/2, y)) 


where g is the width of the generating strip, e is the distance between the 
eyes, p is the distance from the eyes to the picture plane, (x, y) are 
coordinates in the picture plane, and the graph of f is the scene depicted. 


[Figure: the scene z = f(x, y), with the existing dot at (x, y, 0), the approximate depth f(x + g/2, y) compared with the correct depth, and the computed dot compared with the correct dot.] 
Our stereograms were computed using an algorithm similar to the one described 
in [4]. It uses two approximations which avoid the difficulty of computing 
intersections of lines and surfaces (see box). These approximations do cause some 
distortion, but as long as the objects are near the center of the visual field, it is 
usually not too noticeable. In the example below two hemispheres appear to sag 
toward the center, when viewed from a point fairly close to the center of the 
picture. When viewed from points more directly over the hemispheres, the distor- 
tion is reduced. 
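
To make the approximation in the box concrete, here is a minimal Python sketch of one row of a stereogram. It is not the authors' program, nor the algorithm of [4]. It assumes the scene is given by a positive function f(x, y), starts from random dots in a center strip of width g, and places each new dot to the right at distance s = e f(x + g/2, y)/(p + f(x + g/2, y)) from the previous one. The mirror-image rule used when working to the left is an assumption of this sketch, as are the function names and numbers.

```python
import random


def dot_spacing(depth, e, p):
    """Similar triangles: s/e = d/(p + d)."""
    return e * depth / (p + depth)


def stereogram_row(f, y, g, e, p, x_max, n_initial=8, rng=None):
    """Return sorted x-positions of the dots in the row at height y.
    f(x, y) is the scene depth behind the picture plane and must be positive."""
    rng = rng or random.Random(0)
    initial = [rng.uniform(-g / 2, g / 2) for _ in range(n_initial)]
    dots = list(initial)
    for x0 in initial:
        # Work to the right: depth is sampled half a strip ahead of the dot.
        x = x0
        while True:
            s = dot_spacing(f(x + g / 2, y), e, p)
            if s <= 0:
                raise ValueError("scene depth must be positive")
            x += s
            if x > x_max:
                break
            dots.append(x)
        # Work to the left (assumed mirror image of the rule above).
        x = x0
        while True:
            s = dot_spacing(f(x - g / 2, y), e, p)
            if s <= 0:
                raise ValueError("scene depth must be positive")
            x -= s
            if x < -x_max:
                break
            dots.append(x)
    return sorted(dots)


# Made-up example: a flat scene 10 units deep gives evenly repeated dots.
row = stereogram_row(lambda x, y: 10.0, y=0.0, g=2.0, e=6.0, p=20.0, x_max=5.0)
print(len(row))
```

Stacking such rows for a range of y values gives the full picture. A flat scene produces a perfectly regular repetition, the wallpaper effect mentioned earlier.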
There are many questions to explore and images to create with stereograms. 
Psychologists continue to use both stereo pairs and single stereograms for research 
in depth perception. For us the existence of the deeper images and their dependence 
on both the primary surface and the dot spacing algorithm continues to be a 
source of surprise and curiosity. 
Eggcrate Pocket 


Notes. The hemispheres, pocket, and eggcrate were constructed from a random 
vertical center strip. The bug was generated from a random scattering of initial 
points over the whole area of the picture. The pocket was rendered using a small 
circle instead of a dot at each computed position. 
The Fifty-Fourth William Lowell Putnam 
Mathematical Competition 


Leonard F. Klosinski, Gerald L. Alexanderson, 
and Loren C. Larson 


The following results of the fifty-fourth William Lowell Putnam Mathematical 
Competition, held on December 4, 1993, have been determined in accordance with 
the governing regulations. This annual contest is supported by the William Lowell 
Putnam Prize Fund for the Promotion of Scholarship, left by Mrs. Putnam in 
memory of her husband, and is held under the auspices of the Mathematical 
Association of America. 

The first prize, $7,500, was awarded to the Department of Mathematics of Duke 
University. The members of the winning team were: Andrew O. Dittmer, Craig B. 
Gentry, and Jeffrey M. Vanderkam; each was awarded a prize of $500. 

The second prize, $5,000, was awarded to the Department of Mathematics of 
Harvard University. The members of the winning team were Kiran S. Kedlaya, 
Serban M. Nacu, and Royce Y. Peng; each was awarded a prize of $400. 

The third prize, $3,000, was awarded to the Department of Mathematics of 
Miami University. The members of the winning team were John D. Davenport, 
Jason A. Howald, and Matthew D. Wolf; each was awarded a prize of $300. 

The fourth prize, $2,000, was awarded to the Department of Mathematics of the 
Massachusetts Institute of Technology. The members of the winning team were 
Henry L. Cohn, Alexandru D. Ionescu, and Andrew Przeworski; each was awarded 
a prize of $200. 

The fifth prize, $1,000, was awarded to the Department of Mathematics of the 
University of Michigan, Ann Arbor. The members of the winning team were Philip 
L. Beineke, Brian D. Ewald, and Soundararajan Kannan. 

The six highest ranking individual contestants, in alphabetical order, were Craig 
B. Gentry, Duke University; J. P. Grossman, University of Toronto; Wei-Hwa 
Huang, California Institute of Technology; Kiran S. Kedlaya, Harvard University; 
Adam M. Logan, Princeton University; and Lenhard L. Ng, Harvard University. 
Each of these was designated a Putnam Fellow by the Mathematical Association of 
America and awarded a prize of $1,000, by the Putnam Prize Fund. 

The next three highest ranking contestants, in alphabetical order, were Michail 
Sunitsky, Princeton University; Dylan P. Thurston, Harvard University; and Jade 
P. Vinson, Washington University, St. Louis; each was awarded a prize of $500. 

The next six highest ranking contestants, in alphabetical order, were Mikhail 
Kogan, New York University; Akira Negi, University of North Carolina, Chapel 
Hill; Joel E. Rosenberg, Princeton University; David L. Savitt, University of British 
Columbia; Jeffrey D. Wall, Princeton University; and Thomas A. Weston, Mas- 
sachusetts Institute of Technology; each was awarded a prize of $250. 
The next thirteen highest ranking contestants, in alphabetical order, were 
Manjul Bhargava, Harvard University; John D. Davenport, Miami University; 
Andrew O. Dittmer, Duke University; Sergey V. Levin, Harvard University; 
Douglas A. Levy, University of Pennsylvania; William R. Mann, Princeton Univer- 
sity; Adam W. Meyerson, Massachusetts Institute of Technology; Curtis Z. Mitchell, 
Carleton College; Serban M. Nacu, Harvard University; An T. Nguyen, University 
of Texas, Austin; Byron M. Shock, Albertson College of Idaho; Ka-Ping R. Yee, 
University of Waterloo; and Douglas J. Zare, New College of the University of 
South Florida; each was awarded a prize of $100. 


The following teams, named in alphabetical order, received honorable mention: Cornell University, 
with team members Robert D. Kleinberg, Mark Krosky, and Tong Zhang; New York University, 
with team members Igor Berger, Yevgeniy Dodis, and Mikhail Kogan; Princeton University, with team 
members Ze Y. Chen, Adam M. Logan, and William R. Mann; University of Toronto, with team 
members J. P. Grossman, Edwin N. Sato, and Hugh Thomas; and the University of Waterloo, with team 
members Daniel R. L. Brown, Ian A. Goldberg, and Peter L. Milley. 

Honorable mention was achieved by the following twenty-nine individuals named in alphabetical 
order: Jared E. Anderson, University of Victoria; Jonathan E. Atkins, Rose-Hulman Institute of 
Technology; Henry L. Cohn, Massachusetts Institute of Technology; Ilya A. Entin, Massachusetts 
Institute of Technology; Brian D. Ewald, University of Michigan, Ann Arbor; Kevin E. Foltz, Rice 
University; David Friedman, Massachusetts Institute of Technology; Ian A. Goldberg, University of 
Waterloo; H. Tracy Hall, Brigham Young University; John D. Harrington, University of Idaho; Simeon 
J. Hellerman, Brown University; Timothy J. Hollebeek, Calvin College; Alexandru D. Ionescu, Mas- 
sachusetts Institute of Technology; Daniel C. Isaksen, University of California, Berkeley; Soundararajan 
Kannan, University of Michigan, Ann Arbor; Robert D. Kleinberg, Cornell University; Botand Készegi, 
Harvard University; Josh L. Levenberg, Reed College; Jie J. Lou, University of Waterloo; Idris D. 
Mercer, University of Victoria; Frosti Petursson, University of Pennsylvania; Anand J. Reddy, Univer- 
sity of California, Berkeley; Edwin N. Sato, University of Toronto; Jason R. Schweinsberg, Williams 
College; Mark A. Van Raamsdonk, University of British Columbia; Jeffrey M. Vanderkam, Duke 
University; Wayne A. Whitney, Harvard University; Tong Zhang, Cornell University; and Zhaohui 
Zhang, Yale University. 

The other individuals who achieved ranks among the top 101, in alphabetical order of their schools, 
were: University of Arizona, Randy W. Ho; Brigham Young University, John Wesley Robertson; 
California Institute of Technology, Julian C. Jamison; University of California, San Diego, David R. 
Wasserman; University of Chicago, Dean W. Jens; Colorado State University, Matthew K. Kahle; 
Cornell University, Demetrio A. Munoz; Gordon College, Hai Shao; Harvard University, Swaine L. 
Chen, Hank S. Chien, Adam Kalai, Joseph D. Kanapka, Joshua N. Newman, David S. Patterson, Royce 
Y. Peng, Jonathan E. Tannenhauser; Massachusetts Institute of Technology, Andrew Przeworski, 
Dmitriy A. Rogozhnikov, Jason M. Sachs, Ritesh A. Shah; McGill University, Rajesh J. Pereira; Miami 
University, Jason A. Howald, Matthew D. Wolf; University of Michigan, Ann Arbor, Philip L. Beineke; 
University of Minnesota, Minneapolis, Matthew P. Kelly; New York University, Yevgeniy Dodis; North 
Carolina State University, Stephen A. London; University of Northern Colorado, John C. Petherick; 
Princeton University, Steven S. Gubser, Mark W. Lucianovic, Thomas J. Weisswange; University of 
Saskatchewan, Trevor N. Green; Simon Fraser University, Erick B. Wong; Stanford University, 
Svetlozar E. Nestorov; Swarthmore College, Mark D. Kernighan; Texas Tech University, Mikhail V. 
Shubov; Vassar College, Andrew F. Rizzo; Washington University, St. Louis, Benjamin B. Gum, 
Edward D. Hanson, Philip X. Wu; University of Waterloo, Daniel R. L. Brown, Eli Lapell; University 
of Wisconsin, Madison, Brent E. Halsey; and University of Wisconsin, Parkside, Sergey M. Ioffe. 


There were 2356 individual contestants from 402 colleges and universities in 
Canada and the United States in the competition of December 4, 1993. Teams 
were entered by 291 institutions. 

The Questions Committee for the fifty-fourth competition consisted of George 
T. Gilbert, Texas Christian University (Chair); Fan Chung, Bellcore; and Eugene 
Luks, University of Oregon; they composed the problems listed below and were 
most prominent among those suggesting solutions. 

PROBLEMS 


Problem A-1. The horizontal line $y = c$ intersects the curve $y = 2x - 3x^3$ in the 
first quadrant as in the figure. Find $c$ so that the areas of the two shaded regions 
are equal. 


Problem A-2. Let $(x_n)_{n \ge 0}$ be a sequence of nonzero real numbers such that 

$$x_n^2 - x_{n-1}x_{n+1} = 1, \quad \text{for } n = 1, 2, 3, \ldots.$$

Prove there exists a real number $a$ such that $x_{n+1} = a x_n - x_{n-1}$ for all $n \ge 1$. 


Problem A-3. Let $\mathcal{P}_n$ be the set of subsets of $\{1, 2, \ldots, n\}$. Let $c(n, m)$ be the 
number of functions $f : \mathcal{P}_n \to \{1, 2, \ldots, m\}$ such that $f(A \cap B) = \min\{f(A), f(B)\}$. 
Prove that 

$$c(n, m) = \sum_{j=1}^{m} j^n.$$


Problem A-4. Let $x_1, x_2, \ldots, x_{19}$ be positive integers each of which is less than or 
equal to 93. Let $y_1, y_2, \ldots, y_{93}$ be positive integers each of which is less than or 
equal to 19. Prove that there exists a (nonempty) sum of some $x_i$'s equal to a sum 
of some $y_j$'s. 


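Problem A-5. Show that 

$$\int_{-100}^{-10} \left(\frac{x^2 - x}{x^3 - 3x + 1}\right)^2 dx + \int_{1/101}^{1/11} \left(\frac{x^2 - x}{x^3 - 3x + 1}\right)^2 dx + \int_{101/100}^{11/10} \left(\frac{x^2 - x}{x^3 - 3x + 1}\right)^2 dx$$

is a rational number. 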




Problem A-6. The infinite sequence of 2's and 3's 

2, 3, 3, 2, 3, 3, 3, 2, 3, 3, 3, 2, 3, 3, 2, 3, 3, 3, 2, 3, 3, 3, 2, 3, 3, 3, 2, 3, 3, 2, 3, 3, 3, 2, ... 

has the property that, if one forms a second sequence that records the number of 
3's between successive 2's, the result is identical to the given sequence. Show that 
there exists a real number $r$ such that, for any $n$, the $n$th term of the sequence is 2 
if and only if $n = 1 + \lfloor rm \rfloor$ for some nonnegative integer $m$. (Note: $\lfloor x \rfloor$ denotes the 
largest integer less than or equal to $x$.) 


Problem B-1. Find the smallest positive integer $n$ such that for every integer $m$, 
with $0 < m < 1993$, there exists an integer $k$ for which 

$$\frac{m}{1993} < \frac{k}{n} < \frac{m+1}{1994}.$$


Problem B-2. Consider the following game played with a deck of 2n cards 
numbered from 1 to 2n. The deck is randomly shuffled and n cards are dealt to 
each of two players, A and B. Beginning with A, the players take turns discarding 
one of their remaining cards and announcing its number. The game ends as soon 
as the sum of the numbers on the discarded cards is divisible by 2n + 1. The last 
person to discard wins the game. Assuming optimal strategy by both A and B, 
what is the probability that A wins? 


Problem B-3. Two real numbers $x$ and $y$ are chosen at random in the interval (0,1) 
with respect to the uniform distribution. What is the probability that the closest 
integer to $x/y$ is even? Express the answer in the form $r + s\pi$, where $r$ and $s$ are 
rational numbers. 


Problem B-4. The function $K(x, y)$ is positive and continuous for $0 \le x \le 1$, $0 \le y \le 1$, 
and the functions $f(x)$ and $g(x)$ are positive and continuous for $0 \le x \le 1$. 
Suppose that for all $x$, $0 \le x \le 1$, 

$$\int_0^1 f(y)K(x, y)\,dy = g(x) \quad \text{and} \quad \int_0^1 g(y)K(x, y)\,dy = f(x).$$

Show that $f(x) = g(x)$ for $0 \le x \le 1$. 


Problem B-5. Show there do not exist four points in the Euclidean plane such that 
the pairwise distances between the points are all odd integers. 


Problem B-6. Let S be a set of three, not necessarily distinct, positive integers. 
Show that one can transform S into a set containing 0 by a finite number of 
applications of the following rule: Select two of the three integers, say $x$ and $y$, 
where $x \le y$, and replace them with $2x$ and $y - x$. 


SOLUTIONS 


In the 12-tuples $(n_{10}, n_9, n_8, n_7, n_6, n_5, n_4, n_3, n_2, n_1, n_0, n_{-1})$ following each 
problem number below, $n_i$ for $10 \ge i \ge 0$ is the number of students among the top 207 
contestants achieving $i$ points for the problem and $n_{-1}$ is the number of those not 
submitting solutions. 
A-1 (185, 2, 0, 0, 0, 0, 0, 0, 1, 0, 14, 5) 
Solution. The value of c is 4/9. 

Let $(b, c)$ denote the second intersection point. We wish to find $c$ so that 

$$\int_0^b \left[c - (2x - 3x^3)\right] dx = 0.$$

This leads to $cb - b^2 + (3/4)b^4 = 0$. After substituting $c = 2b - 3b^3$ and solving, 
we find that $b = 2/3$ and the result follows. 
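
The arithmetic in this solution can be checked exactly. The following is a minimal sketch, not part of the published solution; the function name `signed_area` is introduced here purely for illustration, and it evaluates the antiderivative $cx - x^2 + (3/4)x^4$ at $x = b$ with $c = 2b - 3b^3$.

```python
from fractions import Fraction


def signed_area(b: Fraction) -> Fraction:
    """Integral from 0 to b of c - (2x - 3x^3), where c = 2b - 3b^3.

    (Hypothetical helper name, used only for this check.)
    """
    c = 2 * b - 3 * b**3
    # An antiderivative of c - 2x + 3x^3 is c*x - x^2 + (3/4)*x^4.
    return c * b - b**2 + Fraction(3, 4) * b**4


b = Fraction(2, 3)
c = 2 * b - 3 * b**3
print("c =", c, " signed area =", signed_area(b))
```

Exact rational arithmetic is used so that the two areas cancel to exactly zero rather than to within rounding error.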


A-2 (146, 21, 6, 0, 0, 0, 0, 0, 8, 1, 17, 8) 
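Solution 1. It is equivalent to show that 

$$\frac{x_{n+1} + x_{n-1}}{x_n}$$

is independent of $n$. This follows (by induction) from 

$$\frac{x_{n+2} + x_n}{x_{n+1}} - \frac{x_{n+1} + x_{n-1}}{x_n} = \frac{(x_n x_{n+2} + x_n^2) - (x_{n+1}^2 + x_{n-1}x_{n+1})}{x_n x_{n+1}}$$

$$= \frac{-(x_{n+1}^2 - x_n x_{n+2}) + (x_n^2 - x_{n-1}x_{n+1})}{x_n x_{n+1}} = \frac{-1 + 1}{x_n x_{n+1}} = 0.$$

Solution 2. For all $n$, 

$$\det\begin{pmatrix} x_{n-1} + x_{n+1} & x_n + x_{n+2} \\ x_n & x_{n+1} \end{pmatrix} = \det\begin{pmatrix} x_{n-1} & x_n \\ x_n & x_{n+1} \end{pmatrix} + \det\begin{pmatrix} x_{n+1} & x_{n+2} \\ x_n & x_{n+1} \end{pmatrix} = -1 + 1 = 0.$$

Thus, $(x_{n-1} + x_{n+1},\; x_n + x_{n+2}) = c_n (x_n, x_{n+1})$ for some scalar $c_n$. Substituting 
$n - 1$ for $n$ and then comparing the coordinate expressions, we see that $c_n = c_{n-1}$ 
(using $x_n \ne 0$). 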

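Comment. In a similar manner, one can prove that if $(x_n)_{n \ge 0}$ is a sequence of 
nonzero real numbers such that 

$$\det\begin{pmatrix} x_n & x_{n+1} & \cdots & x_{n+k} \\ x_{n+1} & x_{n+2} & \cdots & x_{n+k+1} \\ \vdots & \vdots & & \vdots \\ x_{n+k} & x_{n+k+1} & \cdots & x_{n+2k} \end{pmatrix} = c r^n \quad \text{for } n = 1, 2, 3, \ldots,$$

then there exist real numbers $a_1, \ldots, a_k$ such that 

$$x_{n+k+1} = a_1 x_{n+k} + a_2 x_{n+k-1} + \cdots + a_k x_{n+1} + (-1)^k r x_n.$$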


A-3 (2, 11, 24, 0, 0, 0, 0, 0, 27, 8, 54, 81) 


Solution. Let $S = \{1, 2, \ldots, n\}$, and suppose that $f$ is such a function and that 
$f(S) = j$. Then $f(A \cap B) = \min\{f(A), f(B)\}$ implies that the values of $f(S - \{i\})$ 
determine $f(A)$ for all $A \subsetneq S$, since $f(A) = \min_{i \notin A} f(S - \{i\})$ for $A \subsetneq S$. 
Conversely, arbitrary choices of $f(S - \{i\}) \le j$ lead to such a function. Because 
there are $j$ independent choices for each $f(S - \{i\})$, there are $j^n$ functions with 
$f(S) = j$. Summing over the possible values of $f(S)$ yields $c(n, m) = \sum_{j=1}^{m} j^n$. 
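
For very small $n$ and $m$ the count can be confirmed by exhaustive search. The sketch below is an illustration only, not part of the solution: subsets of $\{1, \ldots, n\}$ are encoded as bitmasks (so that intersection is bitwise AND), and the names `count_min_functions` and `closed_form` are introduced here for the check.

```python
from itertools import product


def count_min_functions(n: int, m: int) -> int:
    """Count f: subsets of {1..n} -> {1..m} with f(A & B) == min(f(A), f(B)).

    Brute force over all m**(2**n) functions; only feasible for tiny n, m.
    """
    subsets = range(1 << n)  # each subset is a bitmask; A ∩ B is a & b
    total = 0
    for values in product(range(1, m + 1), repeat=1 << n):
        if all(values[a & b] == min(values[a], values[b])
               for a in subsets for b in subsets):
            total += 1
    return total


def closed_form(n: int, m: int) -> int:
    """The claimed value: sum of j**n for j = 1..m."""
    return sum(j**n for j in range(1, m + 1))


for n, m in [(1, 3), (2, 2), (2, 3), (3, 2)]:
    print(n, m, count_min_functions(n, m), closed_form(n, m))
```

The search grows as $m^{2^n}$, so it serves only as a sanity check on the formula, not as a substitute for the argument.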


A-4 (3, 2, 0, 0, 0, 0, 0, 0, 0, 0, 44, 158) 


Solution. For reasons of symmetry, let us replace 19 and 93 by $m$ and $n$ 
respectively, in the problem statement. Without loss of generality, 
$\sum_{i=1}^{m} x_i \ge \sum_{j=1}^{n} y_j$ (if the two totals are equal we are done, so we may take 
the inequality to be strict). Then, for $0 < k \le n$, there exists $f(k)$, $0 \le f(k) < m$, such that 

$$\sum_{i=1}^{f(k)} x_i \le \sum_{j=1}^{k} y_j < \sum_{i=1}^{f(k)+1} x_i.$$

Let $g(k) = \sum_{j=1}^{k} y_j - \sum_{i=1}^{f(k)} x_i$. Then, for $0 < k \le n$, $0 \le g(k) < x_{f(k)+1} \le n$. If 
$g(k) = 0$ for some $k$, we are done. Otherwise, by the Pigeonhole Principle, there 
exist $k_0 < k_1$ such that $g(k_0) = g(k_1)$, in which case 

$$\sum_{i=f(k_0)+1}^{f(k_1)} x_i = \sum_{j=k_0+1}^{k_1} y_j.$$
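
The pigeonhole argument is constructive, and it may help to see it run. The following is a sketch under stated assumptions rather than a transcription of the solution: the function name `equal_subsums` is hypothetical, the lists are 0-indexed Python lists, and the table of gaps is seeded with the value 0 at $k = 0$ so that the case $g(k) = 0$ and the collision case are handled by the same lookup.

```python
from itertools import accumulate


def equal_subsums(xs, ys):
    """Return (i0, i1, j0, j1) with sum(xs[i0:i1]) == sum(ys[j0:j1]) > 0.

    Assumes all entries are positive integers, every x <= len(ys),
    and every y <= len(xs) (the hypotheses of the problem).
    """
    if sum(xs) < sum(ys):
        # Symmetry: run the argument with the roles of the lists exchanged.
        j0, j1, i0, i1 = equal_subsums(ys, xs)
        return i0, i1, j0, j1

    prefix_x = [0] + list(accumulate(xs))  # prefix_x[f] = x_1 + ... + x_f
    seen = {0: (0, 0)}                     # gap value -> (f, k) where it occurred
    f, running_y = 0, 0
    for k, y in enumerate(ys, start=1):
        running_y += y
        # f(k): the longest x-prefix whose sum does not exceed y_1 + ... + y_k.
        while f < len(xs) and prefix_x[f + 1] <= running_y:
            f += 1
        gap = running_y - prefix_x[f]      # this is g(k), always in 0..len(ys)-1
        if gap in seen:                    # pigeonhole collision (or gap == 0)
            f0, k0 = seen[gap]
            return f0, f, k0, k
        seen[gap] = (f, k)
    raise AssertionError("unreachable when the hypotheses hold")


xs = [93] * 19
ys = [19] * 93
i0, i1, j0, j1 = equal_subsums(xs, ys)
print(sum(xs[i0:i1]), sum(ys[j0:j1]))
```

Only consecutive blocks of each list are ever used, which is slightly stronger than the problem requires.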


A-5 (3, 3, 0, 0, 0, 0, 0, 0, 1, 16, 37, 147) 


Solution 1. Observe first that the roots of $x^3 - 3x + 1$ can be isolated away from 
the given intervals; that is, there are sign changes in the intervals $[-2, -1]$, 
$[1/3, 1/2]$, $[3/2, 2]$. Hence the integrand is defined and continuous throughout. 
Set 

$$f(t) = \int_{-100}^{t} \left(\frac{x^2 - x}{x^3 - 3x + 1}\right)^2 dx + \int_{1/101}^{1/(1-t)} \left(\frac{x^2 - x}{x^3 - 3x + 1}\right)^2 dx + \int_{101/100}^{1 - 1/t} \left(\frac{x^2 - x}{x^3 - 3x + 1}\right)^2 dx$$

for $-100 \le t \le -10$. We wish to compute $f(-10)$. By the Fundamental Theorem 
of Calculus, 

$$f'(t) = Q(t) + Q\!\left(\frac{1}{1-t}\right)\frac{1}{(1-t)^2} + Q\!\left(1 - \frac{1}{t}\right)\frac{1}{t^2},$$

where $Q(x) = \left((x^2 - x)/(x^3 - 3x + 1)\right)^2$. We find that $Q(1/(1-t)) = Q(1 - 1/t) = Q(t)$, so that 

$$f(-10) = \int_{-100}^{-10} \left(\frac{x^2 - x}{x^3 - 3x + 1}\right)^2 \left(1 + \frac{1}{x^2} + \frac{1}{(1-x)^2}\right) dx.$$

But, noting that 

$$\frac{1}{Q(x)} = \left[x + 1 - \frac{1}{x} - \frac{1}{x-1}\right]^2,$$

we see that the last integral is of the form $\int du/u^2$. Hence, its value is 

$$\left. -\frac{x^2 - x}{x^3 - 3x + 1} \right|_{-100}^{-10},$$

which is rational. 
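
As a numerical cross-check (a sketch added for illustration, not part of the published solution), one can evaluate the closed form $-(x^2 - x)/(x^3 - 3x + 1)$ between $-100$ and $-10$ in exact rational arithmetic and compare it with a direct quadrature of the three original integrals. The composite Simpson routine and all function names below are assumptions of this sketch.

```python
from fractions import Fraction


def Q(x):
    """The integrand ((x^2 - x)/(x^3 - 3x + 1))^2."""
    return ((x * x - x) / (x**3 - 3 * x + 1)) ** 2


def antiderivative(x):
    """-1/u with u = (x^3 - 3x + 1)/(x^2 - x), as in the solution."""
    return -(x * x - x) / (x**3 - 3 * x + 1)


# Exact value from the closed form, using rational arithmetic.
exact = antiderivative(Fraction(-10)) - antiderivative(Fraction(-100))


def simpson(func, a, b, steps=20000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = func(a) + func(b)
    for i in range(1, steps):
        total += func(a + i * h) * (4 if i % 2 else 2)
    return total * h / 3


# Direct numerical evaluation of the three integrals in the problem.
numeric = (simpson(Q, -100.0, -10.0)
           + simpson(Q, 1 / 101, 1 / 11)
           + simpson(Q, 101 / 100, 11 / 10))

print(exact, float(exact), numeric)
```

Agreement of the two values to many digits supports the substitution step, although of course it proves nothing by itself.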


Solution 2. By the substitutions $x = -1/(t - 1)$ and $x = 1 - 1/t$, the integrals 
over $[1/101, 1/11]$ and $[101/100, 11/10]$ are respectively converted into integrals 
over $[-100, -10]$. In the course of this it is seen that the function 

$$\left(\frac{x^2 - x}{x^3 - 3x + 1}\right)^2$$

is invariant under each of the substitutions $x \to 1 - 1/x$ and $x \to -1/(x - 1)$ 
(which, in fact, are inverses of one another). Hence the sum of the 
three given integrals is expressible as 

$$\int_{-100}^{-10} \left(\frac{x^2 - x}{x^3 - 3x + 1}\right)^2 \left(1 + \frac{1}{x^2} + \frac{1}{(x - 1)^2}\right) dx.$$

The solution continues as in the first solution. 


Comment. It can be shown, more generally, that if $f(x)$ is a rational polynomial of 
degree at most 4, then 

$$\int_{-100}^{-10} \frac{f(x)}{(x^3 - 3x + 1)^2}\,dx + \int_{1/101}^{1/11} \frac{f(x)}{(x^3 - 3x + 1)^2}\,dx + \int_{101/100}^{11/10} \frac{f(x)}{(x^3 - 3x + 1)^2}\,dx$$

is rational. 


A-6 (0, 1, 0, 0, 0, 0, 1, 3, 3, 11, 36, 152) 


Solution. We show that the conclusion holds with $r = 2 + \sqrt{3}$. 

Assuming the result, we first derive the value of $r$. Observe that, asymptotically, 
the proportion of 2's in the first $n$ terms is $1/r$. Thus, assuming there are about $m$ 
2's in the first $n \approx rm$ terms, there should be about $(r - 1)m$ 3's. These numbers 
give the approximate number of 3's in the intervals following the first $rm$ 2's, namely 
$2m + 3(r - 1)m = (3r - 1)m$. Hence, the proportion of 2's in the first $rm + 
(3r - 1)m = (4r - 1)m$ terms is $rm/(4r - 1)m = r/(4r - 1)$. So we want $r$ to 
satisfy $1/r = r/(4r - 1)$, or $r^2 - 4r + 1 = 0$. Since $r$ must exceed 1, $r = 2 + \sqrt{3}$. 

Note that substituting $m = 0$ into the formula yields the first 2 and $m = 1$ the 
next. Assume the $j$th 2 is in position $1 + \lfloor r(j - 1) \rfloor$ for $j \le m$. Solving 
$1 + \lfloor r(j - 1) \rfloor \le m < 1 + \lfloor rj \rfloor$ yields $j = \lfloor m/(2 + \sqrt{3}) \rfloor + 1$ 2's among the first $m$ 
numbers of the sequence. Thus the $(m + 1)$st 2 is in position 

$$(m + 1) + 2\left(\lfloor m/(2 + \sqrt{3}) \rfloor + 1\right) + 3\left(m - \lfloor m/(2 + \sqrt{3}) \rfloor - 1\right)$$
$$= 4m - \lfloor m/(2 + \sqrt{3}) \rfloor$$
$$= \lceil 4m - (2 - \sqrt{3})m \rceil = \lceil (2 + \sqrt{3})m \rceil,$$

which equals $1 + \lfloor (2 + \sqrt{3})m \rfloor$ because $(2 + \sqrt{3})m$ is not an integer. 
The claim follows by induction. 
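
The claim is easy to test on an initial segment of the sequence. The sketch below is illustrative only; the function names are introduced here. It builds the sequence from its self-describing property and compares the positions of the 2's with $1 + \lfloor (2 + \sqrt{3})m \rfloor$, computing the floor exactly through the identity $\lfloor \sqrt{3}\,m \rfloor = \lfloor \sqrt{3m^2} \rfloor$ so that no floating-point rounding is involved.

```python
from math import isqrt


def self_describing(length):
    """First `length` terms: term i gives the number of 3's after the i-th 2."""
    seq = [2]
    i = 0
    while len(seq) < length:
        seq.extend([3] * seq[i])  # the run of 3's dictated by term i
        seq.append(2)             # followed by the next 2
        i += 1
    return seq[:length]


def predicted_positions(limit):
    """All values 1 + floor((2 + sqrt 3) * m) that are <= limit, m = 0, 1, ..."""
    positions = []
    m = 0
    while True:
        # floor((2 + sqrt 3) m) = 2m + floor(sqrt(3 m^2)), exact in integers
        p = 1 + 2 * m + isqrt(3 * m * m)
        if p > limit:
            return positions
        positions.append(p)
        m += 1


N = 2000
seq = self_describing(N)
actual = [i + 1 for i, term in enumerate(seq) if term == 2]  # 1-indexed
print(actual[:8], predicted_positions(N)[:8])
```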


B-1 (83, 29, 9, 0, 0, 0, 0, 0, 6, 10, 38, 32) 
Solution. First, it is easily verified that 

$$\frac{m}{1993} < \frac{2m + 1}{1993 + 1994} < \frac{m + 1}{1994},$$

so $n = 1993 + 1994 = 3987$ suffices. Now consider $m = 1992$ and suppose 

$$\frac{1992}{1993} < \frac{k}{n} < \frac{1993}{1994}.$$

Since $x/(x + 1)$ is strictly increasing for $x > 0$, we must have $k \le n - 2$ (note: 
$n > 1994$). However, 

$$\frac{1992}{1993} < \frac{n - 2}{n}$$

implies $3986 < n$, so $n \ge 3987$, completing the proof. 
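
The minimality of 3987 can also be confirmed by a direct search in integer arithmetic. This is a sketch for illustration, with a hypothetical helper name `works`; it scans $m$ downward because, as the argument above shows, $m = 1992$ is the case that rules out every smaller $n$.

```python
def works(n: int) -> bool:
    """True if every m with 0 < m < 1993 has an integer k with
    m/1993 < k/n < (m+1)/1994."""
    for m in range(1992, 0, -1):
        k = (m * n) // 1993 + 1        # smallest integer k with k/n > m/1993
        if k * 1994 >= (m + 1) * n:    # that k already violates k/n < (m+1)/1994
            return False
    return True


smallest = next(n for n in range(1, 5000) if works(n))
print(smallest)
```

Cross-multiplying keeps every comparison in exact integers, so no fractions are ever rounded.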


B-2 (89, 0, 1, 0, 0, 0, 0, 0, 7, 3, 42, 65) 


Solution. The probability that A wins is 0. 

Clearly, A cannot win on the first turn. Assume B is to play, and that the total 
of announced numbers is $T$, and that A has cards $x_1, x_2, \ldots, x_k$, and B has cards 
$y_1, y_2, \ldots, y_{k+1}$. Because the integers $T + y_1, \ldots, T + y_{k+1}$ have distinct remainders 
upon division by $2n + 1$, at least one has a remainder other than $2n + 1 - x_1, \ldots, 
2n + 1 - x_k$. If B discards that $y_i$, it is impossible for A's next discard to 
make the total divisible by $2n + 1$. Therefore, A cannot win under optimal play 
by B. 
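
For small decks the conclusion can be verified by exhaustive game-tree search. The following is a minimal sketch, assuming a standard win/lose minimax formulation; the function names are hypothetical and the search is feasible only for small $n$.

```python
from functools import lru_cache
from itertools import combinations


def a_can_force_win(n, hand_a, hand_b):
    """True if the first player (A) can force a win from this deal."""
    mod = 2 * n + 1

    @lru_cache(maxsize=None)
    def mover_wins(mine, theirs, total):
        # `mine` is the hand of the player about to discard.
        for card in mine:
            new_total = (total + card) % mod
            if new_total == 0:
                return True                      # this discard ends the game
            rest = tuple(c for c in mine if c != card)
            if not mover_wins(theirs, rest, new_total):
                return True                      # opponent is left in a losing position
        return False

    return mover_wins(tuple(hand_a), tuple(hand_b), 0)


def a_wins_some_deal(n):
    """True if any deal of the 2n cards lets A force a win."""
    cards = set(range(1, 2 * n + 1))
    return any(
        a_can_force_win(n, hand_a, tuple(sorted(cards - set(hand_a))))
        for hand_a in combinations(sorted(cards), n)
    )


print([a_wins_some_deal(n) for n in range(1, 5)])
```

Since the sum of all $2n$ cards is $n(2n + 1)$, the game always ends by the final discard, so the search never reaches a player with an empty hand.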


B-3 (111, 16, 13, 0, 0, 0, 0, 0, 4, 20, 10, 33) 


Solution. The probability is $(5 - \pi)/4$ (that is, $r = 5/4$, $s = -1/4$). 

Note that the probability that $x/y$ is exactly half an odd integer is 0, so we may 
safely ignore this possibility. 

For any choice of $x$, the closest integer to $x/y$ is even if either $x/y < .5$ or 
$2n - .5 < x/y < 2n + .5$ for some positive integer $n$. 

The event $x/y < .5$, or $2x < y$, can occur only if $x < .5$. Thus its probability is 

$$\int_0^{.5} (1 - 2x)\,dx = \frac{1}{4}.$$

For a positive integer $n$, the probability that $2n - .5 < x/y < 2n + .5$, i.e., that 
$2x/(4n + 1) < y < 2x/(4n - 1)$, is 

$$\int_0^1 \left(\frac{2x}{4n - 1} - \frac{2x}{4n + 1}\right) dx = \frac{1}{4n - 1} - \frac{1}{4n + 1}.$$

Summing from $n = 1$ to $\infty$, we get 

$$\sum_{k=1}^{\infty} \frac{(-1)^{k+1}}{2k + 1} = 1 - \arctan 1 = 1 - \frac{\pi}{4}.$$

The total probability then is $1/4 + 1 - \pi/4 = (5 - \pi)/4$. 

B-4 (0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 61, 144) 


Solution. For $0 \le x \le 1$, 

$$f(x) = \int_0^1 g(t)K(x, t)\,dt = \int_0^1\!\!\int_0^1 f(y)K(t, y)K(x, t)\,dy\,dt = \int_0^1 f(y)L(x, y)\,dy,$$



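where 

$$L(x, y) = \int_0^1 K(x, t)K(t, y)\,dt$$

for $0 \le x \le 1$, $0 \le y \le 1$. 

Similarly, 

$$g(x) = \int_0^1 g(y)L(x, y)\,dy.$$

Since 

$$\int_0^1 \frac{L(x, y)f(y)}{f(x)}\,dy = 1$$

and 

$$\int_0^1 \frac{L(x, y)f(y)}{f(x)} \cdot \frac{g(y)}{f(y)}\,dy = \frac{g(x)}{f(x)}$$

for $0 \le x \le 1$, it follows that $g/f = c$, a constant. (At a point where $g/f$ attains 
its maximum, the second identity expresses that maximum as an average of the values of 
$g/f$ against positive weights of total mass 1, which forces $g/f$ to equal its maximum 
everywhere.) Then 

$$g(x) = c f(x) = c\int_0^1 g(y)K(x, y)\,dy = c^2 \int_0^1 f(y)K(x, y)\,dy = c^2 g(x).$$

Therefore, $c = 1$. 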


B-5 (6, 1, 0, 0, 0, 0, 0, 0, 1, 7, 65, 127) 


Solution. Suppose there were 4 such points. Locate one point at the origin 
and let $v_1, v_2, v_3$ be vectors from the origin to the other three. Since $v_i \cdot v_i$ and 
$|v_i - v_j|^2 = v_i \cdot v_i - 2v_i \cdot v_j + v_j \cdot v_j$, for $i \ne j$, are squares of odd integers we 
know $v_i \cdot v_i$ as well as $2v_i \cdot v_j$, for $j \ne i$, are integers congruent to 1 (mod 8). 
Clearly no three points can be collinear. Hence $v_3 = xv_1 + yv_2$ for some scalars 
$x$ and $y$. Then 

$$\begin{aligned} 2v_1 \cdot v_3 &= 2x\,v_1 \cdot v_1 + 2y\,v_1 \cdot v_2 \\ 2v_2 \cdot v_3 &= 2x\,v_2 \cdot v_1 + 2y\,v_2 \cdot v_2 \\ 2v_3 \cdot v_3 &= 2x\,v_3 \cdot v_1 + 2y\,v_3 \cdot v_2. \end{aligned} \tag{1}$$

Since $v_1$ is not a scalar multiple of $v_2$, 

$$\det\begin{pmatrix} v_1 \cdot v_1 & v_1 \cdot v_2 \\ v_2 \cdot v_1 & v_2 \cdot v_2 \end{pmatrix} > 0$$

so that the first two equations in (1) have a unique rational solution for $x, y$, say 
$x = X/D$, $y = Y/D$, where $X$, $Y$, and $D$ are integers. We may assume 
$\gcd(X, Y, D) = 1$. Then multiplying (1) through by $D$ we have 

$$\begin{aligned} D &\equiv 2X + Y \pmod 8 \\ D &\equiv X + 2Y \pmod 8 \\ 2D &\equiv X + Y \pmod 8. \end{aligned}$$

Adding the first two congruences and subtracting the third gives $2X + 2Y \equiv 0$ 
(mod 8), so that, by the third congruence, $D$ is even. But then the first two 
congruences force $Y$ and $X$, respectively, to be even, a contradiction. 


B-6 (2, 0, 0, 0, 0, 0, 0, 0, 1, 2, 59, 143) 


Solution. Say the numbers are $a, b, c$. First, we reduce to the case that exactly one 
of $a, b, c$ is odd. Namely: (i) if two are odd, apply the rule with those two, and 
none is odd; (ii) if none is odd, divide all numbers by 2 and apply induction; (iii) if 
three are odd, apply the rule once, and exactly one is odd. Once exactly one is odd, 
this will remain so. 

Say $a$ is odd and $b$ and $c$ even. We aim to make the power of 2 dividing $b + c$ 
as large as possible. If $b$ and $c$ have the same number of factors of 2, then applying 
the rule to those two will yield both divisible by a higher power of 2, or one will 
have fewer factors of 2 than the other. Since $b + c$ is constant here, after a finite 
number of applications of the rule, $b$ and $c$ will not have the same number of 
factors of 2. Also, it is easy to see that, possibly after some additional moves, one 
has either $bc = 0$ (in which case one stops), or, the one of $b$ and $c$ divisible by the 
smaller power of 2 is also smaller; say it is $b$, so that $b < c$. 


Case 1: $a > b$. Now work with $a, b$. Then $a$ remains odd, $b$ is doubled, and $b + c$ 
is divisible by a higher power of 2. 


Case 2: $a < b$. Apply the rule first to $a$ and $b$, and then to $b - a$ and $c$. (Note that 
$c > b > b - a$.) One obtains 

$$\begin{array}{ccc} a & b & c \\ 2a & b - a & c \\ 2a & 2b - 2a & c - b + a \end{array}$$

Now the odd number is $c - b + a$, and the sum of the even numbers is $2b$, which 
has more factors of 2 than $b + c$. 
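
The statement itself (though not the particular strategy above) can be confirmed for small triples by breadth-first search over all positions reachable under the rule. This is an illustrative sketch with a hypothetical function name; it relies on the fact that the rule preserves the sum of the three numbers, so only finitely many positions are reachable.

```python
from collections import deque


def can_reach_zero(start):
    """Breadth-first search: can some sequence of moves produce a 0?"""
    start = tuple(sorted(start))
    seen = {start}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        if 0 in state:
            return True
        for i in range(3):
            for j in range(3):
                if i != j and state[i] <= state[j]:
                    x, y = state[i], state[j]
                    third = state[3 - i - j]          # the untouched number
                    nxt = tuple(sorted((2 * x, y - x, third)))
                    if nxt not in seen:
                        seen.add(nxt)
                        queue.append(nxt)
    return False


print(all(can_reach_zero((a, b, c))
          for a in range(1, 13)
          for b in range(a, 13)
          for c in range(b, 13)))
```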

Literacy in the Language of Mathematics 


James O. Bullock 


The American educational system has treated mathematics as skill in numerical 
manipulations, and has used the term “quantitative reasoning” to describe the 
application of mathematics to other areas of study. This serious misconception has 
severely hampered the ability of our students to comprehend important develop- 
ments in scientific and philosophic thought. Mathematics can be more properly 
regarded as a form of language, developed by humankind in order to converse 
about the abstract concepts of numbers and space. In addition to the intellectual 
appeal of its intricate linguistic structure, the language should be appreciated for 
the richness of the literature composed with it. 

The immediate cause for the recent concern about mathematics education is the 
widespread difficulty students apparently experience with this subject. While some 
educators have expressed an interest in improving pedagogy, students have tended 
to opt for a more practical approach whenever it has been open to them: avoiding 
the subject altogether. Predictably, the reaction of the educational system has been 
an attempt to force students to learn mathematics anyway. The mandate to hold 
students’ feet to the fire has been carried out with varying degrees of resolve, the 
debate over how much pain to inflict being dominated by two conflicting lines of 
reasoning. According to the first, there must be some minimum standards which 
can be applied to all students. In addition, since it builds character, everyone 
deserves their fair share of suffering. Those determined to adopt a more merciful 
attitude, on the other hand, often argue that the dose of unpleasantness should be 
limited to what is absolutely necessary. By this logic, mathematics courses need be 
required only when they are listed as prerequisites for courses that are required for 
some other reason. 

In the course of such arguments, the question of mathematics itself has been 
relegated to a position of secondary importance. Implicit in this view is the premise 
that mathematics is a highly esoteric subject which, despite having some practical 
applications, possesses only tenuous connections to other areas of scholarship. Any 
consideration of the intellectual and aesthetic appeal possessed by the discipline 
has been dismissed as irrelevant for all, save the mathematicians themselves. 
Within most curricula, mathematics has been so thoroughly dissociated from all 
other subjects that students generally encounter little evidence that these assump- 
tions might be mistaken. Physical science, for example, is nearly always taught as if 
students know little or nothing of mathematics. Similarly, students are expected to 
learn biology without any reliance upon the concepts and formalism of the physical 
sciences. Many statisticians now insist that their subject is something quite apart 
from mathematics, so that statistics courses do not require any preparation in 
mathematics. Some computer scientists have gone so far as to suggest that if you 
understand the mathematics of their machines, it is not necessary to understand 
the mathematics of natural phenomena. To the students, nothing seems to be 
amiss. They observe that the factors which determine whether a course is classified 
as hard or soft, general or specialized, are the complexity of the problem assign- 
ments and the thickness of the textbook. As the values of these parameters 
increase, the demand for proficiency with specific mathematical techniques also 
increases. Across the entire spectrum of course offerings, however, students are 
exposed only to the vaguest notions of what we mean by rigor. 

A careful consideration of the nature of mathematics as an intellectual activity 
reveals the utter incongruity of this situation. Not only have our students failed to 
appreciate the beauty of mathematics, they have little grasp of the profound 
insights about the natural world which mathematics has made possible. Rather 
than a mere intellectual curiosity or useful skill, mathematics is an important facet 
of the most distinctive capability of the human species. 


MATHEMATICS IS A LANGUAGE. As a general definition, mathematics could 
be called the study of numbers and space. In order to talk about such things, 
mathematicians first had to devise an appropriate vocabulary and alphabet. It is 
apparent that the objects defined by mathematicians are entirely abstract, and can 
never actually be observed in any way except by the human imagination. Although 
many students seem to be disturbed by this fact, mathematics is comprehensible 
precisely because it is abstract. We are not dependent upon observation to know 
that lines are completely straight and parallel lines never meet. It is possible to 
make these assertions with confidence because mathematics was not discovered, it 
was invented. It must also be remembered that imagination and abstraction are 
universal parts of the human experience, and not the exclusive domain of the 
mathematician. The idea of a line or a point is no more abstract than the idea of 
loyalty or freedom. Mathematics differs from other languages not because of its 
inclusion of abstraction, but because of its complete detachment from the compli- 
cations of what we experience by direct observation. 

If mathematics consisted only of new words and symbols, it could properly be 
considered as an extension of existing language. The reason mathematics is a new 
and separate language is that it also has its own syntax and grammar. Having 
devised both vocabulary and rules for the language, mathematicians seek to 
discover what things are possible or not possible to say about numbers and space. 
In this ideal, ordered world of the imagination, it is possible to apply relentless 
logic to any questions that arise. Euclid’s geometry should be appreciated for the 
beauty which can result from this process. His system of axioms, definitions, 
theorems, and constructions clearly lays out both the vocabulary and the rules for 
conversing about his chosen topic. By limiting his considerations to fanciful but 
specific abstractions of his imagination, he was able to produce an indisputably 
logical and thorough exposition. If one understands the language, one will not only 
know what a triangle is, but exactly what can be said about triangles. The lexicon 
and grammar of language are quite precise; the conclusions are quite inescapable. 


FROM THE LINGUISTIC TO THE LITERARY. The natural sciences seek to 
explain the behavior of the things we encounter in nature. Experimental observa- 
tion is the court of highest appeal for all such explanations. What then is the role 
of the abstract, detached language of mathematics in experimental science? The 
answer is not different than that for the roles of language in other forms of 
scholarship. The scientist uses the language of mathematics to construct metaphors 
which represent insights into the workings of nature. The use of metaphors is as 
commonplace in science as it is in poetry. We speak of radio “waves”, subatomic 
“particles”, and celestial “bodies”. What distinguishes the mathematical metaphor 
is the extraordinary power of this language to uncover implications of the underly- 
ing idea. Calling the earth a sphere is an extremely powerful statement, for we 
know a great deal about the things we can say about spheres. The fact that most 
scientists prefer to call such constructions “models” or “laws” does not change 
their essential character. It is just as metaphorical to call the world a sphere as it is 
to call it a stage. 


GETTING THE PICTURE. It is generally agreed that the object of studying 
science is to understand its fundamental concepts, but how does one come to 
understand a mathematical metaphor? It is widely believed that the first step is to 
find a way to remove the mathematical details so that the essential picture can be 
seen unobscured. Mathematics, then, need be introduced only if one is confronted 
with a specific problem that requires a numerical answer. Many educators are fond 
of using the term “quantitative reasoning” as a substitute for “mathematics”. The 
idea that mathematical questions can be answered qualitatively without actually 
using any mathematics represents one of the most serious misconceptions about 
the nature of this discipline. Mathematics is not a way of hanging numbers on 
things so that quantitative answers to ordinary questions can be obtained. It is a 
language that allows one to think about extraordinary questions. Saying that the 
earth has a round shape means only that it has no edges. This non-mathematical 
picture is not simply “qualitative”, “verbal”, or “intuitive”; it is primitive and 
empty. If we wish to construct a meaningful metaphor about the shape of the 
earth, we must use the language of shapes, which is mathematics. To those 
prepared to examine its implications, even the mathematically simple picture of 
the spherical shape has a great richness when used as a metaphorical device. 
Getting the picture does not mean writing the formula or crunching the numbers, 
it means grasping the metaphor. Without mathematics, one cannot even read the 
words. 


READING CRITICALLY. Students of the physical sciences are often left with the 
impression that facility in solving mathematical problems is all that is necessary to 
understand scientific ideas. After all, the entire subject of classical physics can be 
reduced to seven compact, simple-looking equations, and most study time in this 
subject is spent working problems. This view, however, has the same basic flaw as 
the belief that mathematics can be replaced by less abstract language. Just as 
excluding numerical calculations from an argument does not eliminate the need 
for mathematics, one cannot bring mathematics to bear on a question by simply 
assigning numbers and symbols to its elements. Metaphors do not have answers, 
they have implications. When we struggle to find solutions to mathematical 
problems, we are exploring the specific implications of a writer’s metaphors in 
particular physical circumstances. 

In physics, as in all other intellectual disciplines, understanding derives from 
critical thought, not just hard work. The equations of electrodynamics, for exam- 
ple, are important not simply because they give rise to interesting and complicated 
mathematical problems. Maxwell’s picture of flowing force fields which diverge 
and swirl according to precise mathematical relationships is an incredibly rich 
metaphor. In order to understand this literature, it must not only be read, it must 
be pondered carefully. By thoughtful consideration of wisely chosen examples, you 
can begin to discover the things about different physical situations that have 
important influences on the outcomes one observes. You begin to understand the 
metaphor. As Paul Dirac put it: “I understand what an equation means if I have a 
way of figuring out the characteristics of its solution without actually solving it.” 
This is what we mean by physical insight; it is an intuition based upon critical 
analysis of key questions. Einstein’s surmisal that magnetism is fundamentally a 
relativistic effect of electricity, one of the truly remarkable insights of human 
history, came directly from his contemplation of Maxwell’s equations. Building 
such intuition may require considerable effort. Electricity and magnetism are more 
difficult to understand than the shape of the earth, and Maxwell’s metaphor is 
complex in language and construction. On the other hand, one could spend a 
lifetime working out solutions to very complex and difficult problems in electrody- 
namics and never discover the possibility of electromagnetic radiation, even though 
Maxwell’s equations directly imply its existence. A mason may indeed build great 
strength in his arm muscles, but his goal is to build the house. Likewise, the goal of 
the scholar is not technical dexterity, but insight. 


ANALYZING MATHEMATICAL LITERATURE. While it is important to choose 
interesting examples to think about, as a practical matter we are still left with 
mathematical problems to solve. In asserting that mathematics is an idealized 
abstraction which lends itself to elegant, logical treatments, we have overstated our 
case in several particulars. First, not all mathematical problems have solutions. In 
fact, the only way a mathematical system can be made to give uniformly consistent 
answers is for some questions to be left completely unanswered. Even if solutions 
are known to exist, it is sometimes true that there can be no general method for 
finding them. (The most famous example is the problem of trisecting an angle.) 
Such philosophical difficulties with the positivist viewpoint pale in comparison to 
the practical difficulties in finding solutions, even when you know that they exist 
and that you should be able to find them. Specific mathematical problems, no 
matter how well posed, often lead one into an impenetrable thicket from which it 
can be exceedingly difficult to extract solutions by any means. In attempting to 
understand science, one should not let these kinds of mathematical difficulties 
take over. What the scientist wants to know is whether the original metaphor can 
reveal anything about what will be expected to happen in a particular situation. 
Often, a good approximate solution can lead out of the quagmire toward great 
insight. The concepts of field lines, resistance, and capacitance are examples of 
insightful approximations of Maxwell’s equations which allow one to picture and 
analyze the behavior of certain physical systems by avoiding purely mathematical 
entanglements. Hence, the use of these heuristic devices is another way to achieve 
the kind of understanding to which Dirac referred. One must never lose sight of 
the goal of this process. By itself, an individual solution or the particular technique 
used for finding it is important only insofar as it provides clues about the nature of 
the metaphor. It is in the original equations, not in a catalog of solutions, that the 
complete law, the entire metaphor, is to be found. The goal is to grasp the 
metaphor, not just to compile a catalog. 


THE NATURE OF KNOWLEDGE. Finally, we have to consider that the metaphors 
themselves may have limited usefulness. Newton’s laws of motion have turned out 
to need corrections in order to be consistent with the theory of relativity. The 
problems with Maxwell’s laws are more serious: they are internally inconsistent at 
some points. Even the most imaginative metaphor cannot bring every facet of truth 
into plain view. In non-mathematical literature, we are used to the fact that 
meanings become distorted when ideas are stretched too far. (For example, the 
cartoonist Johnny Hart observed some years ago that the trouble with a melting 
pot is that the bottom gets burnt and the scum comes to the top.) In interpreting 
mathematical language in the scientific literature, we tend to become confused by 
the Platonistic view that mathematical objects themselves actually exist. Regardless 
of whether this assertion about mathematics is philosophically correct, the use of 
this language does not imbue one’s ideas about nature with an independent 
existence. The use of mathematical objects is still a metaphorical device. Mathe- 
matical metaphors are not different from other forms of human understanding. 
They may provide some significant insights, but are not a holy grail of ultimate 
truth. This should come as no surprise. We know that the earth is not exactly a 
sphere. Furthermore, the holy grail is a heretical idea, not just to Christian 
theology, but to Western philosophy as well. Despite our desires to the contrary, 
nature is not obliged to limit its complexity to the confines of the human 
imagination. Despair is not an appropriate reaction to this revelation. The impor- 
tance of the insights provided by any single idea is not diminished because it fails 
to explain everything else in nature. In addition, our capabilities are not forever 
fixed. Mathematicians have continued to explore new territories in the abstract 
landscape they have created, and certainly do not believe that this is a task which 
will one day be completed. If all of human understanding is based on metaphors, 
then extending the reach of the languages we use to construct them can greatly 
expand the boundaries of our imagination. We have cited but a scant few of the 
important advances in the natural sciences which have been made possible by the 
development of mathematics. In a fuller accounting, we would need to discuss its 
influences in philosophy and in the social sciences, as well as the fresh insights 
provided by new mathematics into old conundrums. Mathematics, therefore, does 
not simply represent a narrow but important class of intellectual achievement. Its 
distinct and prominent place in the realm of human language and literature makes 
it an essential foundational element of education. 


HOW DOES ONE BECOME LITERATE? As educators, we seek to engage 
students in the exploration of literature, in the understanding of other people’s 
ideas, and in the expression of their own views. If the starting place of this process 
is language, its goal must be literacy. This goal can be attained only through 
thoughtful reading and critical analysis. Having seen that all forms of language 
share the same basic literary device, we can draw rather extensive parallels 
between the meaning of literacy in the language of mathematics and that which 
applies to other languages. The following list of assertions about the nature of 
literacy hardly requires explanation or justification. When applied to mathematics 
education, however, they imply that our current approaches are grossly ill con- 
ceived, and require wholesale revision. 


Individuals can learn more than one language. The assertion that mathematics 
requires a very special innate talent is often used as an excuse for not learning this 
subject and a reason for not trying to teach it. This sort of condescension must not 
be permitted in the educational community. Mathematics is not so different from 
other expressions of the human intellectual capacities for communication and 
imagination. The fact that one can learn to say things in mathematical language that 
cannot be said in other languages is beside the point, as is the possibility that 
different regions of the brain may be used. Mathematics is not the exclusive 
province of either the gifted or the deranged; it is for all who would seek to be 
truly educated. 


One must learn to read before trying to study literature. This point seems so obvious 
that it is surprising that it is violated so systematically, especially in engineering 
and science curricula. Unsuspecting students are routinely confronted by passages 
of unintelligible mathematics. The accompanying explanation, if one is to be 
found, is typically cursory. Usually, it is unclear whether the mathematical inter- 
lude is given as an aside, whether it is a point of central importance, or whether it 
is something the students were supposed to have known already. Often, such 
narratives become more obscure as they progress. In reading any form of litera- 
ture, one may occasionally find it necessary to consult a dictionary, and footnotes 
can be very helpful in clarifying points of grammar or usage. When one has to look 
up every other word and the footnotes begin to swallow up the text, however, a 
reader has very little chance of putting together the meaning of even a single 
complete sentence. If students are given the chance to study enough mathematics 
first, it can be startling how facilely they are able to grasp the central ideas of other 
disciplines. 


Vocabulary is not enough. We all remember vocabulary tests from grade school: 
look up the word in the dictionary, memorize its definition, learn how to spell it, 
and go on to the next word. We compile the same sorts of lists for mathematics 
students: quadratic equations, logarithms, antiderivatives, Bessel functions, etc. 
Furthermore, we forbid our students to use a dictionary to look up the unfamiliar 
mathematics they may encounter in their studies. We insist that they commit the 
entire dictionary to memory. The difficulty with this approach is that knowing the 
formal definitions of a large number of words is no guarantee that one can actually 
say anything comprehensible. One must obtain practice in expressing complete 
thoughts with language. In mathematics, this means constructing examples which 
deal with specific objects and events; that is, applications. It does not matter so 
much whether the objects are observable or abstract, just as it does not matter 
whether the words are in the form of prose or verse. What matters is that words 
must be given a context if we are to be enriched by understanding their meanings. 


Conversational fluency is of limited value. We spend a great deal of time telling our 
students “this is how you solve this kind of problem.” To divide by a fraction, for 
example, you just invert and multiply. This is just like saying “when you answer the 
telephone, pick up the receiver and say ‘hello’.” This might seem to be a step 
forward from the vocabulary lists, but most often it actually represents a retreat. 
You do not even have to know what the individual words mean in order to be able 
to fire back the correct response. Once you have learned the drill, you will never 
have to think about it again. In short, this is training rather than education. 
Whether or not any sort of training has a proper role in university curricula is a 
matter of longstanding controversy, but in this case, the point is moot. We have 
trained our students to become highly proficient in answering questions they will 
never be asked again. Outside the classroom, it is extremely unlikely that you will 
be asked to “solve for x”. The harm we inflict by insisting on this sort of raw skill 
development goes far beyond simply wasting our students’ time, however. Since 
they never really understood the meaning of the answers they were trained to give, 
they will be unable to answer the questions that do come up. Because they come to 
perceive mathematics as a collection of specialized skills, rather than a way of 
thinking and talking, our students are led to conclude that mathematics is either 
worthless or impossible to master. Imagine what we would think if they came to 
this conclusion about, say, Spanish. There was a line in one of our high school 
dialogues that went: “they-always-serve-meatballs-in-the-cafeteria-on-Wednesday.” 
Since I have never had the opportunity to use Spanish to tell someone this, am I to 
conclude that Spanish is worthless for communicating with my fellow humans, or 
that it is too much to expect that I will ever be able to learn how to say anything 
really useful? By offering training rather than education in mathematics, we have 
produced a legion of mathematical sophomores, possessed of an extensive but 
superficial knowledge. Breaking this cycle is one of the most important pedagogical 
challenges in mathematics education. 


Writing is not the same as penmanship. Whereas penmanship used to be considered 
a highly valuable skill, most of us would now refuse to accept a handwritten 
document, no matter how legible or graceful the hand. We would not say that a 
manuscript had been “written” by a machine just because it was typed. In the case 
of mathematics, our attitude is exactly the opposite. We insist on work which has 
been done by hand, and disallow all forms of mechanical or electronic assistance 
because it amounts to cheating. Whether the students are in college learning 
calculus or in the third grade learning multiplication, we insist that they concen- 
trate on developing their skill at the mechanical manipulation of symbols required 
to arrive at the desired product. The time for us to abandon this antiquated and 
misguided attitude is long past. We are wasting our students’ time by teaching 
them how to write with a quill pen when they should be using a typewriter. The use 
of modern technology is beneficial to mathematical undertakings and literary tasks 
alike. One’s attention can be focused on the conceptual content of what is being 
written, rather than the manipulative processes required to produce the writing 
itself. The skills of using the typewriter, the calculator, and the computer are 
useful ones. The value of being able to add large columns of numbers in one’s head 
or remember the Fourier transforms of complicated functions is dubious at best. 
More to the point, in the outside world, it will be considered inappropriate to 
waste time performing such tasks by hand. 


Complicated is not the same thing as sophisticated. Nearly everyone has had 
experience with long, tedious problems involving only elementary arithmetic. 
There is no trick to constructing similar problems at any level of mathematics. 
When a problem appears to be difficult, it is important to distinguish whether 
obtaining the solution requires new insights, a bit of cleverness, or just plain 
drudgery. The simple problems are not the only ones that are worth solving, but it 
is appropriate to offer some justification for tackling difficult ones. 


The study of grammar can be overdone. While systematic consideration of mathe- 
matical grammar is generally ignored in lower level courses in mathematics, it 
often overwhelms everything else in advanced courses. For someone who is 
studying mathematics in order to do a bit of advanced reading, it is not always 
necessary to grasp all of the subtleties of mathematical arguments in order to 
appreciate the usefulness of their conclusions. Some such problems, like proving 
the existence of the derivative, turn out to be quite formidable. It is sometimes 
enough to understand the kinds of things mathematicians worry about, but to leave 
the actual worrying to them. 


It is possible to be fluently ignorant. The ability to read the English language does 
not imply any knowledge of Shakespeare’s plays, nor is an understanding of the 
playwright’s ideas derived simply from hearing his words. Further, because a piece 
of writing is grammatically correct and draws from a large vocabulary is no 
indication that it contains any worthwhile ideas. Similarly, the ability to perform 
mathematical gymnastics is not necessarily indicative of any underlying under- 
standing of either the literature of mathematics or the nature of the language. 
Mathematical exercises should always be chosen in such a way as to provide the 
student with a keener insight into great concepts, not just practice in manipula- 
tions. Education is not just learning to read and write. The questions of what is 
read and what is written about are central to any practical definition of literacy. 


Literature must be studied from original sources. The application of this principle to 
science and mathematics is often misunderstood. We are not suggesting that the 
only proper way to learn Newtonian physics is to study the Principia. Ideas and 
arguments presented in the language of mathematics can be reproduced very 
concisely. When that part of the exposition written in other languages is para- 
phrased or even condensed, little or no loss of meaning need occur as long as the 
mathematics is preserved. On the other hand, major distortions of perception will 
inevitably result if mathematics is replaced by non-mathematical pictures or lists of 
specific examples. It is essential to present the original formulation, the equations, 
which define the author’s metaphor. This is true even if the students’ command of 
the language is limited, and they are able to grasp only some of the simpler 
implications of the metaphor. Having seen the original idea in its entirety, they will 
understand the underlying basis of however much they are ready to learn. Further- 
more, they can approach the subject again in the future without having anything to 
unlearn. 


The accomplishments of the past cannot be dismissed. We have become accustomed 
to considering pre-modern science as so much primitive nonsense: domed sky, flat 
earth, flies erupting spontaneously from dead meat, et cetera. In the humanities, 
such an attitude would appear silly. The works of Shakespeare did not relegate 
those of Sophocles to the dust bin. Because mathematics is generally, and quite 
incorrectly, categorized with science rather than with philosophy, most people, 
including our students, tend to think that the mathematics being taught in the 
present day represents modern developments. In fact, most college students are 
exposed to very few mathematical ideas which originated since the fifteenth 
century. The accomplishments of the early Greek, Hindu, and Islamic scholars are 
of more than historical interest. They were not made obsolete by subsequent 
developments, but rather formed the basis for those very advances. What is 
distressing about the present state of education is that so few students have even 
an inkling of the progress made in mathematics during the last half millennium. 


There is no substitute for learning how to read. The reason mathematics was devised 
in the first place was the inability of existing language to deal with this subject 
matter. Newton, for example, found it necessary to invent the calculus in order to 
develop and express his ideas. Trying to understand Newton without calculus is 
not like trying to understand Sophocles without Greek, it is like trying to under- 
stand Sophocles without words. One could represent Antigone as a series of 
pictures. If done with sufficient skill, the major points of the story line could be 
made evident; but the play is not just a story about some people who end up dead 
under tragic circumstances. To pretend that one can understand Newton without 
using his mathematics would be equally delusional. At best, one might hope to be 
trained in solving certain types of problems involving throwing things up in the air 
or sliding them down inclined planes. With few exceptions, such training will prove 
useless in the long run. Visual images, no matter how important they may be to 
human communication, do not constitute a form of language. Creating a picture 
book is not an act of translation; and, despite the aphorism to the contrary, no 
number of pictures can match the power of the written or spoken word. Mathemat- 
ics, on the other hand, is precisely a form of language. If we truly wish to 
understand the things that have been said with this language, we must eventually 
take up some books with words in them. 


Illiteracy has unfortunate consequences. In demanding picture books, one chooses 
illiteracy and is cut off from some of the most important ideas in the history of 
western civilization. This is a decision that should not be taken lightly by anyone 
who truly desires an education. That Americans seem unaware of the conse- 
quences of such a decision is hardly surprising, since they do not make it for 
themselves. If it makes us feel better about our own ignorance to say that we know 
that the earth is spherical, it is only because we were never told that the educated 
people of the world had known that for at least 1800 years prior to the time 
Columbus set sail. Americans may not think that the earth is flat, but most do 
believe that Michael Jordan can hang in the air and change direction in mid-flight. 
Only the ignorant could be convinced of this. The educated have known better for 
three hundred years. Our own educational system has given out nothing but 
picture books, and as a result, our society has been left illiterate and vulnerable to 
manipulation. We have told the public that they should leave the technical stuff to 
the experts, that they are not smart enough to understand anyway. They have been 
convinced that they are stupid. They are not stupid, they have been cheated. 


Literacy is the goal of education. The study of mathematics not only allows one to 
read and understand the work of others, it also increases one’s own powers of 
thought, imagination, and expression. That so many regard this subject as frighten- 
ing, boring, or otherwise unpleasant represents a disappointing failure of our 
educational efforts. Literacy in mathematics is not simply a question of how much 
or for whom. To improve our present condition will require substantive reform 
across the spectrum of curricula. It will not be enough for our departments of 
mathematics to teach our students how to read. All the rest of us must see to it 
that they do read, that what they read is worthwhile and important, and that they 
are able to comprehend what they read. Becoming literate requires a real educa- 
tion, the kind that does not become out of date, but prepares one for a lifetime of 
learning. Just as there is no aspect of the human experience unworthy of serious 
study, there is no student unworthy of a genuine education. For once, a popular 
buzz word has captured the true essence of a national problem. Our difficulty is 
not that education has failed to keep pace with rapidly changing knowledge, it is 
that we have misplaced the goal of all true education: literacy. 


Fractional and Trigonometric 
Expressions for Matrices 


Goro Shimura 


1. INTRODUCTION. We start with two elementary facts: 

(I) Every rational number $x$ can be expressed as a quotient $c/d$ with two integers $c$ 
and $d$ which have no common divisor greater than 1. Moreover, if $x = c'/d'$ is 
another such expression, then $(c', d') = \pm(c, d)$. 

(II) Every real number $x$ can be expressed as a quotient $c/d$ with two real numbers 
$c$ and $d$ such that $c^2 + d^2 = 1$. Moreover, if $x = c'/d'$ is another such expression, 
then $(c', d') = \pm(c, d)$. 

One can add some geometric flavor to the latter statement by using $\sin\theta$ and 
$\cos\theta$ instead of $c$ and $d$. 
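
As a small numerical illustration of statements (I) and (II), here is a minimal Python sketch. The function names `lowest_terms` and `unit_circle_terms` are hypothetical, and the sign ambiguity $\pm(c, d)$ is resolved by the assumption $d > 0$, which is a choice made here and not part of the statements themselves.

```python
from fractions import Fraction
import math


def lowest_terms(x):
    """Statement (I): return integers (c, d) with x = c/d, gcd(c, d) = 1, d > 0."""
    x = Fraction(x)  # Fraction always stores a reduced numerator/denominator pair
    return x.numerator, x.denominator


def unit_circle_terms(x):
    """Statement (II): return reals (c, d) with x = c/d, c^2 + d^2 = 1, d > 0."""
    theta = math.atan(x)  # the angle in (-pi/2, pi/2) whose tangent is x
    return math.sin(theta), math.cos(theta)


c, d = lowest_terms(Fraction(6, -8))
print(c, d)

s, t = unit_circle_terms(0.75)
print(s, t, s * s + t * t, s / t)
```

In both cases the other admissible pair is obtained by changing the sign of both members, which is the uniqueness clause of (I) and (II).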

There is a certain parallelism between these two statements, but I guess most 
readers will wonder if it is really essential or merely rhetorical. This point aside, we 
can pose the question of whether the same types of statements can be made for 
matrices, which is the meaning of the title of this article. Considering only the first 
type for the moment, we can ask: 

(A) Given a matrix $x$ with rational entries, can one find matrices $c$ and $d$ with 
integral entries so that $x = d^{-1}c$ and that this is an expression “reduced to the lowest 
terms”? If the answer is affirmative, to what extent is the pair $(c, d)$ unique? 

Here we don’t have to assume that $x$ is square; if $x$ is an $m \times n$-matrix, then $c$ 
must be of the same shape and $d$ invertible and $m \times m$. Similarly we can ask for 
an expression $x = cd^{-1}$ with $c$ and $d$ of appropriate shapes. There is of course a 
trivial answer: take $d$ to be the scalar which is the least common denominator of 
the entries of $x$. But look at the equality 


(2 ye ! _ (° (3 3 | 
Which side is more appropriately called the quotient reduced to “the lowest 
terms”? Clearly the left-hand side looks more “reduced.” 

This example suggests that the answer to (A) should perhaps be formulated in 
terms of elementary divisors of integral matrices, and in fact (A) can be answered 
easily that way, as will be shown below. However, our aim is not only to give an 
answer to (A), which is quite elementary, but also to supply a certain conceptual 
background for this type of question. At the same time, we shall show that (II), as 
well as its matrix version, can be understood in the same framework. In the last 
two sections we shall give higher-dimensional analogues of $\cos\theta$ and $\sin\theta$, as well 
as their hyperbolic counterparts, in the context of statement (II), which are closely 
related to the decomposition of the type $G = K \cdot \exp(\mathfrak{p})$ for compact and noncompact 
Lie groups $G$. Our theorems or lemmas in the other sections can hardly be 
called new, but it seems that some of them have never been stated in the forms in 
which we present them in this article. 


2. FRACTIONAL EXPRESSIONS FOR RATIONAL AND REAL MATRICES. Let 
us first make some notational conventions. For an associative ring $R$ with identity 
element, $M^m_n(R)$ denotes the module of all $m \times n$-matrices with entries in $R$ and 
$GL_n(R)$ the group of all invertible elements of the ring $M^n_n(R)$. The zero element 
of $M^m_n(R)$ is denoted by $0^m_n$ or simply by 0, the identity element of $M^n_n(R)$ by $1_n$ or 
simply by 1, and the transpose of a matrix $X$ by ${}^tX$. For square matrices 
$A_1, \dots, A_r$ we denote by $\mathrm{diag}[A_1, \dots, A_r]$ the square matrix with the $A_i$ in the 
diagonal blocks and 0 everywhere else. As usual, $\mathbf{Z}$, $\mathbf{Q}$, and $\mathbf{R}$ denote the ring of 
integers, the field of rational numbers, and the field of real numbers. 
We now recall 


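Lemma 1. Given $X \in M^m_n(\mathbf{Q})$ of rank $r$, there exist $A \in GL_m(\mathbf{Z})$, $B \in GL_n(\mathbf{Z})$ and 
positive rational numbers $e_1, \dots, e_r$ such that $e_{i+1}/e_i \in \mathbf{Z}$ for all $i < r$ and 

$$AXB = \begin{pmatrix} E & 0^r_{n-r} \\ 0^{m-r}_r & 0^{m-r}_{n-r} \end{pmatrix}, \qquad E = \mathrm{diag}[e_1, \dots, e_r]. \tag{2.1}$$

Moreover, the $e_i$ are uniquely determined by $X$. 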


This is usually stated only for $X \in M^m_n(\mathbf{Z})$, but for $X \in M^m_n(\mathbf{Q})$ if we take 
$g \in \mathbf{Z}$, $> 0$, so that $gX \in M^m_n(\mathbf{Z})$ and apply the theorem to $gX$, then we immediately 
obtain the result in the above form. The numbers $e_1, \dots, e_r$ are called the 
elementary divisors of $X$. 

We call an element $X$ of $M^m_n(\mathbf{Z})$ primitive if $\mathrm{rank}(X) = \mathrm{Min}(m, n)$ and the 
elementary divisors of $X$ are all equal to 1. If $m = n$, clearly $X$ is primitive if and 
only if $X \in GL_n(\mathbf{Z})$. 
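
The elementary divisors of a small rational matrix can be computed directly. The following is a minimal Python sketch, not the reduction procedure behind Lemma 1: it relies on the classical fact that for an integral matrix the product $e_1 \cdots e_k$ equals the greatest common divisor of all $k \times k$ minors, and it handles rational entries by clearing a common denominator $g$ exactly as in the remark above. The names `det` and `elementary_divisors` are hypothetical, and `math.lcm` assumes Python 3.9 or later.

```python
from fractions import Fraction
from itertools import combinations
from math import gcd, lcm


def det(rows):
    """Determinant of a small square integer matrix by Laplace expansion."""
    if len(rows) == 1:
        return rows[0][0]
    total = 0
    for j, entry in enumerate(rows[0]):
        if entry:
            minor = [r[:j] + r[j + 1:] for r in rows[1:]]
            total += (-1) ** j * entry * det(minor)
    return total


def elementary_divisors(X):
    """Elementary divisors e_1, ..., e_r of a rational matrix X (a list of rows)."""
    X = [[Fraction(v) for v in row] for row in X]
    g = lcm(*[v.denominator for row in X for v in row])  # gX has integer entries
    Z = [[int(v * g) for v in row] for row in X]
    m, n = len(Z), len(Z[0])
    divisors, prev = [], 1
    for k in range(1, min(m, n) + 1):
        d_k = 0  # gcd of all k x k minors of gX
        for rows in combinations(range(m), k):
            for cols in combinations(range(n), k):
                d_k = gcd(d_k, det([[Z[i][j] for j in cols] for i in rows]))
        if d_k == 0:  # all k x k minors vanish, so the rank is k - 1
            break
        divisors.append(Fraction(d_k // prev, g))  # e_k(X) = e_k(gX) / g
        prev = d_k
    return divisors


print(elementary_divisors([[Fraction(1, 2), 0], [0, Fraction(2, 3)]]))
```

For $X = \mathrm{diag}[1/2, 2/3]$ this returns $1/6$ and $2$, whose ratio $12$ is an integer as Lemma 1 requires. The enumeration of minors is exponential in the matrix size, so the sketch is meant only for small examples.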


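Lemma 2. Let $X \in M^m_n(\mathbf{Z})$ and $m < n$. Then the following conditions on $X$ are 
mutually equivalent: 

(1) $X$ is primitive. 

(2) There exists an element $Y$ of $M^{n-m}_n(\mathbf{Z})$ such that $\begin{pmatrix} X \\ Y \end{pmatrix} \in GL_n(\mathbf{Z})$. 

(3) There exists an element $W$ of $M^n_m(\mathbf{Z})$ such that $XW = 1_m$. 

(4) If $C \in M^1_m(\mathbf{Q})$ and $CX \in M^1_n(\mathbf{Z})$, then $C \in M^1_m(\mathbf{Z})$. 

(5) For every $l \in \mathbf{Z}$, $> 0$, if $C \in M^l_m(\mathbf{Q})$ and $CX \in M^l_n(\mathbf{Z})$, then $C \in M^l_m(\mathbf{Z})$. 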


Proof: Put $s = n - m$. If $X$ is primitive, our definition implies that $AXB = 
(1_m \;\; 0^m_s)$ with $A \in GL_m(\mathbf{Z})$ and $B \in GL_n(\mathbf{Z})$. Let $Y = (0^s_m \;\; 1_s)B^{-1}$. Then 

$$\begin{pmatrix} X \\ Y \end{pmatrix} = \begin{pmatrix} A^{-1} & 0 \\ 0 & 1_s \end{pmatrix} B^{-1} \in GL_n(\mathbf{Z}),$$

which proves that (1) $\Rightarrow$ (2). With $Y$ as in (2) put $\begin{pmatrix} X \\ Y \end{pmatrix}^{-1} = (W \;\; Z)$ with $W \in M^n_m(\mathbf{Z})$ 
and $Z \in M^n_s(\mathbf{Z})$. Then $XW = 1_m$, and hence (2) $\Rightarrow$ (3). With $W$ as in (3) let 
$C \in M^1_m(\mathbf{Q})$ and $CX \in M^1_n(\mathbf{Z})$. Then $C = CXW \in M^1_m(\mathbf{Z})$, and hence (3) $\Rightarrow$ (4). 
Decomposing $C$ of (5) into row vectors, we easily see that (4) $\Rightarrow$ (5). Finally, 
assume (4). If $\mathrm{rank}(X) < m$, then there is a vector $v \in M^1_m(\mathbf{Q})$, $\neq 0$, such that 
$vX = 0$. Changing $v$ for its suitable rational multiple if necessary, we may assume 
that $v \notin M^1_m(\mathbf{Z})$. This cannot happen under (4). Thus $\mathrm{rank}(X) = m$. Now take $A$, 
$B$, $E$, and $e_1, \dots, e_m$ as in Lemma 1. Then $E^{-1}AX = (1_m \;\; 0)B^{-1} \in M^m_n(\mathbf{Z})$. Then 
(5) implies that $E^{-1}A \in M^m_m(\mathbf{Z})$, and hence $E^{-1} \in M^m_m(\mathbf{Z})$. Therefore $e_i = 1$ for 
every $i$, which proves that (4) $\Rightarrow$ (1) and our proof is complete. 


Now an answer to (A) can be given as follows: 


Theorem 1. Given $x \in M^m_n(\mathbf{Q})$, the following assertions hold: 

(1) There exist $c \in M^m_n(\mathbf{Z})$ and $d \in M^m_m(\mathbf{Z}) \cap GL_m(\mathbf{Q})$ such that $(c \;\; d)$ is primitive 
and $x = d^{-1}c$. 

(2) If $c$ and $d$ are as in (1) and if $x = d'^{-1}c'$ with $c' \in M^m_n(\mathbf{Z})$ and $d' \in 
M^m_m(\mathbf{Z}) \cap GL_m(\mathbf{Q})$, then $c' = hc$ and $d' = hd$ with $h \in M^m_m(\mathbf{Z})$. In addition if $(c' \;\; 
d')$ is also primitive, then $h \in GL_m(\mathbf{Z})$. 

(3) Let $e_1, \dots, e_r$ be the elementary divisors of $x$ and for each $i$ let $e_i = f_i/g_i$ with 
relatively prime positive integers $f_i$ and $g_i$. If $c$ and $d$ are as in (1), then the elementary 
divisors of $c$ are $f_1, \dots, f_r$ and the elementary divisors of $d$ are $1, \dots, 1, g_r, \dots, g_1$ 
with 1 repeated $m - r$ times. 


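Proof: Assuming the existence of $c$ and $d$ as in (1), suppose $x = d'^{-1}c'$ with 
$c' \in M^m_n(\mathbf{Z})$ and $d' \in M^m_m(\mathbf{Z}) \cap GL_m(\mathbf{Q})$; put $h = d'd^{-1}$. Then $d' = hd$, $c' = 
d'x = hc$, and so $h(c \;\; d) = (c' \;\; d') \in M^m_{m+n}(\mathbf{Z})$. By (5) of Lemma 2 we have 
$h \in M^m_m(\mathbf{Z})$. If $(c' \;\; d')$ is also primitive, then exchanging $(c \;\; d)$ for $(c' \;\; d')$, we 
find that $h^{-1} \in M^m_m(\mathbf{Z})$, and hence $h \in GL_m(\mathbf{Z})$. This proves (2). To prove (1) and 
(3), we apply Lemma 1 to $x$ to find an expression $axb = \begin{pmatrix} e & 0 \\ 0 & 0 \end{pmatrix}$ with $a \in GL_m(\mathbf{Z})$, 
$b \in GL_n(\mathbf{Z})$, $e = \mathrm{diag}[e_1, \dots, e_r]$, $0 < e_i \in \mathbf{Q}$, $e_{i+1}/e_i \in \mathbf{Z}$. Put $e_i = f_i/g_i$ as in (3) 
and $d = \mathrm{diag}[g_1, \dots, g_r, 1_{m-r}]a$, $f = \mathrm{diag}[f_1, \dots, f_r]$, $q = \begin{pmatrix} f & 0 \\ 0 & 0 \end{pmatrix}$, and $c = qb^{-1}$, 
where the zeros in $q$ are arranged so that $q \in M^m_n(\mathbf{Z})$. Then $x = d^{-1}c$ and 

$$(c \;\; d)\begin{pmatrix} b & 0 \\ 0 & a^{-1} \end{pmatrix} = \begin{pmatrix}
f_1 & \cdots & 0 & 0 & g_1 & \cdots & 0 & 0 \\
\vdots & \ddots & \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\
0 & \cdots & f_r & 0 & 0 & \cdots & g_r & 0 \\
0 & \cdots & 0 & 0 & 0 & \cdots & 0 & 1_{m-r}
\end{pmatrix}.$$

We easily see that the right-hand side is primitive (by checking (4) of Lemma 2, for 
example), and hence $(c \;\; d)$ is primitive since $\mathrm{diag}[b, a^{-1}] \in GL_{m+n}(\mathbf{Z})$. This 
proves (1). Put $k_i = e_{i+1}/e_i$. Then $k_i f_i g_{i+1} = g_i f_{i+1}$. Since $f_i$ and $g_i$ are relatively 
prime, we see that $f_i \mid f_{i+1}$, and similarly $g_{i+1} \mid g_i$. Therefore we obtain (3). 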


Remarks. (R1) If $(c \;\; d)$ is as in Theorem 1, then (3) of Lemma 2 guarantees 
elements $a \in M^n_m(\mathbf{Z})$ and $b \in M^m_m(\mathbf{Z})$ such that $ca + db = 1_m$. Thus, as in number 
theory, $(c \;\; d)$ may be called relatively prime. It should be noted that, as can be 
seen from the example $x = \mathrm{diag}[2, 1]^{-1}\mathrm{diag}[1, 2]$, the maximum elementary divisors 
of $c$ and $d$ may not be relatively prime. 

(R2) The expression $x = cd^{-1}$ can be treated in the same fashion by making 
obvious modifications. For example, in this case we assume that $\begin{pmatrix} c \\ d \end{pmatrix}$ is primitive. 


(R3) Clearly all the above results can be extended to the case of matrices with 
entries in the field of quotients of a principal ideal domain, or in a central division 
algebra over a p-adic field. This comment applies also to (1) of Theorems 3, 4, 5 
and Lemma 6 below. 
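
The primitivity condition of Lemma 2 can be tested mechanically on the example of (R1). The sketch below is a hedged illustration, not the author's argument: it uses the classical fact that an integral $m \times n$ matrix with $m \le n$ has rank $m$ and all elementary divisors equal to 1 exactly when the greatest common divisor of its $m \times m$ minors is 1. The names `det` and `is_primitive` are hypothetical. The two test matrices are both of the form $(c \;\; d)$ with $d^{-1}c = \mathrm{diag}[1/2, 2]$; the second takes $d$ to be the scalar common denominator.

```python
from itertools import combinations
from math import gcd


def det(rows):
    """Determinant of a small square integer matrix by Laplace expansion."""
    if len(rows) == 1:
        return rows[0][0]
    total = 0
    for j, entry in enumerate(rows[0]):
        if entry:
            minor = [r[:j] + r[j + 1:] for r in rows[1:]]
            total += (-1) ** j * entry * det(minor)
    return total


def is_primitive(X):
    """True if the integer m x n matrix X (m <= n) has coprime m x m minors."""
    m, n = len(X), len(X[0])
    g = 0
    for cols in combinations(range(n), m):
        g = gcd(g, det([[row[j] for j in cols] for row in X]))
    return g == 1


# (c d) with c = diag[1, 2] and d = diag[2, 1], as in (R1)
reduced = [[1, 0, 2, 0],
           [0, 2, 0, 1]]
# (c d) with c = diag[1, 4] and d = 2 * identity (scalar denominator)
scalar = [[1, 0, 2, 0],
          [0, 4, 0, 2]]

print(is_primitive(reduced), is_primitive(scalar))
```

Only the first pair passes the test, so only it qualifies as an expression of the kind described in (1) of Theorem 1.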

Let us now turn to real matrices. In fact, we shall treat not only real matrices 
but also complex and quaternion matrices, since all these can be handled easily by 
the same technique. To make our exposition uniform, we let $K$ denote any one of 
the following three objects: the real number field $\mathbf{R}$, the complex number field $\mathbf{C}$, 
and the division ring of Hamilton quaternions $\mathbf{H}$. For $X = (x_{ij}) \in M^m_n(K)$ we 
define $X^* \in M^n_m(K)$ by $X^* = (y_{ij})$ with $y_{ij} = \bar{x}_{ji}$, where $\bar{x}$ is the image of $x$ under 
complex conjugation or quaternion conjugation. We put also 

$$U_n(K) = \{a \in GL_n(K) \mid aa^* = 1_n\},$$
$$S_n(K) = \{X \in M^n_n(K) \mid X^* = X\},$$

and call an element $X$ of $S_n(K)$ positive definite (resp. nonnegative) if $y^*Xy > 0$ 
(resp. $y^*Xy \ge 0$) for every $y \in M^n_1(K)$, $\neq 0$. If $K = \mathbf{R}$, then $X^* = {}^tX$ and $U_n(K)$ is 
the orthogonal group of degree $n$. For $X, Y \in S_n(K)$ we write $X > Y$ and $Y < X$ if 
$X - Y$ is positive definite. 


Lemma 3. If $y \in M^m_n(K)$, then $yy^*$ is nonnegative. Conversely, every positive definite 
element $s$ of $S_n(K)$ can be written in the form $s = r^2$ with a positive definite element $r$ 
of $S_n(K)$ and also in the form $s = aa^*$ with an upper triangular matrix $a$ in $M^n_n(K)$ 
whose diagonal elements are positive real numbers. Both $r$ and $a$ are unique for $s$. 


This is well known. For $0 < s \in S_n(K)$ we write $s^{1/2} = r$ with the above element 
$r$; we then put $s^{-1/2} = (s^{-1})^{1/2}$. 
Now the higher-dimensional analogue of (II) of the introduction is given by 


Theorem 2. Given $x \in M^m_n(K)$, there exist $c \in M^m_n(K)$ and $d \in GL_m(K)$ such that 
$x = d^{-1}c$ and $dd^* + cc^* = 1_m$. If $x = d'^{-1}c'$ is another such expression, then 
$c' = hc$ and $d' = hd$ with $h \in U_m(K)$. Moreover there is a unique choice of such 
$(c \;\; d)$ for $x$ with the property that $0 < d \in S_m(K)$. 


Proof: Observing that $1_m + xx^*$ is positive definite, we find an element $e$ of 
$GL_m(K)$ such that $1_m + xx^* = ee^*$. Putting $d = e^{-1}$ and $c = dx$, we obtain the 
first assertion. Given another such pair $(c', d')$, put $h = d'e$. Then $d' = hd$, 
$c' = hc$, and $hh^* = d'ee^*d'^* = d'(1_m + xx^*)d'^* = d'd'^* + c'c'^* = 1_m$, and hence 
$h \in U_m(K)$. By Lemma 3 we can put $e = (1_m + xx^*)^{1/2}$. Then $0 < d \in S_m(K)$. If 
$0 < d' \in S_m(K)$, then $d'^2 = d^*h^*hd = d^2$, and hence $d' = d$ by Lemma 3. This 
completes the proof. 
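
The first step of the proof can be traced numerically for $K = \mathbf{R}$. The following Python sketch makes one substitution that should be stated plainly: instead of the symmetric square root or the upper triangular factor of Lemma 3, it takes for $e$ the lower triangular Cholesky factor of $1_m + xx^*$, which is enough for the existence assertion since any $e \in GL_m(K)$ with $ee^* = 1_m + xx^*$ will do. All function names are hypothetical.

```python
import math


def matmul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(col) for col in zip(*a)]


def cholesky(s):
    """Lower triangular e with e e^T = s, for real symmetric positive definite s."""
    n = len(s)
    e = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            acc = s[i][j] - sum(e[i][k] * e[j][k] for k in range(j))
            e[i][j] = math.sqrt(acc) if i == j else acc / e[j][j]
    return e


def solve_lower(e, b):
    """Solve e y = b by forward substitution (e lower triangular, b a matrix)."""
    n, cols = len(e), len(b[0])
    y = [[0.0] * cols for _ in range(n)]
    for i in range(n):
        for j in range(cols):
            acc = sum(e[i][k] * y[k][j] for k in range(i))
            y[i][j] = (b[i][j] - acc) / e[i][i]
    return y


def trig_fraction(x):
    """Return (c, d) with x = d^{-1} c and d d^T + c c^T = 1_m, for a real m x n x."""
    m = len(x)
    identity = [[float(i == j) for j in range(m)] for i in range(m)]
    xxt = matmul(x, transpose(x))
    s = [[identity[i][j] + xxt[i][j] for j in range(m)] for i in range(m)]
    e = cholesky(s)                 # 1_m + x x^T = e e^T
    d = solve_lower(e, identity)    # d = e^{-1}
    c = matmul(d, x)                # c = d x
    return c, d


x = [[1.0, 2.0, 0.0], [0.5, -1.0, 3.0]]
c, d = trig_fraction(x)
dd, cc = matmul(d, transpose(d)), matmul(c, transpose(c))
print([[round(dd[i][j] + cc[i][j], 12) for j in range(2)] for i in range(2)])
```

With this choice $d$ is lower triangular rather than symmetric, so the pair produced here differs from the distinguished pair with $0 < d \in S_m(K)$ by a left factor $h \in U_m(K)$, in agreement with the second assertion of the theorem.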


Remarks. (R4) In the above theorem, we can take d to be an upper triangular 
matrix with positive diagonal elements. When d is so chosen, the pair (c, d) is 
unique for x. This can be shown as in the above proof by means of Lemma 3. Can 
we prove a similar uniqueness in Theorem 1? We shall answer this question in 
(R7) below. 

(R5) We are tempted to call an element $X$ of $M^m_n(K)$, $m < n$, primitive if 
$XX^* = 1_m$, though that may not be good terminology. If we do use it, then $(c \;\; d)$ 
with $c, d$ as in Theorem 2 is primitive. Moreover, the analogue of (1) $\Leftrightarrow$ (2) of 
Lemma 2 can be stated as follows: 

An element $X$ of $M^m_n(K)$, $m < n$, is primitive if and only if there is an element $Y$ of 
$M^{n-m}_n(K)$ such that $\begin{pmatrix} X \\ Y \end{pmatrix} \in U_n(K)$. 


Now the last part of Theorem 2 and its proof can be stated in the following way. 


Lemma 4. Put $\varphi(x) = (1_m + xx^*)^{-1/2}x$ for $x \in M^m_n(K)$. Then $\varphi$ gives a one-to-one 
map of $M^m_n(K)$ onto $\{y \in M^m_n(K) \mid 1_m > yy^*\}$, and the inverse map $\psi$ of $\varphi$ is given by 
$\psi(y) = (1_m - yy^*)^{-1/2}y$. 


This is an easy exercise (cf. [2, Lemma 2.3]). 
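
For a quick check of Lemma 4 in the simplest case, take $K = \mathbf{R}$ and $m = 1$, so that $x$ is a real row vector and $xx^*$ is the scalar $|x|^2$. The sketch below, with the hypothetical names `phi` and `psi`, covers only this special case and is not a treatment of general $m$.

```python
import math


def phi(x):
    """phi(x) = (1 + |x|^2)^(-1/2) x for a real row vector x (the case m = 1)."""
    norm_sq = sum(t * t for t in x)
    return [t / math.sqrt(1.0 + norm_sq) for t in x]


def psi(y):
    """psi(y) = (1 - |y|^2)^(-1/2) y, defined on the open unit ball |y| < 1."""
    norm_sq = sum(t * t for t in y)
    if norm_sq >= 1.0:
        raise ValueError("psi is only defined for |y| < 1")
    return [t / math.sqrt(1.0 - norm_sq) for t in y]


x = [3.0, -4.0, 12.0]
y = phi(x)
print(sum(t * t for t in y))  # squared length of phi(x), which lies below 1
print(psi(y))                 # recovers x up to rounding
```

For $n = 1$ and $x = \tan\theta$ this is the passage from $\tan\theta$ to $\sin\theta$ and back, which is the scalar statement (II) of the introduction.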


3. A GROUP-THEORETICAL INTERPRETATION. Let us now show that there 
are some group-theoretical facts which give more substance to the parallelism 
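between Theorems 1 and 2. For $F = \mathbf{Q}$ or $K$ (or for any ring $F$) we define 
subgroups $P_n(F)$ of $GL_n(F)$ and $P_{m,n}(F)$ of $GL_{m+n}(F)$ by 

$$P_{m,n}(F) = \left\{ \begin{pmatrix} a & b \\ 0 & d \end{pmatrix} \in GL_{m+n}(F) \;\middle|\; a \in GL_m(F),\ b \in M^m_n(F),\ d \in GL_n(F) \right\},$$

$$P_n(F) = \bigcap_{r=0}^{n} P_{r,n-r}(F),$$

where we understand that $P_{n,0}(F) = P_{0,n}(F) = GL_n(F)$. Clearly $P_n(F)$ is the 
group of all upper triangular matrices of $GL_n(F)$. 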


Theorem 3. (1) $GL_n(\mathbf{Q}) = P_n(\mathbf{Q})GL_n(\mathbf{Z}) = P_{r,n-r}(\mathbf{Q})GL_n(\mathbf{Z})$ $(0 < r < n)$. 
(2) $GL_n(K) = P_n(K)U_n(K) = P_{r,n-r}(K)U_n(K)$ $(0 < r < n)$. 

Proof: Though these are well known, we give a proof here for the reader’s 
convenience. Since $P_n(F) \subset P_{r,n-r}(F)$, it is sufficient to prove the first equality in 
each case. We first prove Case (1) by induction on $n$. It is trivial if $n = 1$. Given 
$\xi \in GL_n(\mathbf{Q})$, $n > 1$, let $x$ be the last row of $\xi$. Then $x = qy$ with $0 \neq q \in \mathbf{Q}$ and a 
primitive element $y$ of $M^1_n(\mathbf{Z})$. Take an element $a$ of $GL_n(\mathbf{Z})$ whose last row is $y$. 
Then $y = (0^1_{n-1} \;\; 1)a$, so that $xa^{-1} = (0^1_{n-1} \;\; q)$. Thus we can put 

$$\xi a^{-1} = \begin{pmatrix} r & s \\ 0 & q \end{pmatrix}$$

with $r \in M^{n-1}_{n-1}(\mathbf{Q})$ and $s \in M^{n-1}_1(\mathbf{Q})$. By induction we find $\tau \in P_{n-1}(\mathbf{Q})$ and 
$\sigma \in GL_{n-1}(\mathbf{Z})$ such that $r = \tau\sigma$. Then $\xi a^{-1} \cdot \mathrm{diag}[\sigma^{-1}, 1] \in P_n(\mathbf{Q})$, which gives 
(1). To prove (2), let $\xi \in GL_n(K)$. By Lemma 3 we can find $\eta \in P_n(K)$ such that 
$\xi\xi^* = \eta\eta^*$. Then $\eta^{-1}\xi \in U_n(K)$ and $\xi = \eta \cdot \eta^{-1}\xi$, which proves (2). 
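
The argument for (2) is constructive, and for $K = \mathbf{R}$ it can be followed step by step. The Python sketch below is an illustration under that assumption only: it computes an upper triangular $\eta$ with positive diagonal and $\eta\,{}^t\eta = \xi\,{}^t\xi$ (the factorization $s = aa^*$ of Lemma 3) and then forms $u = \eta^{-1}\xi$ by back substitution. The function names are hypothetical.

```python
import math


def upper_factor(s):
    """Upper triangular eta, positive diagonal, with eta eta^T = s (s symmetric pos. def.)."""
    n = len(s)
    eta = [[0.0] * n for _ in range(n)]
    for j in range(n - 1, -1, -1):          # fill columns from last to first
        tail = sum(eta[j][k] ** 2 for k in range(j + 1, n))
        eta[j][j] = math.sqrt(s[j][j] - tail)
        for i in range(j):
            cross = sum(eta[i][k] * eta[j][k] for k in range(j + 1, n))
            eta[i][j] = (s[i][j] - cross) / eta[j][j]
    return eta


def solve_upper(eta, b):
    """Solve eta u = b by back substitution (eta upper triangular, b a matrix)."""
    n, cols = len(eta), len(b[0])
    u = [[0.0] * cols for _ in range(n)]
    for i in range(n - 1, -1, -1):
        for j in range(cols):
            acc = sum(eta[i][k] * u[k][j] for k in range(i + 1, n))
            u[i][j] = (b[i][j] - acc) / eta[i][i]
    return u


def parabolic_times_orthogonal(xi):
    """Write an invertible real matrix xi as eta * u, eta upper triangular, u orthogonal."""
    n = len(xi)
    s = [[sum(xi[i][k] * xi[j][k] for k in range(n)) for j in range(n)]
         for i in range(n)]                  # s = xi xi^T
    eta = upper_factor(s)
    u = solve_upper(eta, xi)                 # u = eta^{-1} xi
    return eta, u


xi = [[2.0, 1.0, 0.0], [1.0, 3.0, 1.0], [0.0, 1.0, 4.0]]
eta, u = parabolic_times_orthogonal(xi)
print([[round(sum(u[i][k] * u[j][k] for k in range(3)), 12) for j in range(3)]
       for i in range(3)])
```

The printed matrix is $u\,{}^tu$, which should be the identity up to rounding, confirming that $u$ lies in the orthogonal group while $\eta$ lies in $P_n(\mathbf{R})$.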

The groups $P_{m,n}$ and $P_n$ are examples of parabolic subgroups of algebraic 
groups, and the decompositions of the above types are well known for classical 
groups. We shall give some more examples in Theorem 5 below. 

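Let us now derive the essential part of Theorems 1 and 2 from Theorem 3. 
Given $x \in M^m_n(\mathbf{Q})$, we consider an element 

$$\begin{pmatrix} 1_n & 0^n_m \\ x & 1_m \end{pmatrix}$$

of $GL_{m+n}(\mathbf{Q})$. By (1) of Theorem 3 we have 

$$\begin{pmatrix} 1_n & 0^n_m \\ x & 1_m \end{pmatrix} = \begin{pmatrix} p & q \\ 0 & s \end{pmatrix}\begin{pmatrix} a & b \\ c & d \end{pmatrix} \tag{3.1}$$

with 

$$\begin{pmatrix} p & q \\ 0 & s \end{pmatrix} \in P_{n,m}(\mathbf{Q}) \quad\text{and}\quad \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in GL_{m+n}(\mathbf{Z}).$$

Then $(c \;\; d)$ is primitive and $(x \;\; 1_m) = (sc \;\; sd)$. Therefore $d$ is invertible and 
$x = d^{-1}c$. 

Similarly, for $x \in M^m_n(K)$ we have (3.1) with 

$$\begin{pmatrix} p & q \\ 0 & s \end{pmatrix} \in P_{n,m}(K) \quad\text{and}\quad \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in U_{m+n}(K).$$

Again $d$ is invertible and $x = d^{-1}c$. Since $\begin{pmatrix} a & b \\ c & d \end{pmatrix} \in U_{m+n}(K)$, we have $cc^* + dd^* = 
1_m$ (or $(c \;\; d)$ is primitive in the sense of (R5)). 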

Thus the parallelism between Theorems 1 and 2 can be interpreted as the 
parallelism between (1) and (2) of Theorem 3, or rather, between the decomposi- 
tions given by (3.1) in both cases, 
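
The case $m = n = 1$ makes the two fractional expressions concrete. The sketch below is an illustration with names of our own choosing (`rational_pair`, `unit_pair`): over $\mathbf{Q}$ it returns the primitive integer pair $(c, d)$ with $x = d^{-1}c$, and over $\mathbf{R}$ it uses the choice $d = (1 + x^2)^{-1/2}$, $c = dx$, which gives $c^2 + d^2 = 1$.

```python
from fractions import Fraction
import math


def rational_pair(x):
    """Primitive integer pair (c, d) with x = c / d (case A = Q, m = n = 1)."""
    x = Fraction(x)               # Fraction stores numerator/denominator in lowest terms
    return x.numerator, x.denominator


def unit_pair(x):
    """Real pair (c, d) with c*c + d*d = 1 and x = c / d (case K = R, m = n = 1)."""
    d = 1.0 / math.sqrt(1.0 + x * x)
    return d * x, d


c_q, d_q = rational_pair(Fraction(6, -4))
c_r, d_r = unit_pair(0.75)
theta = math.atan(0.75)           # then c_r = sin(theta) and d_r = cos(theta)
```

In the real case the pair is a point on the unit circle, which is the one-dimensional picture behind the trigonometric functions of Section 5.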

Theorem 3 positions $U_n(K)$ as the counterpart of $GL_n(\mathbf{Z})$. As the counterpart of
Lemma 1 in this sense we obtain


Lemma 5. Given $X \in M^m_n(K)$ of rank $r$, there exist $A \in U_m(K)$, $B \in U_n(K)$ and
positive real numbers $e_1, \ldots, e_r$ such that $e_i \le e_{i+1}$ for all $i < r$ and (2.1) holds.
Moreover, the $e_i$ are uniquely determined by $X$.


The statement for $X$ belonging to the semisimple part of $GL_n(K)$ is a special
case of the fact in the theory of Lie groups which is usually stated as G = KA,K 
(cf. [1, p. 402]).


Proof: The uniqueness follows from the easy fact that the $e_i^2$ are the nonzero
eigenvalues of $XX^*$. To obtain the desired expression, we view (left matrix
multiplication by) $X$ as a (right-) $K$-linear map of $M^n_1(K)$ into $M^m_1(K)$. Let $V$ be
the orthogonal complement of $\mathrm{Ker}(X)$ in $M^n_1(K)$. Then $X$ is essentially a map of
$V$ onto $XV$. This reduces our problem to the case in which $m = n$ and $X \in GL_n(K)$.
In this case we can find an element $A$ of $U_n(K)$ so that $AXX^*A^* = \mathrm{diag}[f_1, \ldots, f_n]$,
$0 < f_1 \le \cdots \le f_n$. Let $e_i = f_i^{1/2}$ and $B = X^{-1}A^{-1}\mathrm{diag}[e_1, \ldots, e_n]$. Then $BB^* = 1_n$,
and we obtain the desired expression.


Remarks. (R6) As mentioned in (R3), we can prove the same types of results with 
any p-adic field F and its ring A of p-adic integers in place of Q and Z. In this 
case, $GL_n(A)$ is a maximal compact subgroup of $GL_n(F)$, as $U_n(K)$ is in $GL_n(K)$.
(The fact follows easily from the p-adic version of Lemma 1 and Lemma 5.) This 
kind of analogy belongs to the standard philosophy in number theory today, but we 
shall not make any p-adic discussion in this article. 

(R7) In Theorem 1, for a given $x$ the element $d$ can be changed for any element
in the coset $GL_m(\mathbf{Z})d$. This means that if we fix a complete set of representatives
$D$ for $GL_m(\mathbf{Z}) \backslash GL_m(\mathbf{Q})$, we can choose $d$ from $D$. Now by Theorem 3(1) we have
$GL_m(\mathbf{Q}) = GL_m(\mathbf{Z})P_m(\mathbf{Q})$. Therefore we can take $D$ to be a subset of $P_m(\mathbf{Q})$. More
precisely, if we take a complete set of representatives $D_0$ for $[GL_m(\mathbf{Z}) \cap P_m(\mathbf{Q})] \backslash P_m(\mathbf{Q})$,
then, given $x$, there exists a unique $(c\ \ d)$ as in Theorem 1(1) with $d \in D_0$.
4. FRACTIONAL EXPRESSIONS FOR HERMITIAN AND SKEWHERMITIAN
MATRICES. By restricting our matrices to symmetric, alternating, hermitian, or 
skewhermitian ones, we can still make clear-cut statements on their fractional 
expressions. Though the results follow immediately from Theorems 1 and 2, they 
reflect the decompositions of symplectic, orthogonal, and unitary groups in the 
same sense as in Theorem 3, and therefore, may make the parallelism more 
cogent. 

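With $A$ denoting $\mathbf{Z}$, $\mathbf{Q}$, or $K$ and the symbol $\varepsilon = (\varepsilon_1, \varepsilon_2)$ with $\varepsilon_i = +$ or $-$ we
put

$$S^\varepsilon(A) = \{x \in M^n_n(A) \mid x^\rho = -\varepsilon_1 x\},$$

$$G^\varepsilon(A) = \{X \in GL_{2n}(A) \mid XJ_\varepsilon X^\rho = J_\varepsilon\}, \qquad J_\varepsilon = \begin{pmatrix} 0 & \varepsilon_1 1_n \\ 1_n & 0 \end{pmatrix},$$

$$C^\varepsilon(K) = G^\varepsilon(K) \cap U_{2n}(K),$$

$$V^\varepsilon(A) = \{w \in M^n_{2n}(A) \mid wJ_\varepsilon w^\rho = 0,\ \mathrm{rank}(w) = n\}.$$

Here $X^\rho = X^*$ if $\varepsilon_2 = +$ and $X^\rho = {}^tX$ if $\varepsilon_2 = -$; we consider $\varepsilon_2 = -$ only when
$A = K = \mathbf{C}$. If $A = \mathbf{H}$, the condition $\mathrm{rank}(w) = n$ means that $wM^{2n}_1(\mathbf{H}) = M^n_1(\mathbf{H})$.
Clearly $S^{(-,+)}(K) = S_n(K)$. If $w = (c\ \ d) \in M^n_{2n}(A)$ with $c, d \in M^n_n(A)$ and
$\mathrm{rank}(w) = n$, then

$$w \in V^\varepsilon(A) \iff cd^\rho = -\varepsilon_1 dc^\rho \iff cd^\rho \in S^\varepsilon(A). \tag{4.1}$$

If $A$ is $\mathbf{Z}$, $\mathbf{Q}$, or $\mathbf{R}$, the group $G^{(-,+)}(A)$ is usually denoted by $Sp(n, A)$. If
$w \in V^\varepsilon(A)$, $\alpha \in GL_n(A)$, and $\beta \in G^\varepsilon(A)$, then $\alpha w\beta \in V^\varepsilon(A)$. Since $(0\ \ 1_n) \in V^\varepsilon(A)$,
we have $(0\ \ 1_n)\beta \in V^\varepsilon(A)$. Clearly $(0\ \ 1_n)\beta$ is the lower half of $\beta$. We
also put

$$W^\varepsilon(\mathbf{Z}) = \{w \in V^\varepsilon(\mathbf{Z}) \mid w \text{ is primitive}\}, \tag{4.2}$$

$$W^\varepsilon(K) = \{w \in V^\varepsilon(K) \mid ww^* = 1_n\}. \tag{4.3}$$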


Clearly an element $(c\ \ d)$ of $V^\varepsilon(K)$ belongs to $W^\varepsilon(K)$ if and only if $cc^* + dd^* = 1_n$.
Now our fractional expressions for the matrices in $S^\varepsilon(A)$ are given by


Theorem 4. (1) Given $x \in S^\varepsilon(\mathbf{Q})$, there exists $(c\ \ d) \in W^\varepsilon(\mathbf{Z})$ with an invertible $d$
such that $x = d^{-1}c$.

(2) Given $x \in S^\varepsilon(K)$, there exists $(c\ \ d) \in W^\varepsilon(K)$ with an invertible $d$ such that
$x = d^{-1}c$.


Proof: For $x \in S^\varepsilon(\mathbf{Q})$ take $c$ and $d$ as in (1) of Theorem 1. Then ${}^tc\cdot{}^td^{-1} = {}^t(d^{-1}c)
= -\varepsilon_1 d^{-1}c$, and hence $(c\ \ d) \in W^\varepsilon(\mathbf{Z})$, which proves (1). Assertion (2) can be
derived from Theorem 2 in the same fashion.


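As the analogue of Lemma 2(2) and the statement in (R5) we obtain:

Lemma 6. (1) $W^\varepsilon(\mathbf{Z}) = (0\ \ 1_n)G^\varepsilon(\mathbf{Z})$.

(2) $W^\varepsilon(K) = (0\ \ 1_n)C^\varepsilon(K)$. More precisely, $\alpha \mapsto (0\ \ 1_n)\alpha$ gives a one-to-one
map of $C^\varepsilon(K)$ onto $W^\varepsilon(K)$.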


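Proof: To prove (1), we first observe that $(0\ \ 1_n)G^\varepsilon(\mathbf{Z}) \subset W^\varepsilon(\mathbf{Z})$ since $(0\ \ 1_n)$ is
primitive and $G^\varepsilon(\mathbf{Z}) \subset GL_{2n}(\mathbf{Z})$. Conversely, take $x \in W^\varepsilon(\mathbf{Z})$. By (2) of Lemma 2,
we can find some $y \in M^n_{2n}(\mathbf{Z})$ such that $\begin{pmatrix} y \\ x \end{pmatrix} \in GL_{2n}(\mathbf{Z})$. Put $\alpha = \begin{pmatrix} y \\ x \end{pmatrix}$. Then

$$\alpha J_\varepsilon\,{}^t\alpha = \begin{pmatrix} y \\ x \end{pmatrix} J_\varepsilon\,({}^ty\ \ {}^tx) = \begin{pmatrix} u & v \\ \varepsilon_1{}^tv & 0 \end{pmatrix}$$

with $u, v \in M^n_n(\mathbf{Z})$. Since $\alpha J_\varepsilon\,{}^t\alpha \in GL_{2n}(\mathbf{Z})$, we see that $v \in GL_n(\mathbf{Z})$. Put
$\beta = \mathrm{diag}[\varepsilon_1 v^{-1}, 1_n]$. Then

$$\beta\alpha J_\varepsilon\,{}^t\alpha\,{}^t\beta = \begin{pmatrix} z & \varepsilon_1 1_n \\ 1_n & 0 \end{pmatrix}$$

with $z \in M^n_n(\mathbf{Z})$. If $(a\ \ b)$ is the upper half of $\beta\alpha$, then $z = \varepsilon_1 a\,{}^tb + b\,{}^ta$. Put

$$\gamma = \begin{pmatrix} 1_n & -b\,{}^ta \\ 0 & 1_n \end{pmatrix}.$$

Then $\gamma\beta\alpha J_\varepsilon\,{}^t\alpha\,{}^t\beta\,{}^t\gamma = J_\varepsilon$, and so $\gamma\beta\alpha \in G^\varepsilon(\mathbf{Z})$. Now we see that $(0\ \ 1_n)\gamma\beta =
(0\ \ 1_n)$, and hence $(0\ \ 1_n)\gamma\beta\alpha = (0\ \ 1_n)\alpha = x$, which proves (1). Assertion (2)
follows from

$$C^\varepsilon(K) = \left\{ \begin{pmatrix} d' & \varepsilon_1 c' \\ c & d \end{pmatrix} \;\middle|\; (c\ \ d) \in W^\varepsilon(K) \right\}, \tag{4.4}$$

where $(d'\ \ c') = (\bar d\ \ \bar c)$ if $K = \mathbf{C}$ and $\varepsilon_2 = -$; $(d'\ \ c') = (d\ \ c)$ otherwise. In
fact, it is easy to verify that any matrix on the right-hand side is contained in
$C^\varepsilon(K)$. Conversely, let

$$\alpha \in C^\varepsilon(K), \quad (0\ \ 1_n)\alpha = (c\ \ d), \quad\text{and}\quad \beta = \begin{pmatrix} d' & \varepsilon_1 c' \\ c & d \end{pmatrix}.$$

Then $\beta \in C^\varepsilon(K)$ and $(0\ \ 1_n)\alpha = (0\ \ 1_n)\beta$, and hence $\alpha\beta^{-1} = \begin{pmatrix} a & b \\ 0 & 1_n \end{pmatrix}$ with
$a, b \in M^n_n(K)$. Since $\alpha\beta^{-1} \in C^\varepsilon(K)$, we see that $a = 1$ and $b = 0$, which proves (4.4).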
To obtain the analogue of Theorem 3 in the present case, put 


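$$P^\varepsilon(\mathbf{Q}) = G^\varepsilon(\mathbf{Q}) \cap P_{n,n}(\mathbf{Q}), \qquad P^\varepsilon(K) = G^\varepsilon(K) \cap P_{n,n}(K).$$


Theorem 5. (1) $G^\varepsilon(\mathbf{Q}) = P^\varepsilon(\mathbf{Q})G^\varepsilon(\mathbf{Z})$.
(2) $G^\varepsilon(K) = P^\varepsilon(K)C^\varepsilon(K)$.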


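Proof: To prove (1), let $\xi \in G^\varepsilon(\mathbf{Q})$. By Theorem 3(1) we have $\xi = \eta\alpha$ with
$\eta \in P_{n,n}(\mathbf{Q})$ and $\alpha \in GL_{2n}(\mathbf{Z})$. Put

$$\eta = \begin{pmatrix} a & b \\ 0 & d \end{pmatrix} \quad\text{and}\quad \xi = \begin{pmatrix} p & q \\ r & s \end{pmatrix}$$

with $a$, $b$, etc. of size $n$. Then $d^{-1}(r\ \ s) = (0\ \ 1_n)\alpha$. Since $\xi \in G^\varepsilon(\mathbf{Q})$, the left-hand
side of the last equality, $d^{-1}(r\ \ s)$, belongs to $V^\varepsilon(\mathbf{Q})$. The right-hand side $(0\ \ 1_n)\alpha$
is primitive. Thus $(0\ \ 1_n)\alpha \in W^\varepsilon(\mathbf{Z})$. By Lemma 6(1) we have $(0\ \ 1_n)\alpha =
(0\ \ 1_n)\beta$ with $\beta \in G^\varepsilon(\mathbf{Z})$. Put $\gamma = \alpha\beta^{-1}$. Then $(0\ \ 1_n)\gamma = (0\ \ 1_n)$, and hence
$\gamma \in P_{n,n}(\mathbf{Q})$. Now $\xi = \eta\alpha = \eta\gamma\beta$, which proves (1) if we can show that $\eta\gamma \in P^\varepsilon(\mathbf{Q})$,
but this is indeed the case, since $\eta\gamma \in P_{n,n}(\mathbf{Q})$ and $\eta\gamma = \xi\beta^{-1} \in G^\varepsilon(\mathbf{Q})$. Similarly
let $\xi \in G^\varepsilon(K)$. By Theorem 3(2) we have $\xi = \eta\alpha$ with $\eta \in P_{n,n}(K)$ and $\alpha \in U_{2n}(K)$.
With $d$, $r$, and $s$ as above, $(0\ \ 1_n)\alpha = d^{-1}(r\ \ s) \in W^\varepsilon(K)$. By Lemma 6(2) we
have $(0\ \ 1_n)\alpha = (0\ \ 1_n)\beta$ with $\beta \in C^\varepsilon(K)$. Repeating the above argument with $K$
in place of $\mathbf{Q}$, we obtain (2).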
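The above theorem enables us to explain Theorem 4 again by means of the
decomposition of (3.1). Indeed, if $x \in S^\varepsilon(A)$, then $\begin{pmatrix} 1_n & 0 \\ x & 1_n \end{pmatrix} \in G^\varepsilon(A)$. By Theorem 5
we have (3.1) with $\begin{pmatrix} p & q \\ 0 & s \end{pmatrix} \in P^\varepsilon(A)$ and $\begin{pmatrix} a & b \\ c & d \end{pmatrix} \in C$, where $C$ denotes $C^\varepsilon(K)$ or
$G^\varepsilon(\mathbf{Z})$ according as $A = K$ or $\mathbf{Q}$. Then we obtain $x = d^{-1}c$ with $(c\ \ d) \in W^\varepsilon(K)$
or $W^\varepsilon(\mathbf{Z})$.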

We conclude this section by adding some historical notes. The expression 
$x = d^{-1}c$ of a symmetric rational matrix $x$ by means of $(c\ \ d)$ belonging to
$W^{(-,+)}(\mathbf{Z})$ was first considered by Siegel in [3, p. 653]. In particular, he observes
that $|\det(d)|$ is the product of the reduced denominators of the elementary
divisors of $x$ (as shown in Theorem 1(3)), and puts $\nu(x) = |\det(d)|$. He employed
this function $\nu$ in his investigations of Eisenstein series in [3] and of indefinite
quadratic forms in [4]. It should also be noted that Lemma 6(1) for $G^\varepsilon(\mathbf{Q}) =
Sp(n, \mathbf{Q})$ was first given by Siegel. The decomposition of an algebraic group over a
local field as in Theorems 3 and 5, as well as the generalization of Siegel’s function 
v, has been employed in recent papers on automorphic forms. 


5. MATRIX-VALUED TRIGONOMETRIC FUNCTIONS. As mentioned at the 
beginning, $c$ and $d$ in the one-dimensional case can be given as $\cos\theta$ and $\sin\theta$. Let
us now give higher-dimensional analogues of these trigonometric functions which
play similar roles. With fixed $m$ and $n$ we put

$$W(K) = \{(c\ \ d) \in M^m_n(K) \times M^m_m(K) \mid cc^* + dd^* = 1_m\},$$

$$W'(K) = \{(c\ \ d) \in W(K) \mid d = d^*\}.$$

For $X \in M^n_n(K)$ we put $\exp(X) = \sum_{\nu=0}^{\infty} X^\nu/\nu!$ as usual, and we define an
$M^m_n(K)$-valued function $c$, an $M^m_m(K)$-valued function $d$, and a $GL_{m+n}(K)$-valued
function $E$ on $M^m_n(K)$ by

$$c(y) = \sum_{\nu=0}^{\infty} (-yy^*)^\nu y/(2\nu+1)!, \qquad d(y) = \sum_{\nu=0}^{\infty} (-yy^*)^\nu/(2\nu)!,$$

$$E(y) = \exp\begin{pmatrix} 0^n_n & -y^* \\ y & 0^m_m \end{pmatrix} \qquad (y \in M^m_n(K)).$$

We have clearly

$$c(y)^* = c(y^*), \quad d(y)^* = d(y), \quad {}^tc(y) = c({}^ty), \quad {}^td(y) = d(\bar y),$$

$$E(y) = \begin{pmatrix} d(y^*) & -c(y)^* \\ c(y) & d(y) \end{pmatrix}.$$

Since $\exp(X) \in U_n(K)$ if $X^* = -X \in M^n_n(K)$, the matrix $E(y)$ belongs to
$U_{m+n}(K)$, and hence

$$c(y)c(y)^* + d(y)^2 = 1_m, \qquad c(y)^*c(y) + d(y^*)^2 = 1_n,$$

$$c(y)d(y^*) = d(y)c(y).$$
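
The identities above can be checked numerically on partial sums of the two series. The sketch below is only an illustration for real matrices, so that $y^*$ is the transpose; the helper names (`matmul`, `transpose`, `axpy`, `c_and_d`) and the truncation at 30 terms are our own choices, not part of the article.

```python
import math


def matmul(A, B):
    """Product of matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def transpose(A):
    return [list(col) for col in zip(*A)]


def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def axpy(s, A, B):
    """Return s*A + B entrywise."""
    return [[s * a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


def c_and_d(y, terms=30):
    """Partial sums of c(y) and d(y) for a real m x n matrix y."""
    m, n = len(y), len(y[0])
    yy = matmul(y, transpose(y))             # y y*, an m x m matrix
    power = identity(m)                      # holds (-y y*)^nu
    c = [[0.0] * n for _ in range(m)]
    d = [[0.0] * m for _ in range(m)]
    for nu in range(terms):
        d = axpy(1.0 / math.factorial(2 * nu), power, d)
        c = axpy(1.0 / math.factorial(2 * nu + 1), matmul(power, y), c)
        power = [[-e for e in row] for row in matmul(power, yy)]
    return c, d


y = [[0.3, -1.2, 0.5], [0.7, 0.1, -0.4]]     # m = 2, n = 3
c, d = c_and_d(y)
_, d_star = c_and_d(transpose(y))            # d(y*), an n x n matrix
unit_check = axpy(1.0, matmul(c, transpose(c)), matmul(d, d))   # c c* + d^2
c1, d1 = c_and_d([[0.9]])                    # scalar case: sin and cos
```

In the scalar case the two functions reduce to the sine and cosine, which is the sense in which $c$ and $d$ are matrix-valued trigonometric functions.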
Theorem 6. Every element $x$ of $M^m_n(K)$ can be written $x = d(y)^{-1}c(y)$ with an
element $y \in M^m_n(K)$ such that $d(y)$ is invertible. Moreover, we have

$$W(K) = U_m(K)W'(K) = U_m(K)\{(c(y)\ \ d(y)) \mid y \in M^m_n(K)\}, \tag{5.1}$$

$$W'(K) = \{(c(y)\ \ d(y)) \mid y \in M^m_n(K)\} \quad\text{if } m \le n, \tag{5.2}$$

$$U_{m+n}(K) = (U_n(K) \times U_m(K))E(M^m_n(K)), \tag{5.3}$$

where $U_n(K) \times U_m(K) = \{\mathrm{diag}[a, b] \mid a \in U_n(K),\ b \in U_m(K)\}$.
Proof: The first assertion follows from Theorem 2 and (5.1). Let $(c\ \ d) \in W(K)$.
By Lemma 5 we have $d = uev$ with $u, v \in U_m(K)$ and a real diagonal matrix $e$. Put
$w = v^{-1}u^{-1}$ and $(e\ \ f) = w(c\ \ d)$. Then $(e\ \ f) \in W(K)$ and $f = v^{-1}ev$. Thus
$f^* = f$ and so $(e\ \ f) \in W'(K)$, which proves the first equality of (5.1). Assuming
now $m \le n$, let us prove (5.2). Clearly the right-hand side is contained in the
left-hand side. To prove the opposite inclusion, let $(c\ \ d) \in W'(K)$. Take $p \in
U_m(K)$ so that $pdp^{-1} = \mathrm{diag}[a, -1_s, 1_t]$, $m = r + s + t$, with a real diagonal
matrix $a$ of size $r$ whose diagonal entries are different from $\pm 1$. Then $pc(pc)^* =
1_m - pd^2p^{-1} = \mathrm{diag}[1_r - a^2, 0^{m-r}_{m-r}]$. From this we can conclude that

$$pc = \begin{pmatrix} z \\ 0^{m-r}_n \end{pmatrix} \quad\text{with } z \in M^r_n(K).$$

We can then find a real diagonal matrix $b$ of size $r$ such that $b^2 = 1_r - a^2$, and
also a diagonal matrix $\sigma$ of size $r$ such that $\exp(i\sigma) = ib + a$. By Lemma 5 we can
find $q \in U_n(K)$ such that $z = (v\ \ 0^r_{n-r})q$ with $v \in GL_r(K)$. Then $vv^* = zz^* = b^2$.
Put

$$y = p^{-1}\begin{pmatrix} \sigma b^{-1}v & 0^r_s & 0^r_u \\ 0^s_r & \pi 1_s & 0^s_u \\ 0^t_r & 0^t_s & 0^t_u \end{pmatrix} q,$$

where $u = n - r - s$ and $\pi$ is a real number such that $e^{i\pi} = -1$. Then $yy^* =
p^{-1}\mathrm{diag}[\sigma^2, \pi^2 1_s, 0]p$, and a straightforward calculation shows that $c(y) = c$ and
$d(y) = d$, which proves (5.2), and the second equality of (5.1) as well, under the
condition $m \le n$. If $m > n$, this reasoning fails since $r + s$ may be greater than $n$.
To avoid this difficulty, after taking $p$, $a$, $b$, and $\sigma$ for a given $(c\ \ d) \in W'(K)$, we
put $u = p^{-1}\mathrm{diag}[1_r, -1_s, 1_t]p$ and $(e\ \ f) = u(c\ \ d)$. Then $pfp^{-1} = \mathrm{diag}[a, 1_{s+t}]$,
$(pe)(pe)^* = \mathrm{diag}[b^2, 0]$, and

$$pe = \begin{pmatrix} z \\ 0^{m-r}_n \end{pmatrix} \quad\text{with } z \in M^r_n(K).$$

Taking $v$ and $q$ as above, put

$$y = p^{-1}\begin{pmatrix} \sigma b^{-1}v & 0^r_{n-r} \\ 0^{m-r}_r & 0^{m-r}_{n-r} \end{pmatrix} q.$$

Then $yy^* = p^{-1}\mathrm{diag}[\sigma^2, 0]p$, $c(y) = e$, and $d(y) = f$, which proves (5.1) in
general. To prove (5.3), let $\alpha = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in U_{m+n}(K)$. Since $(c\ \ d) \in W(K)$, we have
$(c\ \ d) = u(c(y)\ \ d(y))$ with $u \in U_m(K)$ and $y \in M^m_n(K)$ by (5.1). Put $\beta = E(y)$.
Then $(0\ \ 1_m)\alpha = (0\ \ u)\beta$, and hence we can put

$$\mathrm{diag}[1_n, u^{-1}]\alpha\beta^{-1} = \begin{pmatrix} g & h \\ 0 & 1_m \end{pmatrix}$$

with $g \in M^n_n(K)$ and $h \in M^n_m(K)$. Since the left-hand side belongs to $U_{m+n}(K)$, we see that
$g \in U_n(K)$ and $h = 0$. This proves (5.3), since the opposite inclusion is obvious.

It should be noted that the equality of (5.2) is false if $m > n$. In fact, take
$(c\ \ d) = (0\ \ {-1_m})$. Suppose that $m > n$ and $d = d(y)$ with $y \in M^m_n(K)$. We can
find $p \in U_m(K)$ and a real diagonal matrix $r$ such that $yy^* = pr^2p^{-1}$. Then
$-1_m = p^{-1}d(y)p = d(r)$, and hence $r = \mathrm{diag}[a_1\pi, \ldots, a_m\pi]$ with odd integers $a_\nu$.
This is a contradiction, since $\mathrm{rank}(yy^*) \le n < m$.

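To consider the case of matrices in $S^\varepsilon(K)$, we put

$$Z^\varepsilon(K) = \{(c\ \ d) \in W^\varepsilon(K) \mid c^\rho = -\varepsilon_1 c,\ d^* = d\}$$

and restrict the functions $c$ and $d$ to $S^\varepsilon(K)$. Clearly, for $s \in S^\varepsilon(K)$ we have

$$c(s)^\rho = -\varepsilon_1 c(s), \quad d(s)^* = d(s), \quad d(s)^\rho = d(s^*),$$

$$c(s)d(s)^\rho = d(s)c(s), \quad c(s)c(s)^* + d(s)^2 = 1_n.$$

From these and (4.4) we see that

$$\{(c(s)\ \ d(s)) \mid s \in S^\varepsilon(K)\} \subset Z^\varepsilon(K)
= \{(c\ \ d) \in M^n_{2n}(K) \mid c^\rho = -\varepsilon_1 c,\ d^* = d,\ cd^\rho = dc,\ cc^* + dd^* = 1_n\}, \tag{5.4}$$

$$E(S^\varepsilon(K)) \subset \{\alpha \in C^\varepsilon(K) \mid \alpha^*\Lambda = \Lambda\alpha\}, \qquad \Lambda = \mathrm{diag}[1_n, -1_n]. \tag{5.5}$$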
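Theorem 7. Every element $x$ of $S^\varepsilon(K)$ can be written $x = d(s)^{-1}c(s)$ with an element
$s \in S^\varepsilon(K)$ such that $d(s)$ is invertible. Moreover, excluding the two cases in which
$(\varepsilon, K) = ((+,+), \mathbf{R})$ or $((+,-), \mathbf{C})$, we have

$$W^\varepsilon(K) = U_n(K)Z^\varepsilon(K), \tag{5.6}$$

$$Z^\varepsilon(K) = \{(c(s)\ \ d(s)) \mid s \in S^\varepsilon(K)\}, \tag{5.7}$$

$$E(S^\varepsilon(K)) = \{\alpha \in C^\varepsilon(K) \mid \alpha^*\Lambda = \Lambda\alpha\}, \tag{5.8}$$

$$C^\varepsilon(K) = \{\mathrm{diag}[u', u] \mid u \in U_n(K)\}E(S^\varepsilon(K)), \tag{5.9}$$

where $u' = \bar u$ if $(\varepsilon, K) = ((-,-), \mathbf{C})$ and $u' = u$ otherwise.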


Proof: The first assertion follows from Theorem 4(2), (5.6), and (5.7) except in the
two exceptional cases, which will be treated at the end of the proof. To prove (5.7),
let $(c\ \ d) \in Z^\varepsilon(K)$. We repeat the proof of Theorem 6 with some modifications.
Taking $p$ and $a$ as in that proof, we put $g = pcp^\rho$ and $h = pdp^{-1}$. Then
$g^\rho = -\varepsilon_1 g$, $gg^* = \mathrm{diag}[1 - a^2, 0]$, and $gh^\rho = hg$. From this we can conclude
that $g = \mathrm{diag}[v, 0]$ with $v = -\varepsilon_1 v^\rho \in M^r_r(K)$, $va = av$. We may assume that
$a = \mathrm{diag}[a_1 1_{r_1}, \ldots, a_k 1_{r_k}]$ with $r = r_1 + \cdots + r_k$ and the $a_\nu$ which are all
different. Then $v = \mathrm{diag}[v_1, \ldots, v_k]$ with $v_\nu = -\varepsilon_1 v_\nu^\rho \in M^{r_\nu}_{r_\nu}(K)$. Put
$b = \mathrm{diag}[b_1 1_{r_1}, \ldots, b_k 1_{r_k}]$ and $\sigma = \mathrm{diag}[\sigma_1 1_{r_1}, \ldots, \sigma_k 1_{r_k}]$ with real numbers $b_\nu$ and $\sigma_\nu$
such that $b_\nu^2 = 1 - a_\nu^2$ and $\exp(i\sigma_\nu) = ib_\nu + a_\nu$. If $\varepsilon_1 = -$, put
$y = p^{-1}\mathrm{diag}[\sigma b^{-1}v, \pi 1_s, 0](p^\rho)^{-1}$. Then $y \in S^\varepsilon(K)$, $c = c(y)$, and $d = d(y)$. If
$\varepsilon = (+,+)$ and $K \ne \mathbf{R}$, then we get the same conclusion with
$y = p^{-1}\mathrm{diag}[\sigma b^{-1}v, i\pi 1_s, 0]p$. This combined with (5.4) proves (5.7). Then (5.8)
follows immediately from (5.7) and (4.4). To prove (5.9), given $\alpha \in C^\varepsilon(K)$, put
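$\beta = \Lambda\alpha^*\Lambda\alpha$. Then $\beta \in C^\varepsilon(K)$ and $\beta^* = \Lambda\beta\Lambda$, so that $\beta = E(y)$ with $y \in S^\varepsilon(K)$ by
(5.8). Put $\gamma = E(y/2)$. Then $\Lambda\alpha^{-1}\Lambda\alpha = \beta = \gamma^2 = \Lambda\gamma^{-1}\Lambda\gamma$, and hence
$\Lambda\gamma\alpha^{-1}\Lambda = \gamma\alpha^{-1}$. This shows that $\gamma\alpha^{-1}$ commutes with $\Lambda$, which implies that
$\gamma\alpha^{-1} = \mathrm{diag}[w, z]$ with $w, z \in U_n(K)$. Since $\gamma\alpha^{-1} \in C^\varepsilon(K)$, we see that $w = z'$. This
proves (5.9). Finally, (5.6) follows from Lemma 6(2), (5.9), and (5.4).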

In the exceptional cases we proceed as follows: Put $u = p^{-1}\mathrm{diag}[1_r, -1_s, 1_t]p$
and $(e\ \ f) = u(c\ \ d)$. Then $pep^\rho = g$, $pfp^{-1} = \mathrm{diag}[a, 1_{s+t}]$. Putting
$y = p^{-1}\mathrm{diag}[\sigma b^{-1}v, 0](p^\rho)^{-1}$, we obtain $y \in S^\varepsilon(K)$, $e = c(y)$, and $f = d(y)$. Thus, in
the exceptional cases we have

$$Z^\varepsilon(K) \subset U_n(K)\{(c(s)\ \ d(s)) \mid s \in S^\varepsilon(K)\}. \tag{5.10}$$

Let us now prove the first assertion in those cases. Given $x \in S^\varepsilon(K)$, let
$d = (1 + xx^*)^{-1/2}$ and $c = dx$. As noted in the proof of Theorem 2 (cf. also Lemma 4),
$(c\ \ d) \in W(K)$, $d^* = d$, and $x = d^{-1}c$. Since ${}^tx = -x$, we have $\bar d = (1 + x^*x)^{-1/2}$.
If we assume that $dx = x\bar d$, then ${}^tc = -c$ and $c\cdot{}^td = dc$, and hence
$(c\ \ d) \in Z^\varepsilon(K)$. By (5.10) we have $(c\ \ d) = a(c(s)\ \ d(s))$ with $a \in U_n(K)$ and
$s \in S^\varepsilon(K)$, which proves the desired assertion. The assumed equality $dx = x\bar d$ can be
proved as follows. Put $f = d^{-1}$ and $g = \bar d^{-1}$. Then $f^2x = (1 + xx^*)x = x(1 + x^*x)
= xg^2$. Suppose $gv = \lambda v$ with $0 \ne v \in M^n_1(K)$ with $\lambda \in \mathbf{R}$. Then $\lambda > 0$ and
$f^2xv = \lambda^2xv$, and hence $fxv = \lambda xv = xgv$. Since $M^n_1(K)$ is spanned by all such $v$'s,
we obtain $fx = xg$, and so $dx = x\bar d$. This completes the proof.

Let us note that (5.3) and (5.9) are special cases of a general principle

$$G = K\cdot\exp(\mathfrak{p}). \tag{5.11}$$

Here $G$ is a connected Lie group, $K$ is a compact subgroup of $G$, and $\mathfrak{p}$ is the
subset of the Lie algebra $\mathfrak{g}$ of $G$ defined by

$$\mathfrak{p} = \{X \in \mathfrak{g} \mid d\theta X = -X\},$$

where $\theta$ is an analytic automorphism of $G$ of order 2 such that $K$ is a subgroup of
$K_\theta = \{g \in G \mid \theta g = g\}$ containing the identity component of $K_\theta$. The explicit form
of $\theta$ in our cases is: $\theta g = \Lambda g\Lambda^{-1}$, $\Lambda = \mathrm{diag}[1_n, -1_m]$ with $m = n$ for (5.9).
Relation (5.11) is usually stated only for noncompact groups, but actually it is true
in general for the following reason. Since $G/K$ is a complete Riemannian
manifold, any point on it can be connected to the origin by a geodesic. Now any
geodesic passing through the origin is the image of $\{\exp(tX) \mid t \in \mathbf{R}\}$, $X \in \mathfrak{p}$, under
the natural map $G \to G/K$ (see [1, pp. 208-9]), which proves (5.11).

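In the exceptional cases $C^\varepsilon(K)$ is not connected. So let us denote by $C_0^\varepsilon(K)$ the
identity component of $C^\varepsilon(K)$, and put

$$W_0^\varepsilon(K) = (0\ \ 1_n)C_0^\varepsilon(K), \qquad Z_0^\varepsilon(K) = \{(c(s)\ \ d(s)) \mid s \in S^\varepsilon(K)\}. \tag{5.12}$$

By Lemma 6(2) $W_0^\varepsilon(K)$ is one of the connected components of $W^\varepsilon(K)$.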


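To see the nature of our group $C^\varepsilon(K)$ more clearly, let us use the traditional
notation by putting

$$O(n) = U_n(\mathbf{R}), \qquad SO(n) = \{\alpha \in O(n) \mid \det(\alpha) = 1\},$$

$$U(n) = U_n(\mathbf{C}), \qquad SU(n) = \{\alpha \in U(n) \mid \det(\alpha) = 1\}.$$

If $(\varepsilon, K) = ((+,-), \mathbf{C})$, it is easy to see that the map $\alpha \mapsto \xi\alpha\xi^{-1}$ with

$$\xi = 2^{-1/2}\begin{pmatrix} 1_n & i1_n \\ i1_n & 1_n \end{pmatrix}$$

gives an isomorphism of $C^{(+,-)}(\mathbf{C})$ onto $O(2n)$. Therefore $C_0^{(+,-)}(\mathbf{C}) = SU(2n) \cap
C^{(+,-)}(\mathbf{C}) = \xi^{-1}SO(2n)\xi$.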
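In the case $(\varepsilon, K) = ((+,+), \mathbf{R})$, equality (4.4) together with a simple relation

$$\frac{1}{2}\begin{pmatrix} 1_n & 1_n \\ 1_n & -1_n \end{pmatrix}\begin{pmatrix} d & c \\ c & d \end{pmatrix}\begin{pmatrix} 1_n & 1_n \\ 1_n & -1_n \end{pmatrix} = \begin{pmatrix} d + c & 0 \\ 0 & d - c \end{pmatrix}$$

shows that

$$\begin{pmatrix} d & c \\ c & d \end{pmatrix} \mapsto (d + c,\ d - c)$$

gives an isomorphism of $C^{(+,+)}(\mathbf{R})$ onto $O(n) \times O(n)$, and $\det\begin{pmatrix} d & c \\ c & d \end{pmatrix} = \det[(d + c)(d - c)]$.
Therefore

$$C_0^{(+,+)}(\mathbf{R}) = \left\{ \begin{pmatrix} d & c \\ c & d \end{pmatrix} \in C^{(+,+)}(\mathbf{R}) \;\middle|\; \det(d + c) = \det(d - c) = 1 \right\}.$$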
Theorem 8. Suppose that $(\varepsilon, K) = ((+,+), \mathbf{R})$ or $((+,-), \mathbf{C})$. Then

$$W_0^\varepsilon(K) = U_n'(K)Z_0^\varepsilon(K), \tag{5.13}$$

$$C_0^\varepsilon(K) = \{\mathrm{diag}[\bar a, a] \mid a \in U_n'(K)\}E(S^\varepsilon(K)), \tag{5.14}$$

$$Z_0^\varepsilon(K) = \{(c\ \ d) \in Z^\varepsilon(K) \mid d \in S_n'\}, \tag{5.15}$$

where $U_n'(K)$ denotes the identity component of $U_n(K)$, and $S_n'$ the subset of $S_n(\mathbf{C})$
consisting of the elements with $-1$ as an eigenvalue with even multiplicity (including
the case of zero multiplicity). Moreover, $Z^\varepsilon(K)$ has two connected components, and
the component containing $(0\ \ 1_n)$ is $Z_0^\varepsilon(K)$.


Proof: We have $S^\varepsilon(K) = \{-{}^ty = y \in M^n_n(K)\}$ in the present cases. From our
definition of $C^\varepsilon(K)$ we easily see that its Lie algebra is

$$\left\{ \begin{pmatrix} \bar x & -y^* \\ y & x \end{pmatrix} \;\middle|\; -x^* = x \in M^n_n(K),\ y \in S^\varepsilon(K) \right\}.$$

Therefore (5.14) follows immediately from (5.11). Then (5.13) follows from (5.12)
and (5.14). To prove (5.15), let $w \in S^\varepsilon(K)$. Take $q \in U_n(K)$ so that $qww^*q^* =
\mathrm{diag}[a_1^2, \ldots, a_n^2]$ with $0 \le a_\nu \in \mathbf{R}$. Then $qd(w)q^* = \mathrm{diag}[\cos a_1, \ldots, \cos a_n]$. By
Lemma 7 below we easily see that $d(w) \in S_n'$, and therefore the left-hand side of
(5.15) is contained in the right-hand side. Conversely, suppose $(c\ \ d) \in Z^\varepsilon(K)$ and
$d \in S_n'$. Take $p \in U_n(K)$ so that $pdp^{-1} = \mathrm{diag}[a, -1_s, 1_t]$ as in the proof of
Theorem 6. Then $s$ is even. Put

$$h = p^{-1}\,\mathrm{diag}\left[0_r,\ \pi\begin{pmatrix} 0 & -1_{s/2} \\ 1_{s/2} & 0 \end{pmatrix},\ 0_t\right](p^\rho)^{-1}$$

and $\omega = \mathrm{diag}[\bar p, p]$. Then $\omega E(h)\omega^{-1} = \mathrm{diag}[\delta, \delta]$ with $\delta = \mathrm{diag}[1_r, -1_s, 1_t]$. Take
$y = p^{-1}\mathrm{diag}[\sigma b^{-1}v, 0](p^\rho)^{-1}$ as in the last part of the proof of Theorem 7. Since
$\begin{pmatrix} 0 & -h^* \\ h & 0 \end{pmatrix}$ commutes with $\begin{pmatrix} 0 & -y^* \\ y & 0 \end{pmatrix}$, we have $E(h + y) = E(h)E(y)$. Now $E(h) =
\mathrm{diag}[\bar u, u]$ with $u = p^{-1}\delta p$, and hence $(c(h + y), d(h + y)) = (uc(y), ud(y)) =
(c, d)$. This completes the proof of (5.15). Now $Z_0^\varepsilon(K)$, being a continuous image of
a connected set $S^\varepsilon(K)$, is connected. Suppose $(c\ \ d) \in Z^\varepsilon(K)$ and $d \notin S_n'$. Take
$x \in U_n'(K)$ so that $xdx^{-1} = \mathrm{diag}[-1_s, w]$ with a real diagonal matrix $w$ whose
eigenvalues are different from $-1$. Then $s$ is odd and $xc\cdot{}^tx = \mathrm{diag}[0^s_s, z]$ with
$z \in M^{n-s}_{n-s}(K)$ for the same reason as in the proof of Theorems 6 and 7. This shows
that $(c\ \ d)$ belongs to the set

$$\{(x^{-1}\mathrm{diag}[0, e]\bar x,\ x^{-1}\mathrm{diag}[-1, f]x) \mid x \in U_n'(K),\ (e\ \ f) \in Z_{n-1}\}, \tag{5.16}$$

where $Z_{n-1}$ is the set $Z_0^\varepsilon(K)$ defined with $n - 1$ in place of $n$. Since both $U_n'(K)$
and $Z_{n-1}$ are connected, (5.16) is connected. Thus $Z^\varepsilon(K)$ has at most two
connected components. To show that it is not connected, take any $(c\ \ d) \in Z^\varepsilon(K)$.
In the proof of Theorem 7, we obtained an expression $pdp^{-1} =
\mathrm{diag}[a_1 1_{r_1}, \ldots, a_k 1_{r_k}, -1_s, 1_t]$, and also $v_\nu = -\varepsilon_1 v_\nu^\rho \in M^{r_\nu}_{r_\nu}(K)$. Since $v_\nu$ is
invertible, $r_\nu$ must be even. From this we can conclude that the parity of the number of
negative eigenvalues of $d$ is a continuous function on $Z^\varepsilon(K)$. This proves the last
assertion of our theorem, since both $(0, 1_n)$ and $(0, \mathrm{diag}[-1, 1_{n-1}])$ belong to
$Z^\varepsilon(K)$.



Lemma 7. If $w = -{}^tw \in M^n_n(\mathbf{C})$, then the multiplicity of every nonzero eigenvalue
of $ww^*$ is even.

Proof: Take $u \in U_n(\mathbf{C})$ so that $d = uww^*u^*$ is diagonal. Put $z = uw\cdot{}^tu = x + iy$
with $x, y \in M^n_n(\mathbf{R})$. Then ${}^tz = -z$, ${}^tx = -x$, ${}^ty = -y$, and $z\bar z = -d$. Since $d$ is
real, we have $xy = yx$. Thus $x$ and $y$ are commuting normal matrices, and so
simultaneously diagonalizable by an element of $U_n(\mathbf{C})$. Therefore we can find an
element $v$ of $U_n(\mathbf{C})$ such that $vzv^* = \mathrm{diag}[c_1, \ldots, c_n]$ with $c_\nu \in \mathbf{C}$. Then $vdv^* =
\mathrm{diag}[|c_1|^2, \ldots, |c_n|^2]$. Since ${}^tz = -z$, we see that $\{-c_1, \ldots, -c_n\}$ coincides with
$\{c_1, \ldots, c_n\}$ as a whole. Therefore we obtain our lemma.


The decomposition for the group $C^{(-,+)}(K)$ in (5.9) can be expressed in a
somewhat different way. We first define a ring-injection $\kappa: M^n_n(\mathbf{H}) \to M^{2n}_{2n}(\mathbf{C})$ by


$$\kappa(u + vj) = \begin{pmatrix} u & v \\ -\bar v & \bar u \end{pmatrix} \quad\text{for } u, v \in M^n_n(\mathbf{C}),$$


where j is one of the standard quaternion units. Now it can easily be verified that 


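$$\begin{pmatrix} d & -c \\ c & d \end{pmatrix} \mapsto \begin{cases} ic + d & (K = \mathbf{R}), \\ (ic + d,\ -ic + d) & (K = \mathbf{C}), \\ i\kappa(c) + \kappa(d) & (K = \mathbf{H}) \end{cases}$$

gives an isomorphism of $C^{(-,+)}(\mathbf{R})$ onto $U(n)$, $C^{(-,+)}(\mathbf{C})$ onto $U(n) \times U(n)$, and
$C^{(-,+)}(\mathbf{H})$ onto $U(2n)$. Then (5.9) can be given in the forms

$$U(n) = O(n)\exp(i\cdot S_n(\mathbf{R})),$$

$$U(n) \times U(n) = \{(u, u) \mid u \in U(n)\}\{(u, u^{-1}) \mid u \in U(n)\},$$

$$U(2n) = \kappa(U_n(\mathbf{H}))\exp(i\cdot\kappa(S_n(\mathbf{H}))).$$
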
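
The ring-injection property of $\kappa$ can be checked by hand for $n = 1$. The sketch below is an illustration under an explicit assumption: it takes $\kappa(u + vj)$ to be the matrix with rows $(u, v)$ and $(-\bar v, \bar u)$, together with the usual rule $jz = \bar z j$ for complex $z$; other sign conventions for $\kappa$ exist, and the function names are ours.

```python
def kappa(u, v):
    """2x2 complex matrix assumed to represent the quaternion u + v*j."""
    return [[u, v], [-v.conjugate(), u.conjugate()]]


def qmul(p, q):
    """Product (u + v j)(u2 + v2 j), using j*z = conj(z)*j and j*j = -1."""
    (u, v), (u2, v2) = p, q
    return (u * u2 - v * v2.conjugate(), u * v2 + v * u2.conjugate())


def matmul2(A, B):
    """Product of two 2x2 matrices given as lists of rows."""
    return [[A[i][0] * B[0][j] + A[i][1] * B[1][j] for j in range(2)]
            for i in range(2)]


p = (complex(1, 2), complex(-3, 1))
q = (complex(0, -1), complex(2, 5))
lhs = kappa(*qmul(p, q))                  # kappa of the quaternion product
rhs = matmul2(kappa(*p), kappa(*q))       # product of the two images
```

Equality of `lhs` and `rhs` is the multiplicativity of $\kappa$ on this pair; additivity is clear from the formula.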
6. THINGS THAT ARE HYPERBOLIC. As a natural variation on our theme, we
can take the group

$$\mathcal{H}_{n,m}(K) = \{X \in GL_{n+m}(K) \mid XI_{n,m}X^* = I_{n,m}\}, \qquad I_{n,m} = \mathrm{diag}[1_n, -1_m],$$

in place of $U_{m+n}(K)$, and can still prove corresponding theorems. In this case the
matrices whose fractional expression is the question must be restricted to the ball

$$B^m_n(K) = \{y \in M^m_n(K) \mid yy^* < 1_m\}.$$

We put also

$$X(K) = X_{m,n}(K) = \{(f\ \ g) \in M^m_n(K) \times M^m_m(K) \mid gg^* - ff^* = 1_m\},$$

$$X'(K) = \{(f\ \ g) \in X(K) \mid g^* = g > 0\},$$

$$X^\varepsilon(K) = X_{n,n}(K) \cap V^\varepsilon(K), \qquad H^\varepsilon(K) = G^\varepsilon(K) \cap \mathcal{H}_{n,n}(K),$$

$$Y^\varepsilon(K) = \{(f\ \ g) \in X^\varepsilon(K) \mid f^\rho = -\varepsilon_1 f,\ g^* = g > 0\},$$


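$$\mathcal{O}_{n,m}(K) = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in GL_{n+m}(K) \;\middle|\; cc^* < dd^* \right\},$$

where $c \in M^m_n(K)$. As an analogue of (4.4) we have

$$H^\varepsilon(K) = \left\{ \begin{pmatrix} d' & -\varepsilon_1 c' \\ c & d \end{pmatrix} \;\middle|\; (c\ \ d) \in X^\varepsilon(K) \right\},$$

where $(d'\ \ c')$ is the same as in (4.4).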
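Theorem 9. (1) $\mathcal{O}_{n,m}(K) = P_{n,m}(K)\mathcal{H}_{n,m}(K)$.

(2) $\mathcal{O}_{n,n}(K) \cap G^\varepsilon(K) = P^\varepsilon(K)H^\varepsilon(K)$.
(3) $(f\ \ g) \mapsto g^{-1}f$ gives a one-to-one map of $U_m(K) \backslash X(K)$ onto $B^m_n(K)$.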
These assertions correspond to Theorem 3(2), Theorem 5(2), and Theorem 2,
and can be proved in a straightforward way.
Let us now define $f(x)$, $g(x)$, and $F(x)$ for $x \in M^m_n(K)$ by

$$f(x) = \sum_{\nu=0}^{\infty} (xx^*)^\nu x/(2\nu+1)!, \qquad g(x) = \sum_{\nu=0}^{\infty} (xx^*)^\nu/(2\nu)!,$$

$$F(x) = \exp\begin{pmatrix} 0^n_n & x^* \\ x & 0^m_m \end{pmatrix} \qquad (x \in M^m_n(K)).$$

Then we have

$$f(x)^* = f(x^*), \quad g(x)^* = g(x), \quad\text{and}\quad F(x) = \begin{pmatrix} g(x^*) & f(x^*) \\ f(x) & g(x) \end{pmatrix}.$$
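Theorem 10. The map $x \mapsto g(x)^{-1}f(x)$ gives a one-to-one map of $M^m_n(K)$ onto
$B^m_n(K)$. Moreover we have

$$X(K) = U_m(K)X'(K), \qquad X'(K) = \{(f(x)\ \ g(x)) \mid x \in M^m_n(K)\},$$

$$\mathcal{H}_{n,m}(K) = (U_n(K) \times U_m(K))F(M^m_n(K)).$$

Theorem 11. The map $s \mapsto g(s)^{-1}f(s)$ gives a one-to-one map of $S^\varepsilon(K)$ onto
$B^n_n(K) \cap S^\varepsilon(K)$. Moreover we have

$$X^\varepsilon(K) = U_n(K)Y^\varepsilon(K), \qquad Y^\varepsilon(K) = \{(f(s)\ \ g(s)) \mid s \in S^\varepsilon(K)\},$$

$$F(S^\varepsilon(K)) = \{\alpha \in H^\varepsilon(K) \mid \alpha^* = \alpha\},$$

$$H^\varepsilon(K) = \{\mathrm{diag}[u', u] \mid u \in U_n(K)\}F(S^\varepsilon(K)),$$

where $u' = \bar u$ if $(\varepsilon, K) = ((\pm,-), \mathbf{C})$ and $u' = u$ otherwise.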


All these, except the injectivity of the maps in both theorems, can be proved by
the same technique as for the corresponding facts in Section 5. In fact the present
case is much easier. To see the injectivity, let $y = g^{-1}f = d^{-1}c$ with $(f\ \ g)$ and
$(c\ \ d)$ in $X'(K)$. Then $d = hg$ with $h \in U_m(K)$, $d^2 = g^2$, and hence $d = g$ and $h = 1$.
Suppose $f = f(x)$ and $g = g(x)$ with $x \in M^m_n(K)$. Put $T = \begin{pmatrix} 0 & x^* \\ x & 0 \end{pmatrix}$. Then $T^* = T$
and

$$F(x) = \exp(T) = \begin{pmatrix} k & f^* \\ f & g \end{pmatrix} \quad\text{with } k = (1_n + f^*f)^{1/2}.$$

Now it is well known (and easy to prove) that $z \mapsto \exp(z)$ gives a bijection of
$S_n(K)$ onto $\{y \in S_n(K) \mid y > 0\}$. Therefore $x$ is unique for $(f\ \ g)$, which proves the
desired injectivity.
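
For $m = n = 1$ and $K = \mathbf{R}$ the statement of Theorem 10 reduces to a familiar fact: $f$ and $g$ are the hyperbolic sine and cosine, and $x \mapsto g(x)^{-1}f(x)$ is the hyperbolic tangent, a bijection of $\mathbf{R}$ onto the open interval $(-1, 1)$. The sketch below is an illustration only; it evaluates truncated series (40 terms is our choice) and uses `math.atanh` as the inverse map.

```python
import math


def f(x, terms=40):
    """Partial sum of the series for f(x) in the scalar case (sinh)."""
    return sum(x ** (2 * k + 1) / math.factorial(2 * k + 1) for k in range(terms))


def g(x, terms=40):
    """Partial sum of the series for g(x) in the scalar case (cosh)."""
    return sum(x ** (2 * k) / math.factorial(2 * k) for k in range(terms))


def to_ball(x):
    """The map x -> g(x)^{-1} f(x) into the ball B = (-1, 1)."""
    return f(x) / g(x)


def from_ball(y):
    """Inverse of to_ball on (-1, 1)."""
    return math.atanh(y)


samples = [-3.0, -0.5, 0.0, 2.0]
images = [to_ball(x) for x in samples]
```

The relation $g^2 - f^2 = 1$ is the scalar form of the condition $gg^* - ff^* = 1_m$ defining $X(K)$.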
Noether Lasker Primary Decomposition
Revisited 


Barbara L. Osofsky 


There are many theorems which are absolutely basic for the study of particular 
areas of mathematics. Some of these theorems work their way into the standard 
undergraduate curriculum, but many are reserved for specialized graduate work. 
In this note, we present a theorem usually reserved for a first graduate course in 
commutative algebra in a way that undergraduates studying abstract algebra might 
find congenial. The theorem is the Noether-Lasker Primary Decomposition Theo- 
rem, which may be thought of as a way to extend a version of prime factorization 
from the integers to a much larger class of rings, including polynomials in several 
variables over a field. Traditionally, the approach taken to this theorem has been 
arithmetical, i.e. using the multiplicative properties of the ring. Although we refer 
to the arithmetic of the integers, our approach is more in the spirit of vector 
spaces. There are a large number of exercises to help the reader get involved in 
understanding the material. It is not cheating to get help on these exercises. 

We assume that the reader has had a first abstract algebra course, including the 
basic homomorphism theorems for groups and rings, some study of prime factor- 
ization for a Euclidean domain such as the integers or polynomials over a field, 
and at least the definition of a prime ideal in a commutative ring. The other 
necessary concepts from commutative algebra will be given here. The reader 
should also have seen the definition of vector space over a field and worked with 
bases and linear independence in a vector space. That is the background we build 
on. Although we prove a generalized version of the primary decomposition for 
finitely generated modules over a commutative Noetherian ring, including unique- 
ness of primary components if and only if the prime is a minimal associated prime, 
the reader does not have to know what those words mean to understand our main 
theorem. The major idea in the proof is contained in a useful module-theoretic 
definition. After we prove the theorem, we define all the concepts in the Noether- 
Lasker Theorem so we can specialize our theorem to the case of interest in 
algebraic geometry and commutative algebra. 

A (left) module over a ring $R$ with identity 1 is a system $\langle M, +, \cdot\rangle$ that satisfies
the axioms for a vector space with elements of $R$ as scalars written on the left. Any
Abelian group is a module over the ring $\mathbf{Z}$ of integers, where scalar multiplication
of $x$ by a positive integer $n$ means adding $x$ to itself $n$ times. Ring multiplication
induces a module structure on the ring itself. Indeed, if J is any left ideal of the 
ring R, then the quotient group R/J is a module over R under the induced scalar 
multiplication s-(r + J) =sr + J. 

A module homomorphism $\varphi: M \to N$ between two $R$-modules satisfies the
axioms for a linear transformation. To make the notation look more like the 
associative law rather than the commutative law, we will write our functions to 
the right of their arguments, on the side opposite the scalars. Although it does not 
matter in this paper, this convention simplifies things elsewhere when
non-commutative rings are being studied. For example, when writing down the matrix of a
linear transformation $T$ with the matrix on the left and column vectors on the
right, you write down the equations

$$T(e_i) = \sum_j a_{i,j}\,e_j$$

and set the matrix of $T$ equal to the matrix $[a_{i,j}]$. You are forced to transpose the
matrix to have the product (matrix of $T$) × (matrix of $T'$) correspond to the
composition of $T$ and $T'$. Writing functions on the right, so $T \circ T'$ means first $T$
and then $T'$, the equations would read

$$(e_i)T = \sum_j a_{i,j}\,e_j$$

and you would not need the transpose. For general modules, there are no matrices
associated with homomorphisms, but there are definite benefits to avoiding the 
equivalent of that transpose. 

With this convention on sides, the defining property of a module homomorphism
$\varphi: M \to N$ is that $(r\cdot x + s\cdot y)\varphi = r\cdot(x)\varphi + s\cdot(y)\varphi$ for all $x, y \in M$
and $r, s \in R$. The homomorphism $\varphi$ is an isomorphism if it is one-to-one and
onto. Just as in other algebraic structures such as groups, rings, and vector spaces, an
isomorphism is essentially a renaming of the elements of the module $M$.


Definition. A module $M$ is called cyclic provided $M = \{r\cdot x \mid r \in R\}$ for some
$x \in M$.


Exercise 1. Any cyclic module $R\cdot x$ is isomorphic to the quotient module $R/I$, where
$I = \{r \in R \mid r\cdot x = 0\}$.


With that background material listed, we proceed to define the main concept 
we will need in this note. 


Definitions. A non-zero module M is uniform if any two non-zero submodules 
have non-zero intersection. M contains enough uniforms provided every non-zero 
submodule contains a uniform submodule. If U and V are uniform, we say U is 
subisomorphic to V and write U ~ V provided U and V contain non-zero isomor- 
phic submodules. M is called primary if M has enough uniforms and any two 
uniform submodules of M are subisomorphic. 

To illustrate the concept of uniform module, we show that the module Z is a 
uniform module over the integers $\mathbf{Z}$. Let $I$ and $J$ be two non-zero ideals of $\mathbf{Z}$, and
pick a non-zero $m \in I$ and a non-zero $n \in J$. Then the non-zero element $mn \in
I \cap J$, so $I \cap J \ne 0$. Note that $\mathbf{Z}$ or any uniform module must be primary.
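
The argument can be made concrete for principal ideals $I = m\mathbf{Z}$ and $J = n\mathbf{Z}$, whose intersection is generated by the least common multiple of $m$ and $n$. The sketch below is an illustration with a function name of our own (`ideal_intersection`); it confirms that the product $mn$ used in the text lies in that intersection.

```python
from math import gcd


def ideal_intersection(m, n):
    """Positive generator of mZ ∩ nZ for non-zero integers m and n."""
    return abs(m * n) // gcd(m, n)


m, n = 12, 18
k = ideal_intersection(m, n)      # generator of 12Z ∩ 18Z
```

Since every non-zero ideal of $\mathbf{Z}$ is principal, this covers all cases of the argument.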


Exercise 2. Prove ~ is an equivalence relation on the class of all uniform modules. 
Notation. We let [U] denote the equivalence class of the uniform module U. 


Exercise 3. Any commutative domain is uniform as a module over itself. 
Exercise 4. A vector space over a field is uniform if and only if it is one-dimensional.
Any non-zero vector space contains enough uniforms and is primary. 


Exercise 5. A cyclic Z-module, Z/nZ, is uniform if and only if n is 0 or a power p' 
of a prime p. 
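
For $n \ge 2$ the claim of Exercise 5 can be tested by brute force before proving it. The sketch below is an illustration with names of our own; it relies on the standard fact that the submodules of $\mathbf{Z}/n\mathbf{Z}$ are exactly the subgroups $d\mathbf{Z}/n\mathbf{Z}$ for the divisors $d$ of $n$, and it does not cover the case $n = 0$.

```python
def subgroups(n):
    """All subgroups d*Z/nZ of Z/nZ, one for each divisor d of n."""
    return [frozenset(range(0, n, d)) for d in range(1, n + 1) if n % d == 0]


def is_uniform(n):
    """True if any two non-zero submodules of Z/nZ meet non-trivially."""
    nonzero = [S for S in subgroups(n) if len(S) > 1]
    return n > 1 and all(len(A & B) > 1 for A in nonzero for B in nonzero)


def is_prime_power(n):
    """True if n = p**k for a prime p and some k >= 1."""
    if n < 2:
        return False
    p = 2
    while n % p != 0:
        p += 1                    # p ends as the smallest prime factor of n
    while n % p == 0:
        n //= p
    return n == 1


agreement = all(is_uniform(n) == is_prime_power(n) for n in range(2, 61))
```

A numerical check of this kind is of course no substitute for the proof asked for in the exercise.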


Exercise 6. Any Z-module, i.e. any Abelian group, contains enough uniforms. 


Exercise 7. Every module over a ring R contains enough uniforms if and only if every 
non-zero cyclic module R/I contains a uniform submodule. 


There is an old proverb that a picture is worth a thousand words. In mathemat- 
ics notation can also be worth many words. So we introduce some very special 
notation to save some words. 


Notation. Let the non-zero module M contain enough uniforms. For each class 
[U] of uniforms, set 


$$\mathcal{F}_{[U],M} = \{K \subseteq M \mid K \text{ a submodule which contains no submodule in } [U]\}.$$


Thus writing $K \in \mathcal{F}_{[U],M}$ is a short way of saying that $K$ is a submodule of $M$
which contains no uniform submodule subisomorphic to $U$. $\mathcal{F}_{[U],M}$ is never empty
as the zero submodule is always in it.


Exercise 8. Let $M$ contain enough uniforms. Then for any uniform module $U$ and
any $L \in \mathcal{F}_{[U],M}$, there is a maximal element of $\mathcal{F}_{[U],M}$ which contains $L$ (and which
possibly might be 0 or $M$).


Hint. This requires the set theoretic axiom Zorn’s lemma. 


We now have enough preliminaries to state and prove our Theorem A. Exercise
8 shows that the module $N_{[U]}$ in Theorem A must exist. Part (b) of Theorem A
justifies calling the set of all $N_{[U]}$ which are distinct from $M$ the primary
components of 0, and their intersection a primary decomposition of 0 in $M$.


Theorem A. Let the non-zero module $M$ contain enough uniforms. For each class
$[U]$ of uniforms, let $N_{[U]}$ be any maximal element of $\mathcal{F}_{[U],M}$. Then


(a) $\bigcap_{[U]} N_{[U]} = 0$.

(b) If $0 \ne M/N_{[U]}$ then $M/N_{[U]}$ is primary, that is, it contains enough uniforms
and any uniform submodule of $M/N_{[U]}$ is in $[U]$.

(c) $\mathcal{F}_{[U],M}$ has more than one maximal element $\iff$ there is a $K \in \mathcal{F}_{[U],M}$ and a
non-zero homomorphism $\varphi$ from $K$ to $V \subseteq M$ where $V$ is a uniform module in
$[U]$.


Proof: To see (a), let $0 \ne x \in M$. Then $R\cdot x$ contains a uniform module $V$. By
definition, $N_{[V]}$ cannot have a non-zero submodule isomorphic to a submodule of
$V$. Thus $V \cap N_{[V]} = 0$ and $x \notin N_{[V]}$.
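
Part (a) can be seen at work in the $\mathbf{Z}$-module $M = \mathbf{Z}/12\mathbf{Z}$. The sketch below is an illustration with names of our own; it assumes the standard facts that the submodules of $\mathbf{Z}/n\mathbf{Z}$ are the subgroups $d\mathbf{Z}/n\mathbf{Z}$ for $d$ dividing $n$, and that two cyclic groups of orders $a$ and $b$ contain non-zero isomorphic subgroups exactly when $\gcd(a, b) > 1$. It computes the maximal elements of $\mathcal{F}_{[U],M}$ for the two classes of uniform submodules and intersects them.

```python
from math import gcd


def submodules(n):
    """All submodules d*Z/nZ of Z/nZ, one for each divisor d of n."""
    return [frozenset(range(0, n, d)) for d in range(1, n + 1) if n % d == 0]


def is_uniform(S, subs):
    """S is non-zero and any two non-zero submodules of S meet non-trivially."""
    nonzero = [T for T in subs if T <= S and len(T) > 1]
    return len(S) > 1 and all(len(A & B) > 1 for A in nonzero for B in nonzero)


def subisomorphic(U, V):
    """Cyclic groups of orders a, b share a non-zero subgroup iff gcd(a, b) > 1."""
    return gcd(len(U), len(V)) > 1


def maximal_elements(n, U):
    """Maximal submodules of Z/nZ containing no uniform submodule in [U]."""
    subs = submodules(n)
    uniforms = [V for V in subs if is_uniform(V, subs)]
    family = [K for K in subs
              if not any(V <= K and subisomorphic(U, V) for V in uniforms)]
    return [K for K in family if not any(K < L for L in family)]


N2 = maximal_elements(12, frozenset({0, 6}))       # class of Z/2Z
N3 = maximal_elements(12, frozenset({0, 4, 8}))    # class of Z/3Z
intersection = N2[0] & N3[0]
```

Here each family has a single maximal element, the quotients have orders 4 and 3, and the intersection of the two components is the zero submodule, in line with (a) and (b).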


1994] NOETHER LASKER PRIMARY DECOMPOSITION REVISITED 761 


To see (b), we draw a picture of the situation in Figure 1 and then give the 
argument. 


Figure 1 


Let $K$ be any non-zero submodule of $M/N_{[U]}$, and $W$ the largest submodule of $M$
mapping onto $K$ modulo $N_{[U]}$. Since $W$ properly contains $N_{[U]}$, by maximality of
$N_{[U]}$ in $\mathcal{F}_{[U],M}$, $W$ must contain a uniform submodule $V$ subisomorphic to $U$. Look
at the module $V \cap N_{[U]}$. If it is not zero, then it is a uniform module contained in
$N_{[U]}$ and it is subisomorphic to $V$ which is in $[U]$. By definition of $N_{[U]}$ no such
non-zero module exists, so $V \cap N_{[U]} = 0$. Restrict the quotient map $\nu: M \to
M/N_{[U]}$ to $V$ (i.e. look at the left column of Figure 1). This restriction has kernel
$V \cap N_{[U]} = 0$ and so is one-to-one. Then $K$ contains a submodule $(V)\nu$ isomorphic
to $V \in [U]$.

For $\Rightarrow$ in (c), assume that $K_{[U]}$ and $L_{[U]}$ are two distinct maximal elements of
$\mathcal{F}_{[U],M}$. We again draw a picture to aid in following the argument.


Figure 2. [Diagram: top row $H \subseteq L_{[U]} \subseteq M$; bottom row $V \subseteq M/K_{[U]}$; vertical maps induced by $\nu$.]


Since $L_{[U]}$ is not contained in $K_{[U]}$, the natural map $\nu$ from M to $M/K_{[U]}$
induces a non-zero map on $L_{[U]}$. By (b), the image of $\nu$ restricted to $L_{[U]}$ contains
enough uniforms, and all of its uniform submodules are in [U]. Restrict $\nu$ even
further to the preimage H in $L_{[U]}$ of a uniform submodule $V \subseteq (L_{[U]})\nu$ (i.e. look
at the left hand column of Figure 2). Then $\nu$ restricted to H is the non-zero map
required in the $\Rightarrow$ part of (c).

For the $\Leftarrow$ direction of (c) assume there is a non-zero map $\varphi$ from a submodule
$K \in \mathcal{F}_{[U],M}$ to a submodule $V \subseteq M$, $V \in [U]$. Pick $x \in K$ with $0 \neq (x)\varphi$. For the
moment we assume that $r \cdot x = 0 \iff r \cdot (x - (x)\varphi) = 0$.

By a basic homomorphism theorem and our assumption, both $R \cdot x$ and
$R \cdot (x - (x)\varphi)$ are isomorphic to the same quotient module $R/\{r \in R \mid r \cdot x = 0\}$.
Since $K_1 = R \cdot x$ is in $\mathcal{F}_{[U],M}$, so is the isomorphic module $K_2 = R \cdot (x - (x)\varphi)$.
However, the sum $K_1 + K_2$ contains the module $R \cdot (x)\varphi \in [U]$, so no submodule
$L \in \mathcal{F}_{[U],M}$ contains both $K_1$ and $K_2$. By Exercise 8, each $K_i$ is contained in a
maximal element of $\mathcal{F}_{[U],M}$ so there must be more than one maximal element in
$\mathcal{F}_{[U],M}$.

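Now we prove our assumption that $r \cdot x = 0 \iff r \cdot (x - (x)\varphi) = 0$. Since $R \cdot x
\in \mathcal{F}_{[U],M}$ and $R \cdot (x)\varphi \in [U]$, $R \cdot x \cap R \cdot (x)\varphi = 0$. Then $0 = r \cdot (x - (x)\varphi)$
implies $0 = r \cdot x - r \cdot (x)\varphi$ by the distributive law for scalar multiplication. Bringing
$r \cdot (x)\varphi$ over to the other side shows that $r \cdot (x)\varphi = r \cdot x \in R \cdot x \cap R \cdot (x)\varphi
= 0$, so any r with $r \cdot (x - (x)\varphi) = 0$ has $r \cdot x = 0$. Conversely, if $0 = r \cdot x$, then
$0 = (r \cdot x)\varphi = r \cdot (x)\varphi$, so $0 = r \cdot x - r \cdot (x)\varphi = r \cdot (x - (x)\varphi)$. □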
If R = F is a field, there is only one subisomorphism class of uniform modules,
namely the class [F] of the field as a module over itself. In this case $N_{[F]} = 0$ for
any non-zero vector space M, and the theorem does not yield much information.

What does this theorem say about modules over Z? Exercise 5 asked for a proof
that the only cyclic uniform modules over Z are isomorphic either to Z or to $Z/p^{i}Z$
for some prime p. Exercise 9 asks you to specialize the Theorem to an Abelian
group M. Let $\tau(M)$ denote the torsion subgroup of M, that is, the set of all
elements in M of finite order.


Exercise 9. Let M be an Abelian group. Show


(a) For each prime p there exists a submodule $N_p \subseteq M$ such that every element of
$M/N_p$ is of order a power of p. Furthermore,

$$\Bigl(\bigcap_{p\ \mathrm{prime}} N_p\Bigr) \cap \tau(M) = 0. \qquad (*)$$

(b) If $\tau(M) \neq M$ then each $N_p$ contains a subgroup isomorphic to Z. If in
addition $M/N_p \neq 0$, then there is a non-zero map from this subgroup to a
uniform submodule of M isomorphic to $Z/Z \cdot p$.
(c) If $\tau(M) \neq M$, then for any p with $N_p \neq M$, $N_p$ is not a unique submodule
of M.
(d) If $\tau(M) = M$, then all the $N_p$ are unique.
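
For a concrete feel for Exercise 9 in the torsion case (d), here is a minimal sketch, assuming the finite cyclic group M = Z/n. In that special case N_p can be taken to be the set of elements whose order is prime to p. The function names are illustrative and not from the text.

```python
from math import gcd

def prime_divisors(n):
    """Primes dividing n, found by trial division."""
    return [p for p in range(2, n + 1)
            if n % p == 0 and all(p % q for q in range(2, p))]

def order(x, n):
    """Additive order of x in Z/n."""
    return n // gcd(x, n)

def primary_components(n):
    """Map each prime p | n to N_p = {x in Z/n : the order of x is prime to p}."""
    return {p: {x for x in range(n) if order(x, n) % p != 0}
            for p in prime_divisors(n)}

components = primary_components(12)
print(components)
```

For n = 12 the quotient M/N_2 has 4 elements and M/N_3 has 3 elements, and the two components meet only in 0, as (*) requires.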


Exercise 9 forms a bridge to the theorem mentioned in the first paragraph. We 
next define some of the terms used there. 


Definition. A module M is called Noetherian provided every submodule of M is
finitely generated, that is, if N is a submodule of M, then there exists a finite set
$\{a_1, \ldots, a_n\} \subseteq N$ such that $N = \{\sum_{i=1}^{n} r_i \cdot a_i \mid r_i \in R\}$. A ring is Noetherian if it is
Noetherian as a module over itself. M has the maximum condition if for any
non-empty family $\mathcal{F}$ of submodules of M there is a maximal element of $\mathcal{F}$.


Exercise 10. Z is Noetherian and has the maximum condition. 


Fact. The rings that arise in the study of algebraic geometry and algebraic number
theory are commutative Noetherian rings. It is a well known theorem attributed to
Hilbert that the ring of polynomials over a Noetherian ring is again Noetherian. Thus the
ring $F[X_1, \ldots, X_n]$ of polynomials in n variables over a field F is Noetherian. So
is $Z[X_1, \ldots, X_n]$.


Exercise 11. M has the maximum condition $\iff$ M is Noetherian.


Hint. A third condition, called the ascending chain condition (acc), is usually given
as another condition equivalent to Noetherian. It states that for any ascending
chain


$$K_1 \subseteq K_2 \subseteq \cdots \subseteq K_i \subseteq \cdots$$


of submodules of M there must be an l with $K_i = K_l$ for all $i \geq l$. It might be
useful in doing Exercise 11 to show maximum condition $\Rightarrow$ Noetherian $\Rightarrow$ acc $\Rightarrow$
maximum condition.
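
The ascending chain condition can be watched at work in the ring Z, where every ideal is principal. The following is only an illustrative sketch: the chain of ideals (a_1) ⊆ (a_1, a_2) ⊆ ... is tracked through its positive generators, which are running gcds. The sample integers are arbitrary.

```python
from itertools import accumulate
from math import gcd

def ideal_chain(generators):
    """Positive generators g_i of the ideals (a_1, ..., a_i) of Z."""
    return list(accumulate(generators, gcd))

chain = ideal_chain([360, 84, 90, 35, 1000, 7])
print(chain)
```

Each generator divides the one before it, so the chain can grow strictly only as often as 360 has proper divisors to step through; after that it is constant.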
Exercise 12. If a ring R is Noetherian, then every submodule of a finite direct sum of
copies of the R-module R is also finitely generated.


Hint. Use finite induction on the number of summands, the projection onto the
last summand, and the one-to-one correspondence homomorphism theorem.


Exercise 13. If M is a finitely generated module over a Noetherian ring, and F is a 
non-empty family of submodules of M, then there is a maximal element of F. 


What does all this have to do with uniform modules? 


Lemma 1. Let M be a Noetherian module or any module over a Noetherian ring. 
Then M has enough uniforms. 


Proof: Let N be a non-zero submodule of M, $0 \neq x \in N$. By Exercise 11, the
maximum condition is equivalent to Noetherian. We observe that any submodule 
of a module with maximum condition must have maximum condition and any 
quotient of a Noetherian module is Noetherian. Thus either hypothesis on M 
implies R-x has maximum condition. To use this fact, we must find the correct 
non-empty family to have a maximal element and then use that maximality. 

Let 


$\mathcal{F}$ = {K a submodule of $R \cdot x$ | there is a submodule
$L \neq 0$ of $R \cdot x$ with $L \cap K = 0$}.


$\mathcal{F}$ is not empty since $\{0\} \in \mathcal{F}$. Let $K_0$ be a maximal element of $\mathcal{F}$, and let U be
a non-zero submodule of $R \cdot x$ with $U \cap K_0 = 0$. Assume U is not uniform. Then
it contains non-zero submodules H and J such that $H \cap J = 0$. By Exercise 14
below, $H \cap (J + K_0) = 0$ so $J + K_0 \in \mathcal{F}$. Since $J \not\subseteq K_0$, $J + K_0$ properly contains
$K_0$, contradicting the maximality of $K_0$. We conclude that U must be
uniform. □


Exercise 14. Complete the details in the proof of Lemma 1 by showing that if
$H \cap J = 0$ and $(H + J) \cap K_0 = 0$ then $H \cap (J + K_0) = 0$.


Hint. You must use both of the hypotheses. Start with $h = j + k \in H \cap (J + K_0)$
and manipulate to show h = j = k = 0.


We now proceed to do some traditional commutative algebra. We first state the 
Noether-Lasker Primary Decomposition Theorem, which is what motivated this 
note. 


Theorem B [Noether-Lasker]. Let M be a finitely generated module over a commutative
Noetherian ring R. Then there exists a finite set $\{N_i \mid 1 \leq i \leq l\}$ of submodules of
M such that:


(a) $\bigcap_{i=1}^{l} N_i = 0$ and $\bigcap_{i \neq i_0} N_i$ is not contained in $N_{i_0}$ for all $1 \leq i_0 \leq l$.

(b) Each quotient $M/N_i$ is primary for some prime $P_i$.

(c) The $P_i$ are all distinct for $1 \leq i \leq l$.

(d) The primary component $N_i$ is unique $\iff$ $P_i$ does not contain $P_j$ for any $j \neq i$.
This theorem looks somewhat different from Theorem A. There are several
“problems” with it.


The Noether-Lasker Theorem talks about prime ideals, not uniform modules.

The uniqueness portion of Theorem A seems to have nothing to do with
prime ideals being contained in other prime ideals.

In the standard Noether-Lasker Theorem, uniqueness in (d) is asserted for
any $N_i$ satisfying (a), (b) and (c), not only maximal elements of $\mathcal{F}_{[U],M}$.

Theorem A says nothing about finiteness.

The usual definition of the word primary is not the same as our definition. It
is actually a special case of our definition.


In the rest of this paper we will address these “problems”.


Lemma 2. Let M have enough uniforms and let $\{N_i \mid 1 \leq i \leq k < \infty\}$ satisfy


(a) $\bigcap_{i=1}^{k} N_i = 0$.
(b) $M/N_i$ is primary with uniforms all in $[U_i]$.
(c) $[U_i] \neq [U_j]$ if $i \neq j$.


Then each $N_i \in \mathcal{F}_{[U_i],M}$. Moreover, if $\mathcal{F}_{[U_i],M}$ has only one maximal element which
is distinct from M, then $N_i$ is that maximal element.


Proof: Let U be a uniform submodule of M with $U \in [U_i]$. Then for $j \neq i$, U
cannot embed in $M/N_j$. Thus $U \cap N_j \neq 0$ for $j \neq i$. Since U is uniform, by finite
induction $K_i = \bigcap_{j \neq i} (U \cap N_j) \neq 0$. But $K_i \cap (U \cap N_i)$ is equal to 0, so by the
uniformity of U, $U \cap N_i = 0$. Thus $N_i \in \mathcal{F}_{[U_i],M}$.

If $\mathcal{F}_{[U_i],M}$ contains a unique maximum element $L_{[U_i]}$ which is not M, then M
contains a uniform $U \in [U_i]$ and $L_{[U_i]}/N_i$, which is $[U_i]$-primary, cannot have a
non-zero map from a submodule to U. Thus $N_i$ must equal $L_{[U_i]}$. □


Note that in Lemma 2, if $N_i$ does not contain $\bigcap_{j \neq i} N_j$, then $M \notin \mathcal{F}_{[U_i],M}$. Can
you show that Lemma 2 need not be true if the number of $N_i$ is not finite?


Hint. Look at the Abelian group $Z \oplus Z \oplus Q/Z$.


What do uniform modules over a commutative Noetherian ring R look like? If
P is a prime ideal of R, then R/P is a domain and hence a uniform R/P-module.
Since multiplication by an element $r \in R$ gives the same result as multiplication by
$r + P \in R/P$, R/P is also a uniform R-module. We next show that every uniform
R-module is in the equivalence class [R/P] for precisely one prime ideal P.


Notation. Let M be any R-module. For x € M we let (0: x) denote the annihila- 
tor of x in R, that is, 


$$(0 : x) = \{r \in R \mid r \cdot x = 0\}.$$


Lemma 3. If R is commutative, then any ideal P maximal in $\{(0 : x) \mid 0 \neq x \in M\}$ is
prime.


Proof: We first note that P cannot equal R since $1 \notin P$. Let $0 \neq x_0 \in M$ have
$P = (0 : x_0)$. If $ab \in P$ and $b \notin P$, then $0 \neq b \cdot x_0$, and $(0 : b \cdot x_0) \supseteq R \cdot a + P$. By
maximality of P, $P = R \cdot a + P$, so $a \in P$. Thus P is prime. □
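
Lemma 3 can be checked by brute force in a small case. The sketch below assumes R = Z and M = Z/n, where (0 : x) is the ideal generated by the additive order of x, and where the ideal (a) contains (b) exactly when a divides b. The helper names are illustrative.

```python
from math import gcd

def annihilator_generator(x, n):
    """Positive generator of (0 : x) for x in Z/n; it equals the order of x."""
    return n // gcd(x, n)

def maximal_annihilators(n):
    """Generators of the ideals maximal among (0 : x), 0 != x in Z/n."""
    gens = {annihilator_generator(x, n) for x in range(1, n)}
    # (g) is maximal when no other generator h in the set divides g.
    return sorted(g for g in gens
                  if not any(h != g and g % h == 0 for h in gens))

print(maximal_annihilators(12))
```

In each case tried, the maximal annihilators are generated by primes, as the lemma predicts.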
Lemma 4. Let U be a uniform module over a commutative Noetherian ring R. Then
U contains a submodule isomorphic to R/P for precisely one prime ideal P.


Proof: By Exercise 11, $\{(0 : x) \mid 0 \neq x \in U\}$ has a maximal element. By Lemma 3, a
maximal element in this set of annihilators of elements of U is prime. If $P = (0 : x_0)$
is such a maximal annihilator, then U contains the submodule $R \cdot x_0$ which is
isomorphic to R/P.

We observe that every non-zero cyclic submodule of R/P, for R commutative
and P a prime ideal of R, is isomorphic to R/P. If I is any ideal distinct from P
then R/I is not isomorphic to R/P. Hence if U and V are uniform R-modules
containing submodules isomorphic to R/P and R/Q respectively where P and Q
are distinct primes, then [U] ≠ [V]. Thus any uniform module contains a submodule
isomorphic to R/P for one and only one prime P. □


We next look at the uniqueness portion of the Noether-Lasker Theorem. 


Definition. If R is a commutative Noetherian ring and P is a prime ideal of R,
then P is associated with the module M provided P = (0 : x) for some $x \in M$. We
will call M P-primary if the prime ideal P is associated with M and no other prime
is.


Exercise 15. If R is a commutative Noetherian ring and P is a prime ideal of R, then
an R-module M is P-primary in the above sense if and only if M has enough uniforms
and every uniform submodule of M is in [R/P].


Lemma 5. Let R be commutative Noetherian, and let M have the set of associated
prime ideals $\{P_i \mid i \in \mathcal{I}\}$. Let $x \in M$ and $p \in \bigcap_{i \in \mathcal{I}} P_i$. Then there is a nonnegative
integer n such that $p^n \cdot x = 0$.


Proof: Let $I = (0 : p^n \cdot x)$ be a maximal element of $\{(0 : p^i \cdot x) \mid i \in N\}$. If $p^n \cdot x \neq
0$, by Lemma 4 some submodule of $R p^n \cdot x$, say $R s p^n \cdot x$, is isomorphic to $R/P_i$ for
some $i \in \mathcal{I}$. Then $p^n s \cdot x = s p^n \cdot x \neq 0$, but since $p \in P_i$ we have $p \cdot (p^n s \cdot x) =
p p^n s \cdot x = 0$. Then $(0 : p^{n+1} \cdot x) \supseteq I + Rs$ which properly contains I, contradicting
the maximality of I. Hence our assumption that $p^n \cdot x \neq 0$ must be wrong, i.e.
$p^n \cdot x = 0$. □


Corollary 6. For R commutative Noetherian, let the set of primes $\{P_1, \ldots, P_k\}$
associated with M be finite, and let Q be any prime which does not contain any of the
$P_i$. Then the only homomorphism from M to R/Q is the zero homomorphism.


Proof: For each i with $1 \leq i \leq k$ let $p_i \in P_i$, $p_i \notin Q$. Then $s = \prod_{i=1}^{k} p_i \notin Q$. Let
$\varphi$ be a homomorphism from M to R/Q, $x \in M$. By Lemma 5 there is an n such
that $s^n \cdot x = 0$. Then $0 = (0)\varphi = (s^n \cdot x)\varphi = s^n \cdot ((x)\varphi)$. But Q is a prime ideal
and $s^n \notin Q$, so $s^n \cdot ((x)\varphi) = (0 + Q)$ in R/Q implies $(x)\varphi = (0 + Q)$ in R/Q. □


The next property we must examine to show that our theorem reduces to the 
standard primary decomposition of a finitely generated module over a commuta- 
tive Noetherian ring is the finiteness of the set of associated primes. 

We first give a definition which generalizes the concept of linear independence
for vector spaces. A set $\mathcal{B}$ of elements in a vector space is called linearly
independent provided the condition $\sum r_i \cdot b_i = 0$, where the $b_i$ are distinct elements
of $\mathcal{B}$, implies that each $r_i = 0$. By convention, the empty set $\emptyset$ is considered
independent and indeed a basis for the 0 vector space.


Definition. A family $\mathcal{B} = \{U_i \subseteq M\}$ of submodules of M is called independent if,
whenever you have $\sum u_i = 0$ with the $u_i$ from distinct $U_i$, then each $u_i = 0$. The
empty set of submodules is considered independent.


Exercise 16. Let M be a module. Then there exists a maximal (possibly empty) 
independent set of uniform submodules of M. 


Hint. This uses Zorn’s Lemma. This is precisely the way in which one proves that 
any vector space has a basis even if it is not finite-dimensional. 


Lemma 7. Let M be a non-zero finitely generated module over a commutative 
Noetherian ring R. Then there are only a finite number of primes associated with M. 


Proof: By Exercise 16, there is a maximal independent set $\mathcal{B} = \{U_i \mid i \in \mathcal{I}\}$ of
uniform submodules of M. By Lemma 1, which says that M contains enough
uniforms, $\mathcal{B}$ is not empty. Set $K = \sum_{i \in \mathcal{I}} U_i$. Since K is finitely generated, some
finite subsum of $\sum_i U_i$ must contain all of the generators. Then K must equal that
finite subsum, say $K = \sum_{i=1}^{l} U_i$. By the independence of $\mathcal{B}$, no other $U_j$ can have a
non-zero element in common with $\sum_{i=1}^{l} U_i$, so $\mathcal{B} = \{U_i \mid 1 \leq i \leq l\}$ is a finite set.
Each $U_i$ has a single associated prime $P_i$. Let $Q = (0 : x_0)$ be an associated
prime for M. Then $R \cdot x_0$ is a uniform module. Just as we have for vector spaces,


either $R \cdot x_0 \cap \sum_{i=1}^{l} U_i \neq 0$ or $\{R \cdot x_0, U_1, \ldots, U_l\}$ is independent.


By maximality of $\mathcal{B}$, $R \cdot x_0 \cap K \neq 0$. Let $0 \neq k = \sum_{i=1}^{l} u_i \in R \cdot x_0 \cap K$. Then
$Q = (0 : k) = \bigcap_{i=1}^{l} (0 : u_i) = \bigcap_{u_j \neq 0} P_j$. This tells us that $Q \subseteq P_j$ for each j with
$u_j \neq 0$. If, for each non-zero $u_j$, there is an element $p_j \in P_j$, $p_j \notin Q$, then $\prod_{u_j \neq 0} p_j$
must belong to Q so one of the $p_j$ must be in Q, a contradiction. We conclude that
$Q \supseteq P_j$ for some such j. Thus $Q \in \{P_1, \ldots, P_l\}$. □


We now come to our last “problem”. Even our definition of P-primary is not
the traditional definition used in classical commutative algebra texts. For the
record, traditionally M is called primary provided that, whenever $x \in M$ and
$r \in R$ satisfy $r \cdot x = 0$, either x = 0 or there is a non-negative integer n such that
$r^n \cdot M = 0$. For finitely generated modules over commutative Noetherian rings, the
definitions are equivalent. Otherwise ours is more general. The second condition
in Lemma 8 below is this traditional definition of primary. It has a much more
arithmetic flavor than our definition.


Lemma 8. Let M be a non-zero finitely generated module over a commutative
Noetherian ring R. Then M is a P-primary module for some prime ideal P of R if and
only if for all $0 \neq x \in M$ and $r \in R$, if $r \cdot x = 0$, then there exists an $n \in N$ such
that $r^n \cdot y = 0$ for all $y \in M$.


Proof: For the only if direction, assume M is P-primary for some prime P. Let
$0 \neq x \in M$ and $r \in R$ have $r \cdot x = 0$. By Lemma 3, there is an $s \in R$ with
$(0 : s \cdot x)$ a prime ideal. Since M is P-primary, $(0 : s \cdot x)$ must equal P. Since
$r \cdot (s \cdot x) = s \cdot r \cdot x = 0$, $r \in P$. By Lemma 5, some power of each element of P
annihilates every element of M. Since M is finitely generated, $M = \sum R \cdot x_j$ for
some finite set $\{x_j \mid 1 \leq j \leq k\}$. If $r^{n_j} \cdot x_j = 0$, we leave it as an exercise that
$r^{n_1 + \cdots + n_k} \cdot M = 0$.

For the if direction, assume for all $0 \neq x \in M$ and $r \in R$, if $r \cdot x = 0$, then
there exists an $n \in N$ such that $r^n \cdot y = 0$ for all $y \in M$. Set

$$P = \{s \in R \mid \text{there is an } n \in N \text{ with } s^n \cdot M = 0\}.$$

Clearly $0 \in P$ so $P \neq \emptyset$. Equally clearly, $1 \notin P$. Let $s, \hat{s} \in P$, and $r \in R$. If
$s^n \cdot M = 0$ and $\hat{s}^m \cdot M = 0$, then $(s + \hat{s})^{n+m} \cdot M = 0$ and $(rs)^n \cdot M = 0$. Thus P is an
ideal not equal to R. Let U be any uniform submodule of M. Let Q be an
associated prime of M, that is, Q = (0 : x) for some non-zero $x \in M$ and Q is
prime. By hypothesis, every element of Q is in P, that is, $Q \subseteq P$. If $s \in P$, then
some power of s is in (0 : x) = Q which is a prime ideal. Thus $s \in Q$, so $P \subseteq Q$.
We conclude that P = Q is the unique prime associated to M, so M is P-primary. □


To enable the reader to apply the ideas in this paper, we include one last 
exercise. 


Exercise 17 [Final Exam]. Let $R = \mathbb{R}[X, Y, Z]$ be the ring of polynomials in 3
indeterminates over the reals. Let I be the ideal of R generated by {X*, XY, XZ’}, 
that is, 


I = {f_1(X, Y, Z) X° + f_2(X, Y, Z) XY + f_3(X, Y, Z) XZ°
| f_i(X, Y, Z) ∈ ℝ[X, Y, Z]}


Find the primes associated to the module M = R/I, a set of primary components
$N_{[R/P]}$, and at least two different primary components for each prime for which that
is possible.
DeMorgan was explaining to an actuary what was the
chance that a certain proportion of some group of people
would at the end of a given time be alive; and quoted the
actuarial formula, involving π, which, in answer to a
question, he explained stood for the ratio of the circumference
of a circle to its diameter. His acquaintance, who
had so far listened to the explanation with interest,
interrupted him and exclaimed, “My dear friend, that
must be a delusion, what can a circle have to do with the
number of people alive at a given time?”


—Walter William Rouse Ball (1850-1925)


Mathematical Recreations and Problems.
London: 1896, p. 180.
Elementary Infinite Sources of Non-Unique
Factorization Rings


S. Stein and S. Szabó


When students first meet unique-factorization rings, usually Z, Z[i], and F[x], 
where F is a field, they may get the impression that non-unique-factorization rings 
are rare. After all, the text usually presents only one or two examples of such rings. 
Actually, with just a little more effort, we can offer an infinite number of such 
rings. We describe two ways to do this. The first could be presented at the 
beginning of the discussion of unique factorization; the second depends on the fact 
that if R is a unique-factorization ring, then factorizations in R[x] and F[x], 
where F is the field of fractions of R, are the same up to constant factors. 

For our first source of examples let $t \in Z$ be an odd, negative integer, $t \leq -3$.
Then $Z[\sqrt{t}]$ is a non-unique-factorization ring. To show this, consider the product
$(1 + \sqrt{t})(1 - \sqrt{t}) = 1 - t$, which is an even integer greater than or equal to 4.

Let $d(a + b\sqrt{t}) = (a + b\sqrt{t})(a - b\sqrt{t}) = a^2 - tb^2$, the usual norm. Using this
norm, one shows that 2 has no non-trivial factorization in $Z[\sqrt{t}]$. However, it is
clear that 2 divides neither $1 + \sqrt{t}$ nor $1 - \sqrt{t}$ in $Z[\sqrt{t}]$, though it divides their
product. Thus $1 - t$ has at least two different factorizations as products of primes,
so $Z[\sqrt{t}]$ is a non-unique-factorization ring. This method does not work when t is
an odd positive integer, since, for example, $Z[\sqrt{3}]$ and $Z[\sqrt{7}]$ are unique-factorization
rings. (They are some of the rings of algebraic integers that are Euclidean.)
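
The norm argument can be checked numerically. The sketch below represents a + b√t as the pair (a, b); the function names are illustrative. A non-trivial factor of 2 would have norm ±2, since the norm of 2 is 4. For t ≤ -3 the norm is a² + |t|b², so a search over small a and b is exhaustive. For positive t the search only shows where the argument breaks down.

```python
def mul(u, v, t):
    """Product of u = a + b*sqrt(t) and v = c + d*sqrt(t), as coefficient pairs."""
    (a, b), (c, d) = u, v
    return (a * c + t * b * d, a * d + b * c)

def norm(u, t):
    """The norm a^2 - t*b^2 of a + b*sqrt(t)."""
    a, b = u
    return a * a - t * b * b

def small_elements_of_norm(target, t, bound=3):
    """All a + b*sqrt(t) with |a|, |b| <= bound and |norm| == target."""
    rng = range(-bound, bound + 1)
    return [(a, b) for a in rng for b in rng if abs(norm((a, b), t)) == target]

for t in range(-3, -40, -2):
    assert mul((1, 1), (1, -1), t) == (1 - t, 0)   # (1 + sqrt t)(1 - sqrt t) = 1 - t
    assert small_elements_of_norm(2, t) == []      # 2 has no proper factor

print(small_elements_of_norm(2, 3))   # elements of norm +-2 exist when t = 3
print(small_elements_of_norm(2, 7))   # and when t = 7
```

The two printed lists are non-empty, which is consistent with the remark that the method fails for odd positive t such as 3 and 7.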

Our second source of examples depends on the fact that if R is an integral
domain such that R[x] contains a polynomial whose factorization into primes in
R[x] is essentially different from its factorization in F[x], then R is a
non-unique-factorization ring.

Using this observation, we show that if $t \in Z$ is not a square and is of the form
$4k + 1$, $k \in Z$, then $R = Z[\sqrt{t}]$ is a non-unique-factorization ring.

Consider the polynomial $Q(x) = x^2 + x + (1 - t)/4$, which lies in R[x]. Let F
be the field of fractions of R. Q(x) has in F[x] the factorization


$$[x + (1/2 + \sqrt{t}/2)][x + (1/2 - \sqrt{t}/2)].$$


If R were a unique-factorization-ring and Q(x) were reducible in R[x], say
Q(x) = A(x)B(x), then there would be integers a and b such that


$$A(x) = (a + b\sqrt{t})(x + (1/2 + \sqrt{t}/2))$$
and
$$B(x) = [(a - b\sqrt{t})/(a^2 - tb^2)](x + (1/2 - \sqrt{t}/2)).$$


Thus $a^2 - tb^2$ divides both a and b, hence $(a^2 - tb^2)^2$ divides $a^2 - tb^2$, so
$a^2 - tb^2 = \pm 1$, implying that one of a and b is odd, the other even. The constant
term in A(x) is $(a + tb)/2 + (a + b)\sqrt{t}/2$, which shows that a + b is even,
contradicting the fact that a and b have opposite parities. Hence R is a
non-unique-factorization ring.
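
The parity step can be confirmed by a small search. The sketch below, with illustrative names and an arbitrary search bound, lists integer pairs (a, b) with a² - tb² = ±1 for several non-square t of the form 4k + 1 and checks that a + b is always odd, so (a + b)/2 is never an integer.

```python
def unit_norm_pairs(t, bound=60):
    """Integer pairs (a, b) with |a|, |b| <= bound and a^2 - t*b^2 = +-1."""
    rng = range(-bound, bound + 1)
    return [(a, b) for a in rng for b in rng if a * a - t * b * b in (1, -1)]

def doubled_constant_term(a, b, t):
    """(a + b*sqrt(t)) * (1 + sqrt(t)) = (a + t*b) + (a + b)*sqrt(t).

    Returns the pair (a + t*b, a + b); halving it gives the constant term of A(x).
    """
    return (a + t * b, a + b)

for t in (5, 13, 17, 21, -3, -7):
    pairs = unit_norm_pairs(t)
    # Both entries of the doubled constant term are odd for every pair found.
    assert all(doubled_constant_term(a, b, t)[1] % 2 == 1 for a, b in pairs)

print(unit_norm_pairs(5, bound=10))
```

The reason behind the check is a congruence: since t ≡ 1 (mod 4), a² - tb² ≡ a² - b² (mod 4), which is 0 when a and b have the same parity.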


Note that if $e \neq \pm 1$ is an odd integer, then $Z[\sqrt{e^2 d}]$ is properly contained in
$Z[\sqrt{d}]$. Therefore there is an infinite descending chain of non-unique-factorization
rings. (These rings are contained in the ring of algebraic integers in $Q(\sqrt{d})$, which,
for d = 4k + 1, also contains numbers of the form $a + b\sqrt{d}$, where both a and b
are fractions with denominator 2.)

The examples illustrate two general techniques. In the first, a homomorphism is 
used to project R on a smaller structure, Z. In the second, R is injected into a 
larger structure, R[x], by an isomorphism. The second example can also be 
analyzed with the aid of the “rational root” theorem for R. 
Davis, CA 95616-8633
sstein@math.ucdavis.edu

P.O. Box 32038
Isa Town, State of Bahrain
esp5g@isa.cc.uob.bh


PICTURE PUZZLE 
(from the collection of Paul Halmos) 


Are they related? 
(see page 802.) 
the Mathematische Annalen affair, van Stigt quotes Brouwer’s speculation that
Hilbert’s motive was anger at Brouwer’s declining a position in Göttingen in 1919
and, instead of critically examining this or recalling other documented sources of 
Hilbert’s anger (which would suggest Brouwer was not completely right in his 
estimation), van Stigt supports the assertion with further speculation. As an 
indication of just how far from the mark such speculation can get, I might mention 
van Stigt’s remark on Brouwer’s fellowship with various young poets: “Brouwer 
lacked the ability to express his feelings artistically’. This certainly runs counter to 
what every Dutchman I’ve spoken to has said of Brouwer’s correspondence with 
Adama van Scheltema: Brouwer’s prose is uniformly favourably compared with 
that of the poet. One criticism I’ve read and must agree with is that van Stigt’s 
psychological analysis of Brouwer is excessive and excessively negative. Indeed, it 
might be said that van Stigt has done a hatchet job on Brouwer. But, in doing so, 
he has still managed to paint a more positive image of Brouwer than is generally 
displayed to mathematicians and the reader can get a first, albeit rough, approxi- 
mation to Brouwer from van Stigt’s book. Thus, with the reservations that the book 
is only tangentially relevant to the mathematical community and that it must be 
approached very critically, I refer the reader to it for a faltering first step in getting 
to know the real Brouwer. 


429 South Warwick Avenue 
Westmont, IL 60559 


There are in this world optimists who feel 
that any symbol that starts off with an 
integral sign must necessarily denote 
something that will have every property 
that they should like an integral to pos- 
sess. This of course is quite annoying to us 
rigorous mathematicians; what is even 
more annoying is that by doing so they 
often come up with the right answer. 


—E.J. McShane 


Bulletin of the American Mathematical Society 
v 69, p 611, 1963. 


Answer to Picture Puzzle 
(p. 770) 


No, they are not: they are Emile Borel and 
Armand Borel. 
Apropos Two Notes on Notation


Antal E. Fekete 


The best thing that has happened to the Stirling numbers during their 400 years of
chequered history is Donald Knuth’s paper Two Notes on Notation. The symbol $\binom{n}{k}$
of von Ettinghausen for the binomial coefficients, introduced in 1826, has been
called ‘beautiful’, suggesting a most successful combination of aesthetics and
economy. For precisely the same reasons, Knuth recommends the use of the
symbols $\left[{n \atop k}\right]$ and $\left\{{n \atop k}\right\}$ to denote the Stirling numbers of the first and second kind, in
combination with a conceptual definition, leading to the rediscovery of the Law of
Reciprocity $\left\{{-k \atop -n}\right\} = \left[{n \atop k}\right]$. The purpose of this note is to explore reciprocity further,
and to make additional comments on terminology and notation.


1. BINOMIAL COEFFICIENTS AND RECIPROCITY. Let $\left\{{n \atop k}\right\}$ denote the number
of k-quotient sets of an n-set, and let $\left[{n \atop k}\right]$ denote the number of k-orbit
permutations of an n-set, called the Stirling numbers and the reciprocal Stirling
numbers, respectively. Their earlier names (Stirling numbers of the second and first
kind, respectively), have justly been criticized by Donald Knuth in [1] for being
historically incorrect and without mnemonic merit. However, his suggestion to call
$\left\{{n \atop k}\right\}$ a “Stirling subset number”, and to verbalize the symbol to read “n subset k”
would spell conceptual disaster. Reference to subsets in connection with the
Stirling numbers must be avoided. Stirling numbers count quotient sets; it is
binomial coefficients that count subsets. Indeed, the conceptual definition of $\binom{n}{k}$ is
the number of k-subsets of an n-set. The lattice of subsets is very different from
the lattice of quotient sets of an n-set (for the concept of a quotient set, see [2]).
The complementary nature of the two lattices, highlighted by the remarkable
formulas for the number of injective/surjective functions:


$$\mathrm{Inj}(k, n) = k!\binom{n}{k}, \qquad \mathrm{Surj}(n, k) = k!\left\{{n \atop k}\right\}$$


must be carefully preserved. In verbalizing, the best course is to read $\binom{n}{k}$ “n sub
k”, and to read $\left\{{n \atop k}\right\}$ “n quotient k”.
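
The two counting formulas can be confirmed by brute force on small sets. The sketch below is only illustrative: `quotient(n, k)` is a hypothetical name for the number of k-quotient sets of an n-set, computed from the usual recurrence, and functions are enumerated as tuples of values.

```python
from functools import lru_cache
from itertools import product
from math import comb, factorial

@lru_cache(maxsize=None)
def quotient(n, k):
    """Number of k-quotient sets (partitions into k classes) of an n-set."""
    if n == 0 or k == 0:
        return int(n == k)
    return quotient(n - 1, k - 1) + k * quotient(n - 1, k)

def count_injections(k, n):
    """Injective functions from a k-set into an n-set, by enumeration."""
    return sum(1 for f in product(range(n), repeat=k) if len(set(f)) == k)

def count_surjections(n, k):
    """Surjective functions from an n-set onto a k-set, by enumeration."""
    return sum(1 for f in product(range(k), repeat=n) if len(set(f)) == k)

for n in range(1, 6):
    for k in range(1, n + 1):
        assert count_injections(k, n) == factorial(k) * comb(n, k)
        assert count_surjections(n, k) == factorial(k) * quotient(n, k)

print(count_injections(3, 5), count_surjections(5, 3))
```

The design mirrors the text: subsets give injections after ordering the chosen elements, and quotient sets give surjections after labelling the classes.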


If we wish to extend a two-parameter symbol such as $\binom{n}{k}$ from the natural
numbers to integers, we are at the crossroads. We have to choose between the
conflicting demands of two conventions. On the one hand, we may have to refer to
rows with slope m, in which case the conventions governing plane coordinate
systems apply. On the other hand, we may have to do calculations with the infinite
matrix represented by the table, in which case the conventions governing matrix
calculus apply (i.e., increasing n means moving down, and increasing k means
moving right). I go along with the latter (which is also Knuth’s choice) but I shall,
all the same, refer to rows with slope m, which then must be interpreted in the
traditional coordinate system. This leads to the mild anomaly that the main
diagonal has slope −1 while the secondary diagonal has slope 1. To illustrate the
power of this new language I restate the familiar relation between the binomial
coefficients and the Fibonacci numbers: the sum of entries in the nth row with
slope 1 of Pascal’s triangle (Table 2) is equal to the nth Fibonacci number. For the
sum of entries in rows with slope m of Pascal’s triangle, see [3i], p 91.
The binomial coefficients satisfy the recurrence relation 


$$\binom{n}{k} = \binom{n-1}{k-1} + \binom{n-1}{k}.$$
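
Both the recurrence and the Fibonacci remark are easy to test. In the sketch below the indexing is an assumption: the "nth row with slope 1" is taken to consist of the entries C(n-1-j, j) for j = 0, 1, 2, ..., and the Fibonacci numbers start 1, 1, 2, 3, 5.

```python
from math import comb

def diagonal_sum(n):
    """Sum of the entries C(n-1-j, j), j >= 0, on one slope-1 line of Pascal's triangle."""
    return sum(comb(n - 1 - j, j) for j in range((n - 1) // 2 + 1))

def fibonacci(n):
    """nth Fibonacci number with F_1 = F_2 = 1."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

# Pascal's recurrence for 1 <= k <= n.
for n in range(1, 12):
    for k in range(1, n + 1):
        assert comb(n, k) == comb(n - 1, k - 1) + comb(n - 1, k)

print([diagonal_sum(n) for n in range(1, 11)])
```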

Some recurrence relations admit an extension of the two parameter symbol to 
negative integral values of n,k; others do not; still others may admit two essen- 
tially different extensions. The existence of an extension depends on the availability 
of sloping rows (columns) whose entries are in arithmetic progression. Indeed, every 
arithmetic progression can be continued for negative ordinals (note that it is 
possible to encounter several consecutive zeros without the continuation being 
identically zero!). The familiar extension of the binomial coefficients to negative 
values of n owes its existence to the fact that entries in the kth column of Pascal’s 
triangle are in arithmetic progression of order k, yielding the formula 


$$\binom{-n}{k} = (-1)^k \binom{n+k-1}{k}$$

and the values (blank spaces stand for 0’s) 


TABLE 1


   1  -4  10 -20  35 -56
   1  -3   6 -10  15 -21
   1  -2   3  -4   5  -6
   1  -1   1  -1   1  -1
   1
   1   1
   1   2   1
   1   3   3   1
   1   4   6   4   1
   1   5  10  10   5   1


Theorem 1. Entries in the kth column of Table 1


$$\ldots, \binom{-2}{k}, \binom{-1}{k}, \binom{0}{k}, \binom{1}{k}, \binom{2}{k}, \ldots, \binom{n}{k}, \ldots$$


are in arithmetic progression of order k, and the difference sequence is furnished by
the (k − 1)-st column.
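
Table 1 and Theorem 1 can be reproduced mechanically. The sketch below assumes the extension given by the displayed formula for negative upper index, with zero for k < 0; `binom_ext` is an illustrative name.

```python
from math import comb

def binom_ext(n, k):
    """First extension: C(n, k) for any integer n and k >= 0; zero for k < 0."""
    if k < 0:
        return 0
    if n >= 0:
        return comb(n, k)
    return (-1) ** k * comb(-n + k - 1, k)

# Rows n = -4, ..., -1 of Table 1, columns k = 0, ..., 5.
for n in range(-4, 0):
    print([binom_ext(n, k) for k in range(6)])

# Differences down the kth column are furnished by column k - 1.
for n in range(-8, 8):
    for k in range(0, 8):
        assert binom_ext(n + 1, k) - binom_ext(n, k) == binom_ext(n, k - 1)
```

The assertion is Pascal's recurrence read as a statement about differences, which is why the kth column is an arithmetic progression of order k.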


Here we have $\binom{n}{k} = 0$ for all k < 0, and $\binom{-n}{0} = 1$. However, this is not the only
possible extension. Another is based on the fact that entries in the nth row with
slope −1 of Pascal’s triangle are in arithmetic progression of order n, that can be
continued for negative values of k. It yields the Law of Reciprocity for binomial
coefficients:

$$\binom{n}{k} = (-1)^{n+k}\binom{-k-1}{-n-1}$$
and the values


TABLE 2

   1
  -6   1
  15  -5   1
 -20  10  -4   1
  15 -10   6  -3   1
  -6   5  -4   3  -2   1
   1  -1   1  -1   1  -1   1
                               1
                               1   1
                               1   2   1
                               1   3   3   1
                               1   4   6   4   1
                               1   5  10  10   5   1
                               1   6  15


Theorem 2. Entries in the nth row with slope −1 of Table 2


$$\ldots, \binom{n-2}{-2}, \binom{n-1}{-1}, \binom{n}{0}, \binom{n+1}{1}, \binom{n+2}{2}, \ldots, \binom{n+k}{k}, \ldots$$


are in arithmetic progression of order n, and the difference sequence is furnished by
the (n − 1)-st row with slope −1.


Theorem 2*. Reciprocity for the arithmetic progression of order m in Theorem 2 can
also be stated in terms of a determinant of order m − 1:


a 1 _ 0 0 


Lea] - Ln a) bn] 


The Law of Reciprocity can now be stated in terms of a reflection in the
secondary diagonal of Table 2, followed by attaching the signature $(-1)^{n+k}$ to
each entry $\binom{n}{k}$. Under this extension $\binom{n}{k} = 0$ for all n < k, and $\binom{-n}{0} = 0$. Since
there is no way to reconcile the two extensions, we are again at the crossroads. An
intelligent choice must be made in order to fix the meaning of the symbol $\binom{n}{k}$ for
negative integers. For certain applications, the extension discussed in this note
might be preferable. The second natural extension of the binomial coefficients to
negative arguments has been noted by Herbert Wilf and Doron Zeilberger [5].
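
The second extension can be tabulated in the same way. The sketch below is an illustration that assumes the reflection-with-signature rule C(n, k) = (-1)^(n+k) C(-k-1, -n-1) for n, k < 0 and zero in the two mixed-sign quadrants; that rule matches the entries of Table 2. The name `binom_ext2` is hypothetical.

```python
from math import comb

def sign(m):
    """(-1)**m for any integer m, kept as an int."""
    return -1 if m % 2 else 1

def binom_ext2(n, k):
    """Second extension of C(n, k) to all integers n and k."""
    if n >= 0 and k >= 0:
        return comb(n, k)
    if n < 0 and k < 0:
        return sign(n + k) * comb(-k - 1, -n - 1)
    return 0

# Row n = -2 of Table 2, columns k = -7, ..., -2.
print([binom_ext2(-2, k) for k in range(-7, -1)])

# Pascal's recurrence survives on the whole integer grid.
for n in range(-7, 8):
    for k in range(-7, 8):
        assert binom_ext2(n, k) == binom_ext2(n - 1, k - 1) + binom_ext2(n - 1, k)
```

Under this rule the value at (-n, 0) is 0 for n > 0, whereas the first extension gives 1 there, which is one way to see that the two extensions cannot be reconciled.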


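2. STIRLING NUMBERS AND RECIPROCITY. The Stirling numbers and the
reciprocal Stirling numbers satisfy the recurrence relations


$$\left\{{n \atop k}\right\} = \left\{{n-1 \atop k-1}\right\} + k\left\{{n-1 \atop k}\right\}, \qquad \left[{n \atop k}\right] = \left[{n-1 \atop k-1}\right] + (n-1)\left[{n-1 \atop k}\right]$$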
which can be used to complete their tables for positive n, k. In each case we find
that entries in the nth row with slope −1 are in arithmetic progression of order 2n
that can be continued for negative values of k. Moreover, since both progressions
have the same difference sequences up to and including the 2n-th, they are the
same, in fact, either one is the continuation of the other for negative values of k.
This yields the Law of Reciprocity for Stirling numbers, rediscovered by Knuth [1]:

$$\left\{{-k \atop -n}\right\} = \left[{n \atop k}\right]$$

and the values


TABLE 3

    1
   21    1
  175   15    1
  735   85   10    1
 1624  225   35    6    1
 1764  274   50   11    3    1
  720  120   24    6    2    1    1
                                       1
                                            1
                                            1    1
                                            1    3    1
                                            1    7    6    1
                                            1   15   25   10    1
                                            1   31   90   65   15    1
                                            1
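
The upper half of Table 3 can be regenerated from the reciprocity law. The sketch below uses illustrative names, `quotient` for the Stirling numbers and `orbit` for the reciprocal Stirling numbers. It defines the extended symbol for negative arguments by reflection and then checks that the first recurrence still holds on the whole integer grid.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def quotient(n, k):
    """Stirling numbers: k-quotient sets of an n-set, n, k >= 0."""
    if n == 0 or k == 0:
        return int(n == k)
    return quotient(n - 1, k - 1) + k * quotient(n - 1, k)

@lru_cache(maxsize=None)
def orbit(n, k):
    """Reciprocal Stirling numbers: k-orbit permutations of an n-set, n, k >= 0."""
    if n == 0 or k == 0:
        return int(n == k)
    return orbit(n - 1, k - 1) + (n - 1) * orbit(n - 1, k)

def quotient_ext(n, k):
    """Extension to all integers, using reciprocity when n and k are negative."""
    if n >= 0 and k >= 0:
        return quotient(n, k)
    if n < 0 and k < 0:
        return orbit(-k, -n)
    return 0

# Row n = -1 of Table 3, columns k = -7, ..., -1.
print([quotient_ext(-1, k) for k in range(-7, 0)])

for n in range(-7, 8):
    for k in range(-7, 8):
        assert quotient_ext(n, k) == quotient_ext(n - 1, k - 1) + k * quotient_ext(n - 1, k)
```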


Theorem 3. Entries in the nth row with slope —1 of Table 3 


(5B oon( Soon LEE olka 


are in arithmetic progression of order 2n such that the 2n-th difference sequence has
constant value $1 \cdot 3 \cdot 5 \cdots (2n - 1)$.


Theorem 3*. Reciprocity for the arithmetic progression of order m in Theorem 3 can 
also be stated in terms of a determinant of order m — 1: 


ie) 1 _ 0 0 
ey \("a2} (ESF 
ey SD Gey 


te) Leta} 7 Ae ab Ana] 


An analogous determinant formula holds for $\left[{n \atop k}\right]$. Theorem 3* follows from
the fact, proved in [3i] p 155, that the inverse of the infinite matrix of Table 3 is
obtained by a reflection of the table in the secondary diagonal, and attaching the
signature $(-1)^{n+k}$ to each entry $\left\{{n \atop k}\right\}$:


TABLE 3*

     1
   -21     1
   140   -15     1
  -350    65   -10     1
   301   -90    25    -6     1
   -63    31   -15     7    -3     1
     1    -1     1    -1     1    -1     1
                                               1
                                                     1
                                                    -1     1
                                                     2    -3     1
                                                    -6    11    -6     1
                                                    24   -50    35   -10     1
                                                  -120   274  -225    85   -15     1
                                                   720 -1764  1624  -735   175   -21     1


This fact can also be expressed by the formula


$$\sum_{k} (-1)^{n+k} \left\{{n \atop k}\right\} \left[{k \atop m}\right] = [m = n]$$


where I have used Iverson’s bracket [1].
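
The inversion relation between the two triangles can be tested directly for non-negative indices. The sketch below assumes the relation in the form: the sum over k of (-1)^(n+k) {n k}[k m] equals 1 if m = n and 0 otherwise. The names `quotient` and `orbit` are illustrative.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def quotient(n, k):
    """Stirling numbers {n k} for n, k >= 0."""
    if n == 0 or k == 0:
        return int(n == k)
    return quotient(n - 1, k - 1) + k * quotient(n - 1, k)

@lru_cache(maxsize=None)
def orbit(n, k):
    """Reciprocal Stirling numbers [n k] for n, k >= 0."""
    if n == 0 or k == 0:
        return int(n == k)
    return orbit(n - 1, k - 1) + (n - 1) * orbit(n - 1, k)

def inversion_sum(n, m):
    """Sum over k of (-1)^(n+k) * {n k} * [k m]."""
    return sum((-1) ** (n + k) * quotient(n, k) * orbit(k, m)
               for k in range(n + 1))

for n in range(8):
    print([inversion_sum(n, m) for m in range(8)])
```

The printed rows form the identity matrix, with the Iverson bracket [m = n] playing the role of the Kronecker delta.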

The proof of Theorem 3 is based on the famous formulas, independently 
discovered by Appell [6], Jordan [7], and Ward [8], expressing entries in the nth 
rows with slope —1 as polynomials in k of degree 2n, 


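$${k \brace k-n} = \binom{k}{n+1}\left\{\!\!\left\{ {n+1 \atop 1} \right\}\!\!\right\} + \binom{k}{n+2}\left\{\!\!\left\{ {n+2 \atop 2} \right\}\!\!\right\} + \cdots + \binom{k}{2n}\left\{\!\!\left\{ {2n \atop n} \right\}\!\!\right\} \qquad (1)$$

$${k \brack k-n} = \binom{k}{n+1}\left[\!\!\left[ {n+1 \atop 1} \right]\!\!\right] + \binom{k}{n+2}\left[\!\!\left[ {n+2 \atop 2} \right]\!\!\right] + \cdots + \binom{k}{2n}\left[\!\!\left[ {2n \atop n} \right]\!\!\right] \qquad (2)$$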


where the coefficients 


(nF mare Sof ane [PP pears 


depend on n but not on k, for i= 1,2,...,n. Here an{ ° | denotes Aa, for 


us .}: In general, the terms of a sequence b, are in arithmetic progression 
of order n if, and only if, $b_k$ is a polynomial in k of degree n, in which case the
nth difference sequence is constant and is equal to n! times the coefficient of $k^n$.


It follows that the 2n-th difference sequence of the progression $a_k$ is constant and
equal to $\left\{\!\!\left\{ {2n \atop n} \right\}\!\!\right\}$ and $\left[\!\!\left[ {2n \atop n} \right]\!\!\right]$, respectively. Knuth gives reference to several proofs of
the fact that (1) and (2) are polynomials in k of degree 2n. In my presentation it is
clear that (1) and (2) are special cases of Newton's first formula expressing the
subsequent terms of the arithmetic progression of Theorem 3. This remark makes
further proof unnecessary. However, I shall furnish yet another proof which has
the merit of making these formulas conceptually transparent. The symbol $\left\{\!\!\left\{ {n \atop k} \right\}\!\!\right\}$ is
called a second order Stirling number, defined as the number of k-quotient sets of
an n-set having at least two elements in each equivalence class; and the symbol $\left[\!\!\left[ {n \atop k} \right]\!\!\right]$
is called a second order reciprocal Stirling number, defined as the number of k-orbit
permutations of an n-set having at least two elements in each orbit. In other
words, the permutations have no fixed points and therefore are derangements; thus
$\left[\!\!\left[ {n \atop k} \right]\!\!\right]$ is the number of k-orbit derangements of an n-set.

The proof of (1) is based on a survey of quotient sets of a k-set with k - n
equivalence classes; these clearly have at least k - 2n, but no more than k - n - 1,
single-element classes. The number of those having exactly k - n - i single-element
classes is obviously

$$\binom{k}{n+i}\left\{\!\!\left\{ {n+i \atop i} \right\}\!\!\right\}.$$


If we add up these numbers for i = 1,2,...,”, then we have accounted for every 
quotient set under survey exactly once, proving (1); (2) is proved mutatis mutandis. 
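The counting step of this proof can be replayed by brute force for small sets. The following sketch is illustrative only (all names are hypothetical): it enumerates every quotient set of a k-set, sorts them by the number of classes and of single-element classes, and compares the census with the product of a binomial coefficient and a second order Stirling number, the latter read off from the same census as the count of quotient sets without single-element classes.

```python
from collections import Counter
from math import comb

def set_partitions(elements):
    """Yield every partition of the list `elements` as a list of blocks."""
    if not elements:
        yield []
        return
    first, rest = elements[0], elements[1:]
    for partition in set_partitions(rest):
        # Put `first` into each existing block in turn ...
        for i in range(len(partition)):
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]
        # ... or into a new block of its own.
        yield [[first]] + partition

def census(size):
    """Count partitions of a `size`-set by (number of blocks, number of singletons)."""
    counts = Counter()
    for partition in set_partitions(list(range(size))):
        singles = sum(1 for block in partition if len(block) == 1)
        counts[(len(partition), singles)] += 1
    return counts

MAX_SIZE = 7
TABLE = {m: census(m) for m in range(MAX_SIZE + 1)}

def check_formula_1(k, n):
    """Replay the proof of (1) for a k-set with k - n classes (1 <= n < k <= MAX_SIZE)."""
    total = 0
    for i in range(1, n + 1):
        if n + i > k:
            break  # the binomial coefficient vanishes from here on
        # TABLE[n + i][(i, 0)]: i classes on n + i elements, none of size one.
        predicted = comb(k, n + i) * TABLE[n + i][(i, 0)]
        observed = TABLE[k][(k - n, k - n - i)]
        if predicted != observed:
            return False
        total += predicted
    with_k_minus_n_classes = sum(
        count for (blocks, _), count in TABLE[k].items() if blocks == k - n
    )
    return total == with_k_minus_n_classes
```

The same census could be repeated with permutations and their orbits to replay the proof of (2).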


Saha me tol 
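and $\left\{\!\!\left\{ {2n \atop n} \right\}\!\!\right\} = 1\cdot 3\cdot 5\cdots(2n-1) = \left[\!\!\left[ {2n \atop n} \right]\!\!\right]$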


because in the latter case each equivalence class (orbit) has exactly 2 elements. 
This completes the proof of Theorem 3. We also have 


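$$\left\{\!\!\left\{ {2n \atop n+k} \right\}\!\!\right\} = 0 = \left[\!\!\left[ {2n \atop n+k} \right]\!\!\right] \quad\text{and}\quad \left\{\!\!\left\{ {2n+1 \atop n+k} \right\}\!\!\right\} = 0 = \left[\!\!\left[ {2n+1 \atop n+k} \right]\!\!\right]$$

for k = 1, 2, 3, ...
because if a quotient set of a 2n-set, or a (2n + 1)-set, has more than n equivalence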


classes (orbits), then at least one of them must be a single-element class (orbit). 
The recurrence relations 


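$$\left\{\!\!\left\{ {n+1 \atop k} \right\}\!\!\right\} = k\left\{\!\!\left\{ {n \atop k} \right\}\!\!\right\} + n\left\{\!\!\left\{ {n-1 \atop k-1} \right\}\!\!\right\}, \qquad \left[\!\!\left[ {n+1 \atop k} \right]\!\!\right] = n\left[\!\!\left[ {n \atop k} \right]\!\!\right] + n\left[\!\!\left[ {n-1 \atop k-1} \right]\!\!\right]$$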


can be proved by a simple combinatorial argument, not reproduced here, and they 
yield the numbers 


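TABLE FOR $\left\{\!\!\left\{ {n \atop k} \right\}\!\!\right\}$ and TABLE FOR $\left[\!\!\left[ {n \atop k} \right]\!\!\right]$

 n     k=1   k=2   k=3   k=4          k=1     k=2     k=3    k=4
 1       0     0     0     0            0       0       0      0
 2       1     0     0     0            1       0       0      0
 3       1     0     0     0            2       0       0      0
 4       1     3     0     0            6       3       0      0
 5       1    10     0     0           24      20       0      0
 6       1    25    15     0          120     130      15      0
 7       1    56   105     0          720     924     210      0
 8       1   119   490   105         5040    7308    2380    105
 9       1   246  1918  1260        40320   64224   26432   2520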
Note that the coefficients in Formulas (1) and (2) are exactly those in the rows of
these tables with slope -1. More extensive tables are found in [3ii] pp. 57, 98 and
in [4] pp. 152, 172. Second order Stirling numbers can also be expressed explicitly in
terms of the Stirling numbers, thanks to Newton's second formula:

$$\left\{\!\!\left\{ {n \atop k} \right\}\!\!\right\} = \sum_{i} (-1)^i \binom{n}{i} {n-i \brace k-i} \quad\text{and}\quad \left[\!\!\left[ {n \atop k} \right]\!\!\right] = \sum_{i} (-1)^i \binom{n}{i} {n-i \brack k-i}.$$
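As a check on the explicit expression, the following sketch evaluates the alternating sums and compares them with entries of the two tables of second order numbers. It is an illustration under assumptions: the function names are hypothetical, the ordinary Stirling numbers come from their standard recurrences, and the sum is taken in the inclusion-exclusion form with i running from 0 to k.

```python
from functools import lru_cache
from math import comb

@lru_cache(maxsize=None)
def subset_number(n, k):
    """Stirling number of the second kind."""
    if n == k:
        return 1
    if k <= 0 or k > n:
        return 0
    return k * subset_number(n - 1, k) + subset_number(n - 1, k - 1)

@lru_cache(maxsize=None)
def cycle_number(n, k):
    """Unsigned Stirling number of the first kind."""
    if n == k:
        return 1
    if k <= 0 or k > n:
        return 0
    return (n - 1) * cycle_number(n - 1, k) + cycle_number(n - 1, k - 1)

def second_order(first_order, n, k):
    """Alternating sum of binom(n, i) * first_order(n - i, k - i), for 0 <= k <= n.

    The i-th term removes the configurations in which i chosen elements
    are single-element classes (or fixed points).
    """
    return sum(
        (-1) ** i * comb(n, i) * first_order(n - i, k - i)
        for i in range(k + 1)
    )
```

For example, `second_order(subset_number, 8, 3)` gives 490 and `second_order(cycle_number, 8, 2)` gives 7308, in agreement with the row n = 8 of the tables.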
More generally, a Stirling number of order r is the number of k-quotient sets of 

an n-set having at least r elements in each equivalence class, and a reciprocal 


Stirling number of order r is the number of k-orbit permutations of an n-set 
having at least r elements in each orbit. I recommend the notation 


il} le i (Ce Wd) Cah ey 
lel Cdl. = te 


to denote these numbers, which satisfy the recurrence relations 


ey (roca) ee Lal 


= (n= (m= Io (n= re H/T] ee —H]" IY 


and 


Related concepts are that of an r-Stirling number [9], and that of a weighted 
Stirling number [10]. 


3. CONCLUSION. The main result of this note is Theorem 3*, establishing an
explicit formula for the reciprocals of Stirling numbers. This determinant formula
may be used to discover the yet unknown reciprocals of certain other numbers
dependent on two integral parameters, for example, the Lah numbers which, like
the binomial coefficients, are self-reciprocal, and the idempotent numbers which,
like the Stirling numbers, are not, although their reciprocals have an interesting
combinatorial interpretation. (For the definition of the Lah numbers and the
idempotent numbers, see [3i].)


ACKNOWLEDGMENTS. I am indebted to Donald Knuth and Herbert Wilf for valuable comments 
they have made on the first draft, and also for calling my attention to important references. 
PC Curves


I write to correct a remark made by Thomas Banchoff and Peter 
Giblin in their article “On the geometry of piecewise circular curves” 
(May 1994, pp. 403-416). 


On page 405, they claim that an exterior offset of an ellipse is an 
algebraic curve of degree 4. In fact, offsetting an ellipse by a distance 
d produces a single algebraic curve of degree 8. It is only when we 
restrict from the complex numbers to the reals that this curve breaks 
up into separate interior and exterior components. 


Looking at examples in which the offset distance d is small, one might 
be fooled into thinking that the interior and exterior offsets were 
separate curves, each of degree 4. But in the example above, where a 
straight line intersects an offset at eight distinct, real points, six of 
those points lie on the interior component. 


Dr. Lyle Ramshaw 

Digital Equipment Corporation 
Systems Research Center 

130 Lytton Avenue 

Palo Alto, CA 94301 


NOTES 
Edited by: John Duncan 


The Coin Exchange Problem 
for Arithmetic Progressions 


Amitabha Tripathi 


We consider a problem which, in the general case, dates back to Frobenius, and 
can be thought of as exchanging coins of arbitrary denomination with an infinite 
supply of coins of certain fixed denominations. 


Hence, we are given integers $a_0, \ldots, a_{k-1}$, and N, and we seek nonnegative
integers $x_0, \ldots, x_{k-1}$ such that

$$a_0x_0 + \cdots + a_{k-1}x_{k-1} = N. \qquad (1)$$

In this note, we will consider the case where the given coin denominations $a_j$ are
in arithmetic progression. We write $a_j = a + (j - 1)d$, with (a, d) = 1, for
$1 \le j \le k$. For a fixed value of $k \ge 2$, we denote by g(a, d; k) (respectively, n(a, d; k))
the largest (respectively, number of) N such that

$$a\Big(\sum_{j=0}^{k-1} x_j\Big) + d\Big(\sum_{j=0}^{k-1} j\,x_j\Big) = N \qquad (2)$$

has no solution in nonnegative integers.

We present a simple argument that results in a formula for g(a,d;k) and 
n(a,d;k). A variation of the formula for g(a, 1; k) is due to [Bra42] while that for 
n(a,1;k) is due to [NW72]. [Rob56] generalized this to obtain g(a, d;k), later 
simplified by [Bat58], and [Gra73] obtained the result for n(a, d; k). 

Let $\mathcal{C}$ denote a nonzero residue class modulo a, and let $m_{\mathcal{C}}$ denote the least
positive integer of the form $a\big(\sum_{j=0}^{k-1} x_j\big) + d\big(\sum_{j=0}^{k-1} j\,x_j\big)$ in $\mathcal{C}$. It is well known that
both g(a,d;k) and n(a,d;k) can be readily derived from these minima. A 
derivation of these may be found in [Tri89], but we present it here for the sake of 
completeness. 


Lemma 1. 


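(i) $g(a, d; k) = \max_{\mathcal{C}} m_{\mathcal{C}} - a$, the maximum taken over all nonzero classes $\mathcal{C}$.
(ii) $n(a, d; k) = \frac{1}{a}\sum_{\mathcal{C}} m_{\mathcal{C}} - \frac{a-1}{2}$, the sum taken over all nonzero
classes $\mathcal{C}$.


Proof: (i) Since $\max_{\mathcal{C}} m_{\mathcal{C}}$ and $\max_{\mathcal{C}} m_{\mathcal{C}} - a$ are in the same residue class modulo
a, and $\max_{\mathcal{C}} m_{\mathcal{C}}$ is the least positive integer of the form (2) in its class, $g(a, d; k)
\ge \max_{\mathcal{C}} m_{\mathcal{C}} - a$. On the other hand, if $N > \max_{\mathcal{C}} m_{\mathcal{C}} - a$, then $N > m_{\mathcal{C}} - a$
for each class $\mathcal{C}$, hence $N \ge m_{\mathcal{C}}$ for the class containing N, so that N is of the given form.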

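(ii) The number of positive integers in class $\mathcal{C}$ which cannot be represented in
the form given by (2) is $\lfloor m_{\mathcal{C}}/a \rfloor$, since $m_{\mathcal{C}}$ is the least representable integer in
class $\mathcal{C}$. Hence, the total number of nonrepresentable integers is

$$\sum_{\mathcal{C}} \left\lfloor \frac{m_{\mathcal{C}}}{a} \right\rfloor = \frac{1}{a}\sum_{\mathcal{C}} m_{\mathcal{C}} - \frac{1}{a}\sum_{j=1}^{a-1} j = \frac{1}{a}\sum_{\mathcal{C}} m_{\mathcal{C}} - \frac{a-1}{2}. \qquad \Box$$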


We note that the lemma together with its proof carries over to the general case 
given by (1), with g and n being defined analogously. 
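Lemma 1 in its general form can be tried out on small coin systems. The sketch below is a hedged illustration, not the author's procedure: the function names are hypothetical, the coins are assumed to be relatively prime with smallest coin at least 2, and the search bound (smallest coin times largest coin) is an assumption that relies on the classical upper bound for the largest nonrepresentable integer.

```python
def representable(coins, limit):
    """ok[n] is True when n is a nonnegative integer combination of the coins."""
    ok = [False] * (limit + 1)
    ok[0] = True
    for n in range(1, limit + 1):
        ok[n] = any(n >= c and ok[n - c] for c in coins)
    return ok

def via_minima(coins):
    """Compute (g, n) from the least representable integer in each nonzero class."""
    a = min(coins)
    limit = a * max(coins)          # assumed large enough to contain every minimum
    ok = representable(coins, limit)
    minima = {}
    for n in range(1, limit + 1):
        r = n % a
        if r and ok[n] and r not in minima:
            minima[r] = n           # first representable integer met in class r
    g = max(minima.values()) - a                             # Lemma 1 (i)
    count = (sum(minima.values()) - a * (a - 1) // 2) // a   # Lemma 1 (ii)
    return g, count

def brute_force(coins):
    """Compute (g, n) by listing the nonrepresentable positive integers directly."""
    limit = min(coins) * max(coins)
    ok = representable(coins, limit)
    missing = [n for n in range(1, limit + 1) if not ok[n]]
    return max(missing), len(missing)
```

With the coins 5, 7, 9 the minima in the four nonzero classes modulo 5 are 16, 7, 18, 9, which gives g = 18 - 5 = 13 and n = (50 - 10)/5 = 8.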


Lemma 2. For each y, $1 \le y \le a - 1$, the least positive integer of the form given by
(2) in the class dy mod a is given by

$$a\left(1 + \left\lfloor \frac{y-1}{k-1} \right\rfloor\right) + dy.$$


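Proof: Each nonzero class $\mathcal{C}$ modulo a determines a unique y, with $1 \le y \le a - 1$,
such that $dy \in \mathcal{C}$. This clearly is the smallest multiple of d in the class $\mathcal{C}$. For this
choice of y, we wish to minimize $\sum_{j=0}^{k-1} x_j$ in order to determine the smallest
member of $\mathcal{C}$. With $y = (k - 1)q + r$, $0 \le r \le k - 2$, we may choose $x_{k-1} = q$,
with $x_r = 1$, other $x_j = 0$ provided $r \ne 0$ but with all other $x_j = 0$ if r = 0. Thus,
the minimum value for $\sum_{j=0}^{k-1} x_j$ is q + 1 if $r \ne 0$ and q if r = 0. This may be
written more briefly as

$$1 + \left\lfloor \frac{y-1}{k-1} \right\rfloor,$$

so that

$$m_{\mathcal{C}} = a\left(1 + \left\lfloor \frac{y-1}{k-1} \right\rfloor\right) + dy, \quad\text{where } dy \in \mathcal{C}. \qquad \Box$$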


Theorem 1.
(i) $g(a, d; k) = a\lfloor (a - 2)/(k - 1) \rfloor + d(a - 1)$;
(ii) $n(a, d; k) = \frac{1}{2}(a + t)\left(1 + \lfloor (a - 2)/(k - 1) \rfloor\right) + \frac{1}{2}(a - 1)(d - 1)$, where t is
the smallest nonnegative integer such that $a - 2 \equiv t \pmod{k - 1}$.
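The two closed forms can be compared with a direct search over small parameters. This is a minimal sketch under stated assumptions: the function names are hypothetical, the coins are taken to be a, a + d, ..., a + (k - 1)d as in (2), with a at least 2 and (a, d) = 1, and the search bound is the same assumed bound as in any brute-force treatment (smallest coin times largest coin).

```python
from math import gcd

def theorem_g(a, d, k):
    """Largest nonrepresentable integer according to Theorem 1 (i)."""
    return a * ((a - 2) // (k - 1)) + d * (a - 1)

def theorem_n(a, d, k):
    """Number of nonrepresentable integers according to Theorem 1 (ii)."""
    q, t = divmod(a - 2, k - 1)     # t is a - 2 reduced modulo k - 1
    return ((a + t) * (1 + q) + (a - 1) * (d - 1)) // 2

def brute_force(a, d, k):
    """Direct count for the coins a, a + d, ..., a + (k - 1)d."""
    coins = [a + j * d for j in range(k)]
    limit = a * max(coins)          # assumed to exceed every nonrepresentable integer
    ok = [False] * (limit + 1)
    ok[0] = True
    for n in range(1, limit + 1):
        ok[n] = any(n >= c and ok[n - c] for c in coins)
    missing = [n for n in range(1, limit + 1) if not ok[n]]
    return max(missing), len(missing)

def agrees(a, d, k):
    """True when both formulas of Theorem 1 match the direct search."""
    return (theorem_g(a, d, k), theorem_n(a, d, k)) == brute_force(a, d, k)
```

For a = 5, d = 2, k = 3 (coins 5, 7, 9) the formulas give g = 5·1 + 2·4 = 13 and n = (6·2 + 4·1)/2 = 8.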


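Proof: The theorem is an easy consequence of the two lemmas.

(i)

$$g(a, d; k) = \max_{\mathcal{C}} m_{\mathcal{C}} - a = \max_{1 \le y \le a-1}\left\{ a\left(1 + \left\lfloor \frac{y-1}{k-1} \right\rfloor\right) + dy \right\} - a = a\left\lfloor \frac{a-2}{k-1} \right\rfloor + d(a - 1).$$

(ii)

$$n(a, d; k) = \frac{1}{a}\sum_{\mathcal{C}} m_{\mathcal{C}} - \frac{a-1}{2}$$

$$= \frac{1}{a}\sum_{y=1}^{a-1}\left\{ a\left(1 + \left\lfloor \frac{y-1}{k-1} \right\rfloor\right) + dy \right\} - \frac{a-1}{2}$$

$$= \sum_{y=1}^{a-1}\left(1 + \left\lfloor \frac{y-1}{k-1} \right\rfloor\right) + \frac{d(a-1)}{2} - \frac{a-1}{2}$$

$$= \sum_{y=0}^{\lfloor (a-2)/(k-1) \rfloor (k-1) - 1}\left(1 + \left\lfloor \frac{y}{k-1} \right\rfloor\right) + \sum_{y=\lfloor (a-2)/(k-1) \rfloor (k-1)}^{a-2}\left(1 + \left\lfloor \frac{y}{k-1} \right\rfloor\right) + \frac{(a-1)(d-1)}{2}$$

$$= (k-1)\left\{1 + 2 + \cdots + \left\lfloor \frac{a-2}{k-1} \right\rfloor\right\} + (1 + t)\left\{1 + \left\lfloor \frac{a-2}{k-1} \right\rfloor\right\} + \frac{(a-1)(d-1)}{2},$$

where $t \equiv a - 2 \pmod{k - 1}$,

$$= \frac{1}{2}\left(1 + \left\lfloor \frac{a-2}{k-1} \right\rfloor\right)\big\{(a - 2 - t) + 2 + 2t\big\} + \frac{(a-1)(d-1)}{2}$$

$$= \frac{1}{2}\left(1 + \left\lfloor \frac{a-2}{k-1} \right\rfloor\right)(a + t) + \frac{(a-1)(d-1)}{2},$$

where $t \equiv a - 2 \pmod{k - 1}$. $\Box$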


ACKNOWLEDGMENT. The author wishes to thank Preety Nigam and the referee for their helpful 
Congruence of Triangles


Leonard Gillman 


“They seem to have forgotten all their high school geometry.” This could be 
almost any college teacher speaking. Can part of the problem be that the course 
has become dull? I suggest injecting some mathematical spirit to freshen it up. 

I got to thinking about this recently on reading Daniel Hirschhorn's interesting
article [1] about "SSA", the ambiguous case in the congruence of triangles. It turns
out that if the angle is the one opposite the larger side, then there is no ambiguity
but a theorem: in Hirschhorn's clever notation, the SsA theorem. Hirschhorn
laments that although the result was published at least a dozen years ago [2, 3], it
has not yet made its way into the textbooks.
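The difference between the ambiguous case and the SsA case shows up in a short computation. The sketch below is only an illustration, not taken from the cited articles: the function name and the labeling (angle A opposite side a, with b the other given side) are assumptions. The law of cosines turns the data into a quadratic equation for the third side c, and when a is at least b the product of its roots, b² - a², is not positive, so at most one root is a usable length.

```python
import math

def third_sides(a, b, angle_a_degrees):
    """Positive third sides c of triangles with sides a, b and angle A opposite a.

    Law of cosines: a^2 = b^2 + c^2 - 2*b*c*cos(A), a quadratic in c.
    """
    cos_a = math.cos(math.radians(angle_a_degrees))
    discriminant = (b * cos_a) ** 2 - (b * b - a * a)
    if discriminant < 0:
        return []                      # no triangle fits the data
    root = math.sqrt(discriminant)
    candidates = {b * cos_a - root, b * cos_a + root}
    return sorted(c for c in candidates if c > 1e-12)
```

With the angle opposite the larger side, `third_sides(5, 3, 40)` returns a single length, while `third_sides(3, 5, 30)` returns two, the classical ambiguous case.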

I too lament the fact that the theorem has not caught on. One reason may be 
that SsA is an outsider, a role accentuated by the fact that the classical SSS, SAS, 
and ASA appear in most of today’s textbooks only as postulates. 

Another possible reason is the proofs. Even though the proof in [3] is concise, it 
is still indirect, and students tend to be uncomfortable with indirect proofs. 
Hirschhorn supplies a direct proof, but it is much more complicated than the other 
one and in fact is a veritable tour de force of diagrams and notation. 

I propose introducing the notion of a representative of an equivalence class.
From this point of view, congruence conditions determine a triangle (that is, except 
for its location and orientation). For example, two sides and the included angle 
determine a triangle. The four congruence proofs are then put on an equal footing. 
Figures 1—4 are proofs without words—in fact, without letters. 

Here are some words. In Figure 1, one side (drawn heavy) determines two of 
the vertices, and the intersection of two appropriate arcs then locates the third 
vertex. In Figure 2, the remaining two vertices are the endpoints of the two sides 
forming the given angle. In Figure 3, the third vertex is the intersection of the rays 
forming the given angles with the given side. 

In Figure 4, the given shorter side is drawn heavy. The argument rests on the
fact that the longer of two segments from a point to a line cuts off the greater
distance from the perpendicular. In the triangle on the right, the given angle is an
obtuse or a right angle; on the left, it is acute. In each case there is exactly one
location for the third vertex.


Figure 1. SSS


Figure 2. SAS


Figure 3. ASA


Figure 4. SSA
A Short Elementary Proof
of the Mohr-Mascheroni Theorem


Norbert Hungerbühler


1. INTRODUCTION. In 1797 Lorenzo Mascheroni surprised the mathematical
world with the theorem that every geometric construction that can be carried out
by compasses and ruler may be done without ruler (see [4]). It turned out later that
Georg Mohr had already proved this theorem in 1672 (see [6]). The proofs given by
Mohr and Mascheroni are quite complicated. Easier proofs were developed
later (see [3] or [5]). Furthermore, the proof could be simplified by means of
circular inversion (see [1] or [2]). Here we give a very short and direct proof of
the theorem that does not appeal to inversion.


2. THE MOHR-MASCHERONI THEOREM 


Theorem. Every geometric construction carried out by compasses and ruler can be 
done without ruler. 


Proof: We have to prove that the following three fundamental constructions can
be carried out with compasses alone.


1. Points of intersection of two circles given by their centers and radii.

2. Points of intersection of a circle (given by center and radius) and a straight
line (given by two points).

3. Point of intersection of two straight lines, each of them given by two points.


There is nothing to prove for the intersection of two circles, so let us consider 


2.1. Points of intersection of a circle and a straight line. Here we have to 
distinguish two cases: 

1. The straight line misses the center of the circle. 

2. The straight line passes through the center of the circle. 
The first case is covered by the following construction: 


Construction 1. If the straight line g is given by the points $P_1$ and $P_2$, we reflect
the center M of the given circle K with respect to g as Figure 1 indicates. Then
we find the two points of intersection $\{X, Y\} = K \cap g$ as the points of intersection
of K and the reflected circle K'.


Before we are able to attack the second case, we need a construction that
allows us to bisect a segment AB without ruler. This can be done as follows:


Construction 2. Let $K_1$ be the circle through B with center A and $K_2$ the circle
through A with center B, with $K_1 \cap K_2 = \{C, D\}$ (see Figure 2). We then find a
point E as the intersection of $K_2$ and the circle $K_3$ through D around C. Note
that B is the bisection point of AE. Let F and G be the points of intersection of
$K_1$ and the circle $K_4$ through A around E. Then we get the bisection point M of
AB as intersection of the circles $K_5$ through A around F and $K_6$ through A
around G.


Figure 2


The correctness of the construction is evident: Note that the triangles FAM and 
EFA are similar with proportion 1: 2. 
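Construction 2 can be traced numerically, using only the one operation that compasses provide: intersecting two circles given by center and radius. The following sketch is an illustration with hypothetical helper names; it assumes that each pair of circles really meets in two points, as it does in this construction.

```python
import math

def dist(p, q):
    """Euclidean distance between two points."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

def circle_intersections(c1, r1, c2, r2):
    """Both intersection points of two circles (assumed to intersect)."""
    d = dist(c1, c2)
    a = (r1 * r1 - r2 * r2 + d * d) / (2 * d)   # signed distance from c1 to the chord
    h = math.sqrt(max(r1 * r1 - a * a, 0.0))    # half length of the common chord
    ux, uy = (c2[0] - c1[0]) / d, (c2[1] - c1[1]) / d
    mx, my = c1[0] + a * ux, c1[1] + a * uy
    return (mx - h * uy, my + h * ux), (mx + h * uy, my - h * ux)

def other(points, known):
    """Of two intersection points, the one that is not the already known point."""
    return max(points, key=lambda p: dist(p, known))

def bisect_with_compasses(A, B):
    """Bisection point M of AB, following the steps of Construction 2."""
    r = dist(A, B)
    C, D = circle_intersections(A, r, B, r)                    # K1 and K2
    E = other(circle_intersections(B, r, C, dist(C, D)), D)    # K2 and K3
    F, G = circle_intersections(A, r, E, dist(E, A))           # K1 and K4
    return other(circle_intersections(F, dist(F, A), G, dist(G, A)), A)  # K5 and K6
```

For A = (0, 0) and B = (1, 0) the intermediate point E is (2, 0), the points F and G are (1/4, ±√15/4), and the returned point is (1/2, 0) up to rounding.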


Remark 1. Note that AE has double the length of the segment AB!


Now we construct the points of intersection X and Y of a circle K with center
M and a straight line MP:


Construction 3. Let A be an arbitrary point on K and $K \cap AP = \{A, B\}$ (see
Figure 3). B is constructed according to Construction 1. Let $K_1$ be a circle through
A and B with radius larger than the radius R of K, and $M_1$ the center of $K_1$. Now
we construct a segment CD with endpoints on $K_1$ and length 2R (see Remark 1).
Then we obtain P' as the intersection of CD and the circle $K_2$ through P around
$M_1$ according to Construction 1. Let $M_2$ be the bisection point of CD (see
Construction 2) and $K_3$ the circle around $M_2$ through C. Let E be a point of $K_3$
with P'E = PB. Now X and Y lie on K and BX = EC and BY = ED.


The correctness of the construction can be verified as follows: Note that
$PX \cdot PY = PA \cdot PB = P'C \cdot P'D$ by applying Euler's Theorem on intersecting secants,
once for K and then for $K_1$. Hence the sets of points P, Y, M, X, B and
P', D, $M_2$, C, E are congruent by construction. Thus in fact X and Y are obtained
as described.


Figure 3


2.2. Point of intersection of two straight lines. Here we first need the following
construction with compasses alone, which allows us to construct the footpoint L of the
perpendicular through a point Q on a straight line $P_1P_2$:


Construction 4. Just reflect Q with respect to $P_1P_2$ (see Figure 4). If Q' is the
reflected point, we find L as the bisection point of QQ' by Construction 2.


Let us now analyze the situation of two straight lines $P_1P_2$ and $Q_1Q_2$ intersecting
in S (see Figure 5):


Let L be the footpoint of the perpendicular through $Q_1$ on $P_1P_2$ and N be the
footpoint of the perpendicular through L on $Q_1Q_2$. Both L and N are obtained
by Construction 4. Hence we have the relation


$$(Q_1L)^2 = Q_1N \cdot Q_1S.$$


The idea is now to construct the length l of $Q_1S$, since then we find S as
intersection of $Q_1Q_2$ and a circle with center $Q_1$ and radius l (see Construction 3)
and we are through! In fact l is obtained as follows:


Figure 4


Figure 5


Figure 6


Construction 5. First we double $Q_1L$ ($Q_1C = 2\,Q_1L$) according to Remark 1 (see
Figure 6). Let K be an arbitrary (but large enough) circle through $Q_1$ and C and
let D be a point of K with $LD = Q_1N$. Further let E denote the intersection of
LD and K (see Construction 1). Then LE has length l since we have
$(Q_1L)^2 = Q_1L \cdot LC = LD \cdot LE = Q_1N \cdot LE$ by Euler's theorem on intersecting chords in a
circle.


REFERENCES 


1. H. Dörrie, 100 Famous Problems in Mathematics, München: Oldenbourg, 1960.

2. H. Eves, A Survey of Geometry, Allyn and Bacon, 1972.

3. R. Honsberger, Ingenuity in Mathematics, Washington: Math. Association of America, 1970, ib. 7
(New Mathematical Library, 23).

4. A. N. Kostovskii, Geometrical Constructions Using Compasses Only, Blaisdell Publication Company,
1961.

5. Lorenzo Mascheroni, Géométrie du compas, Coubron: Monom, 1980.

6. Georg Mohr, Euclides danicus, Amsterdam: Van Velsen, 1672 (København: 1928).


Mathematik Department
ETH Zürich

CH-8092, Zürich
buhler@math.ethz.ch 




THE AUTHORS 


MARIA TERRELL graduated from Albertus Magnus College and received her Ph.D. at the University 
of Virginia. Her research focuses on geometry, its history, and its relation to vision. She is currently 
developing an introductory level geometry course, with support from an NSF curriculum development 
grant. 


ROBERT TERRELL graduated from North Carolina State University, and received his Ph.D. from the 
University of Virginia. He was an engineer for a time in industry, has patents for machinery, and 
currently enjoys teaching mathematics to engineering students. 

The Terrells were students of E. E. Floyd, who was not only a great mathematician, but a superb 
teacher. 


JAMES BULLOCK presently holds a position as Assistant Professor of Physiology at the University of 
Missouri-Columbia School of Medicine. His academic credentials are in the field of biology, having 
earned a baccalaureate degree in that subject from the Illinois Institute of Technology and been 
awarded the Ph.D. degree in biophysics and theoretical biology by the University of Chicago. The 
phenomenon that has captured most of his attention is the transport of ions across cell membranes. He 
points out that biologists have long recognized transport processes of all sorts as crucial to living 
organisms, and have relied heavily upon principles of physical science and the corresponding mathemat- 
ics in analyzing questions of function. That mathematics is ubiquitous in the literature of physiology but 
scarce in its classrooms would have been sufficient to cause him some apprehension. The concern 
expressed in his article is, in the parlance of his fellow biologists, that the lesion may be more 
widespread. 


GORO SHIMURA is self-taught to a good extent, since he attended neither graduate school nor 
kindergarten, though he managed to graduate from the University of Tokyo in 1952. Princeton 
University has been liberal enough to permit him to teach since 1962. His youthful aspiration was to 
become a fashion designer, but he reconciled himself to living in a less competitive world of 
mathematics, and testing his meager knowledge of low-dimensional topology by designing his wife’s 
dress occasionally. 


BARBARA L. OSOFSKY received her BA and MA from Cornell University, and her PHD from 
Rutgers University where she has been teaching ever since. Her primary research interest is modules 
over non-commutative rings, with excursions into commutative rings and algebraic applications of 
working with infinite cardinals. She has been on the editorial boards of the Proceedings of the AMS 
and the Journal of Algebra. For health and recreation, she has swum in a great many swimming pools 
all over the USA and abroad. 


SÁNDOR SZABÓ'S main mathematical interest is in algebra and its application. In particular, he has
worked on factors of groups related to Hajós' theorem. Recently some of his results played a role in the
construction of a tiling of 10-dimensional Euclidean space by congruent parallel cubes such that no 
pairs have a complete 9-dimensional facet. (This refutes a conjecture Keller made in 1930.) Based at 
the Technical University of Budapest, he frequently indulges his desire to see the world, serving as a 
visiting professor at the University of Dundee, University of California at Davis, University of the 
Pacific, and presently at the University of Bahrain. 




SHERMAN STEIN of the University of California at Davis has recently been applying algebra to 
geometric problems and, with Dean Hickerson, investigating combinatorial aspects of algebraic identi- 
ties. In addition he has maintained a strong involvement in pedagogy. One fruit of this interest is a 
3-year series of high school texts written with Cal Crabill and Don Chakerian and published by WINGS 
which emphasizes discovery and communication. 


ANTAL E. FEKETE is a graduate of the Eötvös Loránd University of Budapest (1955) and has an M.Sc.
from the University of Ottawa (1958). He has been on the faculty of Memorial University of
Newfoundland since 1958, with several one-year absences while on tour-of-duty elsewhere (Columbia
University, 1960; Trinity College, Dublin, 1965; Acadia University, 1970; Princeton University, 1975;
American Institute for Economic Research, Great Barrington, Mass., 1983; U.S. House of
Representatives, 1989). He is the author of Real Linear Algebra (Marcel Dekker, N.Y., 1985) and several Apropos
articles, which he hopes to develop into a mathematical literary genre.


FOLKE ERIKSSON studied at the University of Uppsala, Sweden, and for a half year in Copenhagen 
for Werner Fenchel. He has been teaching at Chalmers University of Technology since 1956, except for 
one year at Western Washington in Bellingham. He is now retiring. He has written a textbook 
(Swedish) on Calculus in Several Variables. Questions related to Solid Angles and the Law of Sines in 
dimension > 2 have been his main mathematical interest in the last years. 


CRAIG SMORYNSKI received his Ph.D. at the University of Illinois at Chicago Circle in 1973. His
research interests include logic and, more recently, the history of mathematics. He has written articles
on Gödel's Theorems for the Handbook of Mathematical Logic and the Handbook of Philosophical
Logic. His other published works include two books for Springer-Verlag, Self-Reference and Modal
Logic and Logical Number Theory I: An Introduction.


Buffon Noodles 


Immediately after my article Buffon 
Noodles appeared in the June/July, ‘94 
issue of this Monthly I received a refer- 
ence by electronic mail to a very interest- 
ing paper by J.F.C. Kingman which, along 
with a number of lively discussion papers 
and other references, will clearly be of 
interest to readers thinking about alterna- 
tive approaches and refinements of the 
noodle problem. The precise reference is: 


Kingman, J.F.C. (1982), The Thrown 
String, J.R. Statist. Soc. B 44(2), 109-138 


Thank you Colin Mallows at AT&T 
Bell Labs for the e-mail message. 


Professor Edward C. Waymire 
Department of Mathematics
Oregon State University 
Kidder Hall 368 

Corvallis, OR 97331-4605 
PROBLEMS AND SOLUTIONS


Edited by: 
Richard T. Bumby, Fred Kochman and Douglas B. West 


Proposed problems should be sent to the MONTHLY PROBLEMS address given on the
inside front cover. Please include solutions and relevant references. Three copies of
all items needed to evaluate the problem should be sent.


Solutions of published problems should arrive before March 31, 1995 at the MONTHLY
PROBLEMS address given on the inside front cover. If possible, solutions should
be typed with double spacing. Two copies suffice. Several solutions may be mailed
together, but they should be on separate sheets of paper. The problem number and the
solver's name and mailing address should appear on each solution. A mailing label
should be included if an acknowledgment is desired.


The published solution is likely to be based on a solution that is complete and correct.
Additional information, such as references to other appearances of the problem or its
solution, is also welcome.


An asterisk (*) after the number of a problem, or part of a problem, indicates that no
solution is currently available.


PROBLEMS 


10403. Proposed by David Doster, Choate Rosemary Hall, Wallingford, CT.

Define a sequence $(y_n)$ recursively by $y_0 = 1$, $y_1 = 3$ and

$$y_{n+1} = (2n + 3)y_n - 2n\,y_{n-1} + 8n$$

for $n \ge 1$. Find an asymptotic formula for $y_n$.
10404. Proposed by Behzad Djafari Rouhani, Shahid Beheshti University and Islamic Azad
University, Tehran, Iran.

Let $x_1, x_2, \ldots$ be a sequence of real numbers such that

Ini — xy] = [aint — xj44| 


for all positive integers i, j with $|i - j| \le 2$. Prove that $(x_n/n)$ converges to a finite limit
as $n \to \infty$.
10405. Proposed by Herbert Gülicher, Westfälische Wilhelms-Universität, Münster, Germany.


Let $A_1A_2A_3A_4A_5A_6$ be a hexagon circumscribed about a conic, and form the
intersections $P_i = A_iA_{i+2} \cap A_{i+1}A_{i+3}$ ($i = 1, \ldots, 6$, all indices mod 6). Show that the $P_i$ are the
vertices of a hexagon inscribed in a conic.


10406. Proposed by David C. Fisher, University of Colorado, Denver, CO, Karen L. Collins, 
Wesleyan University, Middleton, CT, and Lucia B. Krompart, Rochester, MI. 


Show that a path on an m by n square grid which starts at the northwest corner, goes 
through each point exactly once, and ends at the southeast corner divides the grid into two 
equal halves: (a) those regions opening north or east; and (b) those regions opening south 
or west. 


10407. Proposed by Roy Mathias, College of William & Mary, Williamsburg, VA. 


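Let $\lambda_1, \ldots, \lambda_{n+1}$ and $\mu_1, \ldots, \mu_n$ be 2n + 1 given real numbers such that

$$\lambda_1 \le \mu_1 \le \lambda_2 \le \mu_2 \le \cdots \le \lambda_n \le \mu_n \le \lambda_{n+1}, \qquad \mu_1 < \mu_2 < \cdots < \mu_n.$$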
Show that 
n fai — vy f= Lt yt 1 n+l n 2h n+l 
2 2 
apa { (Seow) edn 
j=1 [|] Mi —pjpiiz=l.n,i fx J i=1 i=l i=l i=l 


In particular, deduce that 


n 1{2@-)-1:i=L.@ty} n(n +1) 


fat. T1[2G-:iatni¢s] 2 


10408. Proposed by Peter W. Shor, AT&T Bell Labs, Murray Hill, NJ. 
Suppose that a function is defined as follows: 


fd,1)=1 
fG0=0 @209) 
fa, N=9 G22) 


and 
fG,/) =32i-j -—DFE-1,N)4+0-274+39fG-LI-DY 
otherwise. 
(a) Show that 
Li/2J+1 | 
SY) EDA FG AD HIG + DEF4)--- Bi -4). 
j=l 


(b) Find a closed form expression for f (27 — 1, /). 
10409*. Proposed by J. van de Lune, Centrum voor Wiskunde en Informatica, Amsterdam, The
Netherlands.


Let $p_1, p_2, p_3, \ldots$ denote the prime numbers in increasing order, and define

$$S_k = \{t \in \mathbb{R} : \sin(t \log(p_k)) > 0\}, \qquad C_k = \{t \in \mathbb{R} : \cos(t \log(p_k)) > 0\}$$

and

$$S_n^* = \bigcap_{k=1}^{n} S_k, \qquad C_n^* = \bigcap_{k=1}^{n} C_k.$$

Prove (or disprove) that the relative measure of $S_n^*$ and $C_n^*$ (in $\mathbb{R}$) is equal to $2^{-n}$. More
precisely, prove (or disprove) that

$$\lim_{T \to \infty} \frac{1}{2T}\,\lambda\big(S_n^* \cap [-T, T]\big) = 2^{-n}$$

and the corresponding statement for $C_n^*$, where $\lambda$ denotes Lebesgue measure.


NOTES 


(10406) A path meeting the conditions of the problem on a 5 by 8 grid is shown in figure 
10406 below. 


Figure 10406 


SOLUTIONS 


Non-zero Jacobians 


5881 [1972, 1042]. Proposed by D. E. Cooper, Hampton Institute, Virginia. 


Let U be a connected open subset of the plane, and let f be a map of U into the plane 
which is differentiable (in the sense of Walter Rudin, Principles of Analysis, p. 188). If the 
Jacobian of f is non-zero at every point of U, must the Jacobian have constant sign? 


Solution by M. J. Pelling, Balliol College, Oxford, England. The answer is affirmative 
and may be proved using results from the local topological analysis of mappings from 
2-manifolds to 2-manifolds. In particular, we use the following theorem from G. T. Why- 
burn, Topological Analysis, second edition, Princeton University Press, 1964, p.99. 
Theorem. Let f(X) = Y be light and open where X and Y are 2-manifolds and X is without
boundary, let q be any ordinary point of Y and let $p \in f^{-1}(q)$. There exists an integer k > 0
such that if B is any sufficiently small closed 2-cell on Y with q in its interior, the component
A of $f^{-1}(B)$ containing p is a closed 2-cell mapping onto B by a power map of degree k,
i.e. $f|_A$ is topologically equivalent to $w = z^k$ on $|z| \le 1$ (with p corresponding to 0, i.e.,
$A \cap f^{-1}(q) = p$).


To explain the terminology here, f is a continuous map of X onto Y, and an ordinary
point of Y means one in the interior of a 2-cell contained within Y; f is light means $f^{-1}(y)$
is totally disconnected for every y in Y; and f is open means f(V) is open in Y for every
open V in X. In the given problem we take X = U (a 2-manifold without boundary)
and Y = f(X) and show first that the theorem is applicable. Differentiability of f in the
given sense means that the coordinate functions of f have total differentials or equivalently
that, for each $x \in U$, $f(x + h) = f(x) + Mh + \eta$ where M is the Jacobian matrix and
the term $\eta = o(|h|)$ as $h \to 0$. Since the Jacobian $J(x) = |M| \ne 0$ it follows that in
the neighbourhood of each point of X, f behaves, to first order, like a non-singular linear
mapping. It follows that for every $x \in U$ there is a neighbourhood N of x such that $x_1 \in N$
and $x_1 \ne x$ implies that $f(x_1) \ne f(x)$, so that for every $y \in Y$, $f^{-1}(y)$ is a set of isolated
points and thus totally disconnected. So f is light.


Lemma. The map $f : U \to \mathbb{R}^2$ is strongly open, that is, for each open V in U, f(V) is an
open subset in $\mathbb{R}^2$. Hence Y = f(U) is open in $\mathbb{R}^2$ and so is a 2-manifold, and all f(V) are
relatively open in Y so that $f : U \to Y$ is open.


Proof. If f fails to be strongly open there will be a point $x \in U$ and a closed disc N centred
on x such that y = f(x) does not lie in the interior of f(N) in $\mathbb{R}^2$. N may be supposed
small enough to ensure, by the hypotheses on f, that $y \notin f(\partial N)$. Since f(N) and $f(\partial N)$
are closed one may choose $z \in \mathbb{R}^2$ near y so that $0 < d(z, f(N)) < d(z, f(\partial N))$ where d
denotes the usual $\mathbb{R}^2$ metric. If then d(z, f(N)) = d(z, f(t)) for some $t \in N$, which exists
because f(N) is closed, it follows that t lies in the interior of N. But if M is the Jacobian matrix
at t, $f(t + h) = f(t) + Mh + \eta$ will certainly be nearer to z than f(t) for all sufficiently
small h in a suitable direction, and t + h will be in N, contradicting the definition of f(t) as
a nearest point of f(N) to z. So after all f must be strongly open and the lemma is proved.

The theorem is therefore applicable, and since Y = f(U) is open in $\mathbb{R}^2$ every point of it
is ordinary, and every $p \in U$ is contained in the interior of a closed 2-cell A which maps by
f onto a closed 2-cell B containing f(p) in its interior, with $f|_A$ topologically equivalent to
a power mapping $w = z^k$ on $|z| \le 1$. Since p corresponds to 0, a small circle centered on p
will map to a curve winding k times about q; but since f is a non-singular linear mapping
to first order in the neighbourhood of p, the image curve only winds once round q, so that
k = 1 and $f|_A$ is a homeomorphism of A onto B. If a small circle centre p is imagined
oriented anticlockwise then the image curve will be oriented anticlockwise or clockwise
according as J(p) > 0 or J(p) < 0. And similarly for any other point $p^*$ in the interior
of A. But if A is a 2-cell then agreement in sense for two simple closed curves interior to
A is invariant under a homeomorphism of A (Whyburn, p. 63) and it follows that $J(p^*)$ and
J(p) have the same sign. Hence $U_+ = \{p : J(p) > 0\}$ and $U_- = \{p : J(p) < 0\}$ are
open sets and it follows by connectedness of U that $U = U_+$ or $U = U_-$.

It is of interest to note that the above has actually proved that a differentiable $f : U \to \mathbb{R}^2$
with non-zero Jacobian is locally a homeomorphism and so one gets a strong version of
the inverse function theorem for such mappings, without any continuity assumptions on the
partial derivatives of the coordinate functions of f.


There is no record of any other solution being received. 
One Norm Doesn’t Restrict Algebraic Conjugates


10221 [1992, 461]. Proposed by Raphael M. Robinson, University of California, Berkeley, 
CA. 


Let $\alpha$ and $\beta$ be conjugate algebraic numbers with $|\alpha| = 1$.
(a) Show that if $|\beta| \ne 1$, then $|\beta|^2$ must be irrational.
(b) Show that the possible values of $\beta$ are everywhere dense in the complex plane.


Solution by Richard Holzsager, The American University, Washington, DC. Since $\alpha$ is on
the unit circle, its complex conjugate $1/\alpha$ is among its conjugates over $\mathbb{Q}$. Therefore each
of its conjugates is accompanied by its reciprocal, and so $|\beta|^2 = \beta/\gamma$, where $\gamma = 1/\bar{\beta}$. If
this were a rational number r, then $\gamma$ and $r\gamma$ have the same minimal polynomial
$P(x) = x^n + a_{n-1}x^{n-1} + \cdots + a_0$, so $a_j = a_j r^{n-j}$, and r (being positive) must be 1.

To see the density of such $\beta$, we change the problem a bit. Since the roots occur in
reciprocal pairs, $a_j = a_{n-j}$ and n is even. Put n = 2m. Then $P(x)/x^m$ is a polynomial
Q(y) in $y = x + 1/x$. The polynomial P has a root on the unit circle if and only if Q has
a root in $[-2, 2]$. Therefore, if we can show that conjugates b of algebraic numbers a in
$[-2, 2]$ are dense in the plane, we can pass to $\beta = (b + \sqrt{b^2 - 4})/2$ to get the result for the
original situation.

In fact, any interval $(-r, r)$ is sufficient. To see this, consider the conjugate numbers
$u = 2^{1/4}$ and $v = ui$. Look at the two-dimensional subspace U of $\mathbb{R}^3$ consisting of vectors
(a, b, c) such that $au + bu^2 + cu^3 = 0$. Since $av + bv^2 + cv^3 = 0$ only for b = 0
and $a - 2^{1/2}c = 0$, the mapping $L : U \to \mathbb{C}$ given by $(a, b, c) \mapsto av + bv^2 + cv^3$
is a one-to-one linear function. Thus L is onto, and varying a, b, and c slightly gives a
rational combination of $v$, $v^2$, and $v^3$ that is arbitrarily close to a given number, while the
corresponding combination of $u$, $u^2$, and $u^3$ is in $(-r, r)$.
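
The approximation step can be illustrated numerically. The following is only a sketch in floating point, not part of the published solution: the helper names `rational_point_near` and `combos`, the target point, and the denominator bound are all assumptions made for illustration. It solves L(a, b, c) = w on the plane U over the reals, then rounds the coefficients to nearby rationals.

```python
from fractions import Fraction

# u = 2^(1/4) and its conjugate v = u*i, as in the solution above.
u = 2 ** 0.25
v = complex(0.0, u)

def rational_point_near(w, max_den=10**6):
    """Return rational (a, b, c) with a*v + b*v^2 + c*v^3 close to w
    and a*u + b*u^2 + c*u^3 close to 0.

    Hypothetical helper: solve exactly over the reals, then round each
    coefficient to a nearby rational with bounded denominator.
    """
    # On the plane a*u + b*u^2 + c*u^3 = 0 the map L sends (a, b, c) to
    # -b*u^2 + i*(a*u - c*u^3); solve L(a, b, c) = w for real a, b, c.
    a = (w.real + w.imag) / (2 * u)
    b = -w.real / u**2
    c = (w.real - w.imag) / (2 * u**3)
    return tuple(Fraction(t).limit_denominator(max_den) for t in (a, b, c))

def combos(coeffs):
    """Evaluate the real combination (in u) and its conjugate (in v)."""
    a, b, c = (float(t) for t in coeffs)
    real_comb = a * u + b * u**2 + c * u**3
    conj_comb = a * v + b * v**2 + c * v**3
    return real_comb, conj_comb

target = complex(3.7, -1.2)          # arbitrary illustrative target
coeffs = rational_point_near(target)
small, near_target = combos(coeffs)
print(coeffs, small, near_target)
```

Here `small` plays the role of the algebraic number in $(-r, r)$ and `near_target` that of its conjugate near the chosen point; raising `max_den` tightens both approximations.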


Editorial comment. All solvers proved refined versions of (b). Generally, constructions 
used cubic polynomials in the role of Q(y) to show that sextic irrationals suffice. Robert B. 
Israel showed that, in addition, $\alpha$ can be restricted to any interval of the unit circle. On
the other hand, the proposer and Ilan Kozma showed that, allowing all degrees, $\alpha$ could be
required to be an algebraic integer.


Solved also by R. J. Chapman (U. K.), R. B. Israel (Canada), S. Kanetkar, I. Kozma (Israel), O. P. Lossers (The 
Netherlands), and the proposer. 


Ratios in a Matrix Power 


10224 [1992, 362]. Proposed by Yves Nievergelt, Eastern Washington University, Cheney, WA.


Consider all 2 by 2 real matrices $A = (a_{i,j})$ having non-negative determinant, all entries
positive, and $a_{1,1} = a_{2,2}$. Also, for each positive integer p, denote by $a_{i,j}^{(p)}$ the entries of the
power $A^p$. Prove that [displayed formula lost in extraction].


Solution by Benjamin G. Klein, Davidson College, Davidson, NC. For notational convenience, write

$$A = \begin{pmatrix} a & b \\ c & a \end{pmatrix}.$$

We assume only that a and bc are positive. (The assumptions that
$a^2 - bc \ge 0$ and $b > 0$ are superfluous.) Let $\lambda = a + \sqrt{bc}$ and $\mu = a - \sqrt{bc}$. Computations
involving eigenvalues lead to the following formula, which is readily verified by induction.

$$A^p = \frac{1}{2}\begin{pmatrix} \lambda^p + \mu^p & \sqrt{b/c}\,(\lambda^p - \mu^p) \\ \sqrt{c/b}\,(\lambda^p - \mu^p) & \lambda^p + \mu^p \end{pmatrix} \quad \text{for } p = 1, 2, 3, \ldots.$$

Thus

$$\frac{a_{1,2}^{(p)}}{a_{1,1}^{(p)}} = \sqrt{\frac{b}{c}} \cdot \frac{1 - (\mu/\lambda)^p}{1 + (\mu/\lambda)^p}.$$

Since the absolute value of $\lambda$ is greater than the absolute value of $\mu$, $(\mu/\lambda)^p \to 0$ as $p \to \infty$,
and the desired result follows at once.
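
The closed form for the powers of a matrix with rows (a, b) and (c, a), namely half of the matrix with diagonal entries $\lambda^p + \mu^p$ and off-diagonal entries $\sqrt{b/c}\,(\lambda^p - \mu^p)$ and $\sqrt{c/b}\,(\lambda^p - \mu^p)$, where $\lambda, \mu = a \pm \sqrt{bc}$, can be spot-checked numerically. The sketch below is an illustration only: the sample values of a, b, c and the function names are assumptions, and the comparison is in floating point rather than a proof.

```python
import math

def mat_mul(X, Y):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_pow(A, p):
    """A**p by repeated multiplication (exact for integer entries)."""
    R = [[1, 0], [0, 1]]
    for _ in range(p):
        R = mat_mul(R, A)
    return R

def closed_form(a, b, c, p):
    """Closed form built from lambda = a + sqrt(bc), mu = a - sqrt(bc)."""
    root = math.sqrt(b * c)
    lam, mu = a + root, a - root
    s, d = lam**p + mu**p, lam**p - mu**p
    return [[s / 2, math.sqrt(b / c) * d / 2],
            [math.sqrt(c / b) * d / 2, s / 2]]

def ratio(A, p):
    """Off-diagonal over diagonal entry of A**p."""
    Ap = mat_pow(A, p)
    return Ap[0][1] / Ap[0][0]

a, b, c = 3, 2, 4            # illustrative values: positive entries, a*a - b*c >= 0
A = [[a, b], [c, a]]
for p in (1, 2, 5, 10, 40):
    print(p, ratio(A, p), math.sqrt(b / c))
```

For these sample values the printed ratio settles quickly at $\sqrt{b/c}$, in line with $(\mu/\lambda)^p \to 0$.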


Solved by 44 readers and the proposer. 


A Half-Binomial Identity 


10229 [1992, 570]. Proposed by Herman Bavinck, Delft University of Technology, Delft, The 
Netherlands. 


Given that m and p are integers with $m \ge p \ge 1$, evaluate [displayed sum lost in extraction].


Solution by O. P. Lossers, University of Technology, Eindhoven, The Netherlands. The sum
is equal to

$$\frac{p}{2m(2m+1)}\binom{-1/2}{m-p}\binom{-1/2}{m+p}.$$

This result was found by extensive use of binomial identities, but it is easily verified by
induction on p: The result is easily seen to hold for p = 1 (or for p = 0), and the induction
step requires only the straightforward identity

$$\frac{p-1}{2m(2m+1)}\binom{-1/2}{m-p+1}\binom{-1/2}{m+p-1} = \frac{p}{2m(2m+1)}\binom{-1/2}{m-p}\binom{-1/2}{m+p} + \binom{1/2}{m-p+1}\binom{1/2}{m+p}.$$


Editorial comment. The identity actually holds for all integers $p \ge 0$. Some more general
results were found through the use of identities for hypergeometric series. In particular, 
Francois Grondin reduced the formula to a special case of a result of J. L. Lavoie, “Partial 
sums of coefficients of well-poised hypergeometric series”, Boll. Un. Mat. Ital. (3)21 (1966), 
346-352. The solution can also be found with Gosper’s indefinite summation algorithm, 
which is implemented in several computer algebra systems. 

Solved also by D. Alvis, J. Anglesio (France), R. J. Chapman (U. K.), F. Grondin (Canada), H. van Haeringen (The 
Netherlands), I. Kastanas, I. Nemes (Austria), D. Zeilberger, and the proposer. 


Another Version of the Axiom of Choice 


10245 [1992, 675]. Proposed by M. A. Bezem, Utrecht University, Utrecht, The Netherlands, 
and A. J. C. Hurkens, Catholic University, Nijmegen, The Netherlands. 


Let S be a set of finite, non-empty sets. A transversal of S is a set which has a non-empty
intersection with every element of S. The Principle of Minimal Transversal states that every
such S has a transversal that is minimal with respect to set inclusion. Prove that the Axiom
of Choice is equivalent to the Principle of Minimal Transversal.
Solution by S. F. Barger, Youngstown State University, Youngstown, OH. For a set S of finite,
non-empty sets, the intersection of a chain of transversals of S is evidently a transversal of
S, so Zorn’s Lemma (equivalent to the Axiom of Choice) implies the Principle of Minimal
Transversal. Conversely, for any partially ordered set P, let S be the set of all 2-element
antichains in P. A subset $C \subseteq P$ is a chain if and only if it contains no element of S; i.e., if
and only if $P - C$ is a transversal of S. Hence a minimal transversal of S is the complement
of a maximal chain in P, so the Principle of Minimal Transversal implies Zorn’s Lemma.
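
The correspondence between maximal chains and minimal transversals can be checked by brute force on a small finite poset. The example below is only an illustration of the converse direction, under assumptions not in the solution: the poset is taken to be {1, ..., 10} ordered by divisibility, and all function names are hypothetical.

```python
from itertools import combinations

def comparable(x, y):
    """Divisibility order: x and y are comparable if one divides the other."""
    return x % y == 0 or y % x == 0

def two_element_antichains(P):
    """All 2-element subsets of P whose members are incomparable."""
    return [frozenset(pair) for pair in combinations(sorted(P), 2)
            if not comparable(*pair)]

def is_transversal(T, S):
    """T meets every member of S."""
    return all(T & s for s in S)

def is_minimal_transversal(T, S):
    """T is a transversal and no element can be removed from it."""
    return is_transversal(T, S) and all(
        not is_transversal(T - {t}, S) for t in T)

def is_chain(C):
    return all(comparable(x, y) for x, y in combinations(sorted(C), 2))

def is_maximal_chain(C, P):
    """C is a chain and no further element of P can be added to it."""
    return is_chain(C) and all(not is_chain(C | {x}) for x in P - C)

P = frozenset(range(1, 11))
S = two_element_antichains(P)
C = frozenset({1, 2, 4, 8})     # a maximal chain under divisibility
T = P - C                        # its complement
print(is_maximal_chain(C, P), is_minimal_transversal(T, S))
```

In a finite poset the check is routine; the point of the published argument is that the same complementation works for arbitrary P, where the existence of a minimal transversal is exactly what the Principle supplies.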


Solved also by R. Holzsager, I. Kastanas, O. P. Lossers (The Netherlands), K. Schilling, K. Smith & R. Griffus 
(students), Western Maryland College Problems group, and the proposers. Four incorrect solutions were received. 


Reconstructing an Isosceles Triangle 


10249 [1992, 782]. Proposed by O. Yumlu, Munich, Germany. 


Suppose that the inradius of an isosceles triangle and the ratio of the distances from its 
incenter to its vertices are given. Give a Euclidean construction of the triangle. 


Solution by Paul Yiu, Florida Atlantic University, Boca Raton, FL. Let $\triangle ABC$ be the desired
isosceles triangle with AB = AC, incenter I and inradius r. Construct an isosceles right
triangle $\triangle IXY$ with IX = XY = r and a right angle at X. Let $X'$, $X''$ be points on the
line IX such that $IX' : IX'' = k$ = the given ratio IA : IB. (It is assumed that k is given
in constructible form.) The line through $X'$ parallel to $X''Y$ meets the line IY at a point
$Y'$ such that $IY' = \sqrt{2}\,kr$. On the line XY, mark a point P with $XP = IY' = \sqrt{2}\,kr$.
Let $\mathcal{C}$, $\mathcal{C}'$ be the circles with center I passing through X and P respectively. Note that a
tangent from any point on $\mathcal{C}'$ to $\mathcal{C}$ has square length $2k^2r^2$. Mark a point Q on $\mathcal{C}$ with
XQ = r, and extend XQ to meet $\mathcal{C}'$ at K. Finally, extend the ray XI to a point A such that
IA = XK. The triangle bounded by the tangents to $\mathcal{C}$ from A and the tangent at X is the
desired isosceles triangle.

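To justify the construction, let $\alpha$ and $\beta$ be the angles at A and B respectively. Since
$IA(IA - r) = KX(KX - QX)$ = the square length of the tangents from K to $\mathcal{C}$ = $2k^2r^2$,
we have

$$\frac{r}{\sin\frac{\alpha}{2}}\left(\frac{r}{\sin\frac{\alpha}{2}} - r\right) = 2k^2r^2,$$

$$1 - \sin\frac{\alpha}{2} = 2k^2\sin^2\frac{\alpha}{2}. \qquad (1)$$

Since $\frac{\alpha}{2} + \beta = \frac{\pi}{2}$,

$$\sin\frac{\alpha}{2} = \cos\beta = 1 - 2\sin^2\frac{\beta}{2}. \qquad (2)$$

From (1) and (2) it follows that $\sin\frac{\beta}{2} = k \cdot \sin\frac{\alpha}{2}$, and $IA = k \cdot IB$.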
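
The relation $IA(IA - r) = 2k^2r^2$ together with $IA = r/\sin(\alpha/2)$ and $IB = r/\sin(\beta/2)$ can be checked numerically. The sketch below is an illustration only, not the Euclidean construction itself: the function name and the sample values of r and k are assumptions. It solves the quadratic for IA, recovers the angles, and compares IA/IB with k.

```python
import math

def isosceles_from_inradius_and_ratio(r, k):
    """Return (IA, IB, alpha, beta) for inradius r and ratio k = IA : IB.

    IA is the positive root of IA*(IA - r) = 2*k^2*r^2; the angles follow
    from sin(alpha/2) = r/IA and alpha/2 + beta = pi/2.
    """
    IA = (r + math.sqrt(r * r + 8 * k * k * r * r)) / 2
    half_alpha = math.asin(r / IA)
    beta = math.pi / 2 - half_alpha
    IB = r / math.sin(beta / 2)
    return IA, IB, 2 * half_alpha, beta

for k in (0.5, 1.0, 2.0):
    IA, IB, alpha, beta = isosceles_from_inradius_and_ratio(1.0, k)
    print(k, IA / IB, math.degrees(alpha), math.degrees(beta))
```

With k = 1 the triangle is equilateral, which gives a quick sanity check on the formula for IA.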


Solved also by E. Alkan (student, Turkey), N. K. Artemiadis (Greece), S. F. Barger, A. Berenstein (student, Colombia), 
M. V. Bjelica (Yugoslavia), R. J. Chapman (U. K.), A. Coffman, I. Dimitrié, J. Dou (Spain), J. Fukuta (Japan), H. Helfgott 
(student, Peru), I. Kastanas, O. P. Lossers (The Netherlands), J. McHugh, P. Ranaldi, G. Velissarios (Greece), and the 
Anchorage Math Solutions Group. One incorrect solution was received. 


Collaborating editors: David F Appleyard, Paul T. Bateman, Bruce C. Berndt, Duane 
M. Broline, Barry W. Brunson, Frank S. Cater, Gulbank D. Chakerian, Underwood Dudley, 
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. Griggs, 
Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, Daniel J. Kleitman, 
Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer, Stephen L. Portnoy, J. O. Shallit, John 
Henry Steelman, Kenneth B. Stolarsky, David E. Tepper, Douglas B. Tyler, Daniel Ullman, and 
William E. Watkins. 
REVIEWS


Edited by Darrell Haile 
Brouwer’s Intuitionism. By Walter P. van Stigt, Elsevier Science, New York,
1990, xxvi + 532 pp, $94.70.


Reviewed by C. Smorynski 


Some years ago my topology teacher told the class that Brouwer had been a 
competent topologist who went off the deep end and invented intuitionism. More 
recently, in The Emperor’s New Mind, Roger Penrose retold the same story in
greater detail, if a bit more politely, with his remark that “intuitionism was 
initiated, in 1924, by the Dutch mathematician L. E. J. Brouwer as an alternative 
response—distinct from that of formalism—to the paradoxes (such as Russell’s) 
that can arise when too free a use of infinite sets is made in mathematical
reasoning.” Concerning the firing of Brouwer from the editorial board of Mathe- 
matische Annalen, Constance Reid, in her biography of David Hilbert, tells us that 
this was because Brouwer insisted on handling all papers on topology or of Dutch 
authorship and that he caused unconscionable delays with these papers. Simulta- 
neously, in Abraham Fraenkel’s autobiography, we can learn that Brouwer was 
fired because he rejected too many papers written by Russian-Jewish mathemati- 
cians. Fraenkel does, however, compare Brouwer favourably with van der Waer- 
den, who remained in Nazi Germany throughout the war. While van der Waerden 
has been reconciliated with his compatriots, Brouwer is still widely believed to 
have been a Nazi, and indeed the pre-publication version of one major obituary of 
him reported that Brouwer had joined the SA and marched in its parades, his hair 
flowing majestically in the wind. 

Brouwer was not a topologist before becoming an intuitionist. He did not 
become an intuitionist in response to the paradoxes. His firing from the editorial 
board of Mathematische Annalen had nothing to do with either his performance or 
any antisemitism on his part. And, there is no evidence whatsoever that Brouwer 
was ever a Nazi. There are kernels of truth in some of these stories, but no real 
fruit. The transition from topology to intuitionism is only apparent. Brouwer’s 
dissertation of 1907 had been on the foundations of mathematics. In 1908 he 
published two papers on intuitionistic lines, one rejecting the higher infinite 
cardinals and one questioning the validity of the Law of Excluded Middle in 
infinite domains. Thereafter, at the suggestion of his advisor, D. J. Korteweg (of 
the Korteweg-de Vries equation), Brouwer worked in classical mathematics, 
single-handedly developing algebraic topology. Brouwer seemingly abandoned 
intuitionism for several years for the sake of job procurement and security. This 
seeming desertion was, however, merely apparent. All along, Brouwer developed 
the type of topology which he believed could be rendered intuitionistically accept- 
able. After he had become an internationally recognised mathematician, he began 
his return to intuitionism, his historically most important paper being published in 
two parts in 1918 and 1919 (not 1924) and accompanied by commentaries in Dutch
and German journals. In these papers he rejected outright the application of the 
Law of the Excluded Middle in infinite domains, and yet was greatly successful in 
developing set theory intuitionistically. 

The true reasons behind Brouwer’s firing from the board of editors of Mathe- 
matische Annalen have already been discussed by Dirk van Dalen in the Mathemat- 
ical Intelligencer (vol. 12, #4 (1990), pp. 17-31) and need not be repeated here. 
Suffice it to say that Constantin Caratheodory, one of the senior editors, consid- 
ered Brouwer to be the most conscientious editor on the staff, and that, when 
Brouwer subsequently founded his own journal, his Nazi friend Ludwig Bieberbach 
was displeased by the number of Jewish editors the journal had. It is true that 
Brouwer had Nazi friends and that, because of property owned in Germany, he 
was slow to divorce himself from the regime. It is also true that he refused to break 
off his friendship with one Dutch mathematician who had been a Nazi during the 
war and even tried to find the fellow a position afterwards. But none of this makes 
Brouwer himself a Nazi, and he was even cleared by the Dutch government after 
the war. 

The sources of some of the misinformation on Brouwer are obvious. Brouwer’s 
influential paper on intuitionism—the one that converted Hermann Weyl and 
precipitated the notorious Grundlagenstreit of the 1920s—came after his famous 
work on algebraic topology. The conclusion drawn by my topology teacher and 
Penrose is an immediate corollary. That Brouwer’s intuitionism was a response to 
the paradoxes is a corollary to the general Theorem that all foundational work is a 
response to Russell’s paradox. Even Hilbert, who discovered paradoxes in the 
mid-to-late 1890s, whose student Ernst Zermelo had discovered Russell’s paradox 
by the late 1890s, who publicly cited a consequence of the paradoxes in his address 
to the International Congress of Mathematicians in 1900, and who himself was not 
unduly moved by them (he didn’t even bother to mention them in 1900) until he 
saw Russell’s and Dedekind’s overreactions to them—even Hilbert, I say, who 
knew that Leopold Kronecker died in 1891 before the paradoxes were known, 
concluded that Kronecker’s rejection of set theory was a response to the para- 
doxes. In point of fact, the paradoxes had little direct impact. Russell, Frege, 
Dedekind, and Weyl took them seriously. Hilbert took the reaction to them 
seriously, but was not directly moved by them. He only sprang into action when 
Weyl was converted to intuitionism, in which approach to mathematics the para- 
doxes were irrelevant. Hilbert’s foundational work of the 1920s was a response to 
intuitionism, not to the paradoxes. Yet he paraded it as a response to the 
paradoxes, expanding their role to include Kronecker and, by implication, Brouwer. 

The fact of the matter is that much of what we know about Brouwer was written 
by Hilbert, a great mathematician, but an unreliable historian. If, for example, we 
read that Brouwer attacked Hilbert’s formalism as a “formula game,” we can trace 
the accusation to one of Hilbert’s papers. We will not find it in any of Brouwer’s 
papers, but we will find that Hermann Weyl used the phrase! In all of Hilbert’s 
papers of the 1920s there is, in fact, but a single direct reference to any of 
Brouwer’s writings—and this concerns the possibility of finding decimal expan- 
sions of real numbers. Every other reference to Brouwer by Hilbert is either a 
reference to something Weyl said or Kronecker did; it is not a direct reference to 
Brouwer. 

If one reads the papers of Hilbert and Brouwer of the 1920s, one begins to 
doubt there ever was a Grundlagenstreit at all. One finds Hilbert bravely fighting 
the ghost of Kronecker, or a cardboard replica of Brouwer unwittingly fabricated 
by Weyl. Those familiar with the anecdotes about Kronecker or the more recent
antics of Bishop will be surprised to find that Brouwer’s papers of the period are 
not the least bit dogmatic. Unlike Kronecker who argued ad hominem, or Weyl 
and Bishop who simply labelled absurd that with which they disagreed, Brouwer 
offered careful criticism of formalism and argued dispassionately for intuitionism. 
The only show of temper on Brouwer’s part in the foundational struggle occurred 
late, when Brouwer declared that formalism need not sneer at intuitionism 
considering both that it had not accomplished anything (true enough) and that it 
had plagiarised intuitionism (very probably true, but not yet proven to the histo- 
rian’s satisfaction). Perhaps what Brouwer said in public differs from what he 
wrote; if so, I would like to learn of it. For the time being, however, it seems to me 
that Brouwer’s reputation for the sort of behaviour correctly associated with 
constructivist-fascists like Kronecker and Bishop rests entirely on Hilbert’s accusa- 
tions based on Weyl’s bombastic misrepresentations. 

I hesitate to say that it is time to rehabilitate Brouwer. He was, after all, no 
saint. He did battle with Hilbert (albeit only on tactics to better the treatment of 
German scientists after the First World War). He fought numerous battles over 
priority and could be quite petty in such fights. He was also a rabid sexist. We 
certainly have no cause to revere the man; but we should not revile him either. 
Brouwer, the founder of algebraic topology and of one of the few coherent 
philosophies of mathematics, was one of the great mathematicians of our century, 
the greatest Holland has produced since Christiaan Huyghens, and he deserves 
better from the mathematical community than he has received. 

With some reservations I would refer the curious or even repentant reader to 
van Stigt’s book for a more accurate picture of Brouwer than his topology teacher 
may have given him or he may have presented to his own students. The most 
serious reservation is that the book is a work on the philosophy of mathematics, 
not the history thereof, and is not, therefore, the appropriate choice for the 
mathematical community. It is to a large extent, however, the only choice currently 
available and we must consider it. Only the first chapter, semi-biographical in 
character, is of direct relevance to the mathematician. This biography, however 
sketchy, is the most extensive one available in English and, however much it 
focuses on philosophy, it does touch on Brouwer’s mathematical work, particularly 
on his topology, and its relation to his philosophy. The chapter certainly ought to 
correct the widespread misunderstanding about the late origin of Brouwer’s 
foundational interests or their being responses to the paradoxes. The discussion of 
the Grundlagenstreit, of necessity short, will do nothing to replace the popular 
fiction and must therefore be considered inadequate for revising popular Brou- 
werian history. (Proof of this inadequacy is at hand: Barrow’s recent Pi in the Sky 
relies heavily on the present book and treats Brouwer as not only an active 
participant in the fight, but also as the villain.) Van Stigt does run rings around 
Reid and Fraenkel on the matter of Brouwer’s firing from Mathematische Annalen, 
but van Dalen’s already cited Intelligencer article is superior. As to Brouwer’s 
postwar difficulties, van Stigt has the right attitude, but offers no details. All of 
this, of course, is not a criticism of the book, but indication of how it doesn’t serve 
a purpose for which it wasn’t intended—whence the reservation cited above. 

Brouwer’s Intuitionism itself cannot, however, escape criticism. Reviewers have 
made it clear that it is not entirely reliable. It was clearly not proofread and may 
not even have been edited. The fact, for instance, that repeated Brouwerian 
quotations are translated anew on repetition suggests this latter. Conclusions and 
speculations are offered as facts on inadequate grounds. For example, in discussing 
the Mathematische Annalen affair, van Stigt quotes Brouwer’s speculation that
Hilbert’s motive was anger at Brouwer’s declining a position in Göttingen in 1919
and, instead of critically examining this or recalling other documented sources of 
Hilbert’s anger (which would suggest Brouwer was not completely right in his 
estimation), van Stigt supports the assertion with further speculation. As an 
indication of just how far from the mark such speculation can get, I might mention 
van Stigt’s remark on Brouwer’s fellowship with various young poets: “Brouwer 
lacked the ability to express his feelings artistically”. This certainly runs counter to 
what every Dutchman I’ve spoken to has said of Brouwer’s correspondence with 
Adama van Scheltema: Brouwer’s prose is uniformly favourably compared with 
that of the poet. One criticism I’ve read and must agree with is that van Stigt’s 
psychological analysis of Brouwer is excessive and excessively negative. Indeed, it 
might be said that van Stigt has done a hatchet job on Brouwer. But, in doing so, 
he has still managed to paint a more positive image of Brouwer than is generally 
displayed to mathematicians and the reader can get a first, albeit rough, approxi- 
mation to Brouwer from van Stigt’s book. Thus, with the reservations that the book 
is only tangentially relevant to the mathematical community and that it must be 
approached very critically, I refer the reader to it for a faltering first step in getting 
to know the real Brouwer. 


429 South Warwick Avenue 
Westmont, IL 60559 


There are in this world optimists who feel 
that any symbol that starts off with an 
integral sign must necessarily denote 
something that will have every property 
that they should like an integral to pos- 
sess. This of course is quite annoying to us 
rigorous mathematicians; what is even 
more annoying is that by doing so they 
often come up with the right answer. 


—E.J. McShane 


Bulletin of the American Mathematical Society 
v 69, p 611, 1963. 


Answer to Picture Puzzle 
(p. 770) 


No, they are not: they are Emile Borel and 
Armand Borel. 
General, P, L. Comic Sections: The Book of
Mathematical Jokes, Humour, Wit and Wisdom. 
Desmond MacHale. Boole Pr, 1993, v + 152 pp, 
(P). [ISBN 1-85748-007-4; 1-85748-006-6] 
A considerable compendium of jokes, quota- 
tions, cartoons, excerpts, bloopers, verse, and 
historical anecdotes on (loosely) mathematical 
themes. Inevitably, the amusement index varies 
wildly, but on balance a valuable resource for 
common rooms, department chairs, and the con- 
versationally challenged. PZ 


General, P, L**. Proofs Without Words: Ex- 
ercises in Visual Thinking. Roger B. Nelsen. 
Classroom Resource Materials, No. 1. MAA, 
1993, ix + 152 pp, $23 (P). [ISBN 0-88385- 
700-6] A delightful collection of “pictures or 
diagrams that help the observer see why a partic- 
ular statement may be true, and also to see how 
one might go about proving it true.” Sections 
on geometry and algebra; trigonometry, calcu- 
lus and analytic geometry; inequalities; integer 
sums; sequences and series. AO 


General, L*. The Visual Mind: Art and Math- 
ematics. Ed: Michele Emmer. MIT Pr, 1993, 
xvii + 274 pp, $39.95. [ISBN 0-262-05048-X]
35 essays by mathematicians and visual artists, 
in 4 sections: geometry and visualization; com- 
puter graphics, geometry and art; symmetry; 
perspective, mathematics, and art. Many color 
illustrations. A fascinating approach to both 
the mathematics of visual art and the visualiza- 
tion of mathematics, and a valuable resource for 
mathematics appreciation courses. AO 


1994] TELEGRAPHIC REVIEWS


Reference, P, L*. LaTeX: A Document
Preparation System User's Guide and Reference
Manual, Second Edition. Leslie Lamport.
Addison-Wesley, 1994, xvi + 272 pp, (P).
[ISBN 0-201-52983-1] Reflects changes
introduced with LaTeX 2ε, the new standard
version of LaTeX. (First Edition, TR,
November 1986.) AO


Reference, P*, L*. The LaTeX Companion.
Michel Goossens, Frank Mittelbach, Alexander
Samarin. Addison-Wesley, 1994, xxx +
528 pp, (P). [ISBN 0-201-54199-8] Advanced
guide to LaTeX 2ε and to 150+ packages
that extend or modify the basic LaTeX system
(e.g., the New Font Selection Scheme,
AMS-LaTeX). Essential for serious LaTeX users. AO


Reference, C, P. Math Into TeX: A Simple
Introduction to AMS-LaTeX. George Grätzer.
Birkhäuser, 1993, xxix + 294 pp, $42.50 (P),
disk included. [ISBN 0-8176-3637-4]
Describes and illustrates AMS-LaTeX (a high-level
mathematical document processor, built atop
TeX) for both rank beginner and seasoned TeX
and LaTeX users. Three parts: basic introduction
(including installation on PC, Macintosh);
comprehensive reference manual; advanced topics.
Complete index, useful appendices. Informal
exposition; many examples. Many sample files
provided on diskette. PZ


Reference, P, L. Memorabilia Mathemat- 
ica: The Philomath’s Quotation Book. Robert 
Edouard Moritz. Spectrum Ser. MAA, xiii 
+ 410 pp, $24 (P). [ISBN 0-88385-513-5] 
Reprint of 1914 compilation of 1140 Anec- 
dotes, Aphorisms, and Passages by Famous 
Mathematicians, Scientists & Writers. Com- 
panion to Out of the Mouths of Mathemati- 
cians. BC 
Reference, P, L. Out of the Mouths of
Mathematicians: A Quotation Book for Philomaths.
Rosemary Schmalz. Spectrum Ser. MAA,
1993, x + 294 pp, $29 (P). [ISBN 0-88385-509-7]
Companion to Memorabilia Mathematica,
with up-to-date quotes. This reviewer’s
favorite, due to Mark Kac: “There are worse
things than being wrong, and being dull and
pedantic are surely among them.” BC


Recreational Mathematics, P. Challenging 
Puzzles. Colin Vout, Gordon Gray. Cambridge 
Univ Pr, 1993, 126 pp, $14.95 (P). [ISBN 
0-521-44602-3] 100 interesting and amusing 
puzzles, with hints and solutions. DH 


Recreational Mathematics, P. How Amaz- 
ing. Charles Snape, Heather Scott. Cambridge 
Univ Pr, 1992, 48 pp, $9.95 (P). [ISBN 0-521- 
35672-5] Colorful photographs and drawings 
of famous mazes and related puzzles. Elemen- 
tary combinatorial situations, topological tricks, 
and network problems. Fun for everyone; use- 
ful resource for middle school teachers. MW 


Elementary, T*(13: 1). Intermediate Algebra: 
Graphs and Functions. Roland E. Larson, et al. 
DC Heath, 1994, xxii + 905 pp. [ISBN 0-669- 
33755-2] Emphasizes problem solving skills, 
algebraic modeling. Includes discussion prob- 
lems, optional sections on graphing calculators, 
computer graphing. TH 

Elementary, S(15-18), C. NUMERATOR in 
the Mathematics Classroom. Peter Armstrong, 
et al. Comp. in Math. Teach. Ser. Mathemati- 
cal Assoc, 1993, iv + 20 pp, $3.75 (P). [ISBN 
0-906588-30-8] Report on pros and cons of 
NUMERATOR, British software that simulates 
function machines. Makes comparisons with 
spreadsheets, LOGO, BASIC. MW 


Education, S(15-18). Number Sense Now! 
Reaching the NCTM Standards. Proj. Dir.:
Francis Fennell. NCTM, 1993, xiv + 128 pp, 
$98 (P), with 3 videos. [ISBN 0-87353- 
361-5] Three half-hour classroom vignettes, 
showing: (1) nature and importance of num- 
ber sense; (2) instructional strategies, empha- 
sizing communication and hands-on activities; 
(3) real-life applications. Guidebook contains 
vignette summaries, background information, 
and blackline masters. Useful for pre-service 
and in-service elementary teachers. MW 


Education, S(17-18), P. Rethinking Elemen- 
tary School Mathematics: Insights and Issues. 
Eds: Terry Wood, et al. J. for Res. in Math. 
Educ., No. 6. NCTM, 1993, vi + 122 pp, 
$7 (P). [ISBN 0-87353-362-3] Report on 
a field-based study of second-graders’ math- 
ematical learning in a constructivist-oriented 
classroom. Data collection from whole-class 
teaching experiments, individual clinical interviews,
and pair collaborations. Methodology
may serve as a model for other studies. Con- 
clusions support blending cognitive and socio- 
logical interpretations of mathematical activity. 
Final chapters offer advice for educational in- 
novators and researchers. MW 


Education, S(15-17), C. Spreadsheets: Ex- 
ploring their Potential in Secondary Mathemat- 
ics. David Green, Peter Armstrong, Richard 
Bridges. Comp. in Math. Teach. Ser. Mathe- 
matical Assoc, 1993, ix + 154 pp, $8.50 (P). 
[ISBN 0-906588-29-4] Basic concepts and 
example spreadsheets for arithmetic, algebra, 
statistics. Ideas for dynamic modeling and sim- 
ulation. Disk available for Excel. MW 


Education, S(15-17). Graphic Calculators in 
the Mathematics Classroom. Peter Arter, et 
al. Comp. in Math. Teach. Ser. Mathematical 
Assoc, 1993, vii + 124 pp, $6.75 (P). [ISBN 
0-906588-31-6] Introduction to use of calcu- 
lators; keystroke sequences and programs for 
Casio fx-7000, fx-7700, and TI-81. Ideas for 
exploring algebra, graphing, and statistics. Or- 
ganization sometimes confusing. MW 


Education, S(15-17), P. Reforming Science 
Education: Social Perspectives and Personal 
Reflections. Rodger W. Bybee. Ways of Know- 
ing in Sci. Ser. Teachers College Pr, 1993, xvii 
+ 197 pp, $19.95 (P). [ISBN 0-8077-3260-5] 
10 provocative essays, written over 15 years, 
on precollege science education reform. AO 


Education, S(14-18). Activities for Active 
Learning and Teaching: Selections from the 
Mathematics Teacher. Eds: Christian R. Hirsch, 
Robert A. Laing. NCTM, 1993, v + 244 pp, 
$11 (P). [ISBN 0-87353-363-1] 35 activi- 
ties on problem solving, numeracy, algebra and 
graphs, geometry and visualization, data anal- 
ysis and probability. Some computer activities 
updated to use graphing calculators. Resource 
for teachers of grades 7-10. MW 


Education, S. Self-Directed Problem Solv- 
ing: Idea Production in Mathematics. M. 
Ann Dirkes. University Pr of America, 1993, 
123 pp, $17.50 (P). [ISBN 0-8191-9130-2] 
Describes a program in which “students aggres- 
sively pursue what they already know and use 
it to construct new concepts.” Primary vehi- 
cle seems to be use of various brainstorming 
techniques. Claims to be effective in secondary 
school and college mathematics courses, but 
mathematical content is arithmetic. MW 


Education, S(17-18), P. Reconstructing Math- 
ematics Education: Stories of Teachers Meeting 
the Challenge of Reform. Deborah Schifter, 
Catherine Twomey Fosnot. Teachers College
Pr, 1993, xv + 215 pp, $18.95 (P). [ISBN 0-8077-3206-0]
Stories of teachers implementing
constructivist-based teaching strategies in
elementary classrooms. Valuable resource for 
in-service programs. MW 


Education, P. Classroom Dynamics: Imple- 
menting a Technology-Based Learning Envi- 
ronment. Ellen B. Mandinach, Hugh F. Cline. 
Lawrence Erlbaum Assoc, 1994, xvi + 211 pp, 
$49.95. [ISBN 0-8058-0555-9] Case study 
of a project using simulation software in high 
school science. Reveals curriculum and teach- 
ing issues inherent in use of technology. MW 


History, P**, L. Golden Years of Moscow 
Mathematics. Eds: Smilka Zdravkovska, Pe- 
ter L. Duren. History of Math., V. 6. AMS, 
1993, ix + 271 pp, $94. [ISBN 0-8218-9003-4] 
11 essays/reminiscences on mathematical life 
at Moscow State University in the half-century 
following the Revolution. Fascinating glimpse 
of a vital and fertile mathematical milieu. SK 


History, S**(13-17), L**. Ars Magna or The
Rules of Algebra. Girolamo Cardano. Transl: T. 
Richard Witmer. Dover, 1993, xxiv + 267 pp, 
$8.95 (P). [ISBN 0-486-67811-3] Transla- 
tion of classic Renaissance text from 1545, us- 
ing modern algebraic notation. A must for any- 
one interested in mathematical history. (1968 
MIT edition, TR, June-July 1969.) DP 


History, S, P, L*. Lectures in the History of 
Mathematics. Henk J.M. Bos. Hist. of Math., 
V. 7. AMS, 1993, x + 197 pp, $86. [ISBN 0- 
8218-9001-8] 11 thought-provoking, insight- 
ful, informative lectures on topics from 17th 
and 18th century mathematics. BH 
Combinatorics, T(17-18: 1), S, P, L. Com- 
plexity: Knots, Colourings and Counting. 
D.J.A. Welsh. London Math. Soc. Lect. Note 
Ser., V. 186. Cambridge Univ Pr, 1993, 
viii + 163 pp, $37.95 (P). [ISBN 0-521- 
45740-8] Readable treatment of algorithmic 
aspects of knot theory, combinatorics, statisti- 
cal physics. A clear and up-to-date survey of 
what’s known—and what’s still unknown. BC 


Combinatorics, T(17: 1), P. The Theory of 
Finite Linear Spaces: Combinatorics of Points 
and Lines. Lynn Margaret Batten, Albrecht 
Beutelspacher. Cambridge Univ Pr, 1993, 
x + 214 pp, $49.95. [ISBN 0-521-33317- 
2] Covers work on combinatorial problems in 
linear spaces including embedding higher di- 
mensional linear spaces in projective spaces. 
Presents recent results using group theory. With 
exercises, open problems. DH 


Combinatorics, P. Finite and Infinite Combi- 
natorics in Sets and Logic. Eds: N.W. Sauer, 
R.E. Woodrow, B. Sands. NATO ASI Ser. C,
V. 411. Kluwer Academic, 1993, xvii + 453 pp,
$214. [ISBN 0-7923-2422-6] Proceedings 
of a 1991 NATO Advanced Study Institute in 
Banff, Canada. 


Number Theory, S(15-16), L. Solved and 
Unsolved Problems in Number Theory. Daniel 
Shanks. Chelsea, 1993, xiv + 305 pp, $27.50. 
[ISBN 0-8284-2297-X] The Fourth Edition 
of this unique and engaging book (Third Edi- 
tion, TR, June-July 1986). Only one new page 
in this edition, but still a bargain. SG 


Linear Algebra, S(14-16: 1). Linear Algebra 
with DERIVE. Benny Evans, Jerry Johnson. 
Wiley, 1994, x + 280 pp, $20.95 (P). [ISBN 
0-471-59194-7] Exercises, solutions, and in- 
troduction to DERIVE. Exercises include stan- 
dard problems, problems not solvable by hand, 
and exploratory applications. DS 


Linear Algebra, T*(13: 1). Elementary Lin- 
ear Algebra, Seventh Edition. Howard Anton. 
Wiley, 1994, xvi + 620 pp, $67.95. [ISBN 0- 
471-58742-7] Now with early introduction to 
linear transformations, eigenvalues, and characteristic
equations in R^n, more visualization in
R^2 and R^3, more proofs tailored for beginners.
(Sixth Edition, TR, October 1991.) TH 


Linear Algebra, T(13-14: 1, 2). Linear Al- 
gebra with Applications, Third Edition. W. 
Keith Nicholson. PWS, 1995, xviii + 540 pp. 
[ISBN 0-534-93666-0] “This edition contin- 
ues the trend toward spending more time on 
matrix computations as well as applications, a 
view supported by the Linear Algebra Curricu- 
lum Study Group.” Inner product spaces now 
follow diagonalization, linear transformations. 
No computer exercises. (Second Edition, TR, 
May 1990.) HD 


Group Theory, T*(18: 1), P. Local Repre- 
sentation Theory. J.L. Alperin. Stud. in Adv. 
Math., V. 11. Cambridge Univ Pr, 1993, x 
+ 178 pp, $24.95 (P). [ISBN 0-521-44926- 
X] Paperback version of 1986 text (TR, April 
1987). DP 


Group Theory, P. Linear Algebraic Groups 
and Their Representations. Eds: Richard S. Elman,
Murray M. Schacher, V.S. Varadarajan.
Contemp. Math., V. 153. AMS, 1993, xii + 
200 pp, $42 (P). [ISBN 0-8218-5161-6] Pro- 
ceedings of a 1992 conference at UCLA. 


Algebra, P. Symmetries in Science VI: From 
the Rotation Group to Quantum Algebras. Ed: 
Bruno Gruber. Plenum Pr, 1993, xvii + 770 pp, 
$149.50. [ISBN 0-306-44584-0] Proceedings 
of a 1992 symposium in Bregenz, Austria. 

Algebra, T(15-16: 1, 2). Contemporary Abstract
Algebra, Third Edition. Joseph A. Gallian.
DC Heath, 1994, xix + 577 pp. [ISBN 0-669-33907-5]
Changes from earlier editions
include writing functions to the left of their ar- 
guments, chapter on symmetry and counting, 
new exercises, some internal reorganization. 
(First Edition, TR, May 1986.) JS 


Algebra, T(18: 1, 2), P. Cohen-Macaulay 
Rings. Winfried Bruns, Jürgen Herzog. Stud.
in Adv. Math., V. 39. Cambridge Univ Pr, 
1993, xi + 403 pp, $79.95. [ISBN 0-521-41068-1]
Self-contained introduction to homological
and combinatorial aspects of commutative
algebra: Gorenstein rings, local cohomology,
canonical modules, Hilbert functions,
Stanley-Reisner rings, semi-group rings, and 
determinantal rings. TH 

Algebra, T(18), P*. Polynomial Invariants of 
Finite Groups. D.J. Benson. London Math. 
Soc. Lect. Note Ser., V. 190. Cambridge Univ 
Pr, 1993, ix + 118 pp, $32.95 (P). [ISBN 0-521- 
45886-2] Studies the action of a finite group 
on the ring of polynomial functions on a linear 
representation, both in characteristic zero and p. 
Restriction to finite groups makes the material 
accessible to graduate students. DP 


Calculus, S(13-14). HP-48G/GX Calcula- 
tor Enhancement for Science and Engineering 
Mathematics. Ed: Donald R. LaTorre. Saun- 
ders College, 1994, xii + 418 pp, $12 (P). 
[ISBN 0-03-097000-8] 6 chapters on uses of 
HP-48G/GX calculators in single-variable cal- 
culus, multivariable calculus, differential equa- 
tions, linear algebra, engineering mathematics, 
probability and statistics. Examples, exercises, 
activities, explorations, and projects. HD 


Complex Analysis, S(18), P. An Introduction 
to Sato’s Hyperfunctions. Mitsuo Morimoto. 
Transl: Mitsuo Morimoto. Transl. of Math. 
Mono., V. 129. AMS, 1993, xii + 273 pp, $87. 
[ISBN 0-8218-4571-3] English translation of 
1976 Japanese original. Assumes substantial 
background in complex variables, topological 
linear spaces, functional analysis. Treats theory 
of hyperfunctions of one and several variables, 
cohomology groups, microfunctions. JS 


Dynamical Systems, T(17), P. Chaos, Frac- 
tals, and Noise: Stochastic Aspects of Dynam- 
ics, Second Edition. Andrzej Lasota, Michael 
C. Mackey. Appl. Math. Sci., V. 97. Springer- 
Verlag, 1994, xiv + 472 pp, $49. [ISBN 0-387- 
94049-9] Originally published as Probabilis- 
tic Properties of Deterministic Systems (TR, 
November 1987). New material includes chap- 
ter on evolution of distributions. SK 

Dynamical Systems, P. Bifurcations and Pe- 
riodic Orbits of Vector Fields. Ed: Dana 
Schlomiuk. NATO ASI Ser. C, V. 408. Kluwer 
Academic, 1993, xvii + 472 pp, $180. [ISBN
0-7923-2392-0] 10 papers from the 1992
NATO Advanced Study Institute at Université 
Montréal. 

Dynamical Systems, T?, S(16-18), P, L. Dis- 
crete Iterated Function Systems. Mario Perug- 
gia. AK Peters, 1993, xvi + 180 pp, $34.95. 
[ISBN 1-56881-015-6] Iterated Function Sys- 
tems (IFSs) can be used to encode digitized 
color images via a method popularized in 
Barnsley’s Fractals Everywhere. Book com- 
pares continuous and discrete IFSs; explores 
ramifications of using one to approximate the 
other. Background needed: topology and met- 
ric spaces, probability, Markov chains. KS 


Numerical Analysis, P. Box Splines. C. de 
Boor, K. Höllig, S. Riemenschneider. Appl.
Math. Sci., V. 98. Springer-Verlag, 1993, xvii +
200 pp, $34. [ISBN 0-387-94101-0] Theory 
of box splines (compactly-supported, smooth, 
piecewise polynomial functions). DH 


Numerical Analysis, T(15: 1). Numerical 
Mathematics and Computing, Third Edition. 
Ward Cheney, David Kincaid. Brooks/Cole, 
1994, xiv + 578 pp, $60.75. [ISBN 0-534- 
20112-1] Changes include pseudocode algo- 
rithms, examples solved using either Maple or 
MATLAB. (First Edition, TR, April 1982.) DH 


Numerical Analysis, P. Exploiting Symmetry 
in Applied and Numerical Analysis. Eds: Eu- 
gene L. Allgower, Kurt Georg, Rick Miranda. 
Lect. in Appl. Math., V. 29. AMS, 1993, xii + 
457 pp, $58 (P). [ISBN 0-8218-1134-7] Proceedings
of a 1992 AMS-SIAM Summer Seminar
at Colorado State University.


Functional Analysis, T(18: 1), P. Introduc- 
tion to Regularity Theory for Nonlinear El- 
liptic Systems. Mariano Giaquinta. Lect. in 
Math. Birkhäuser, 1993, viii + 131 pp, $29 (P).
[ISBN 0-8176-2879-7] Lecture notes from 
a winter 1983 course at ETH, Zurich. Re- 
quires strong background in graduate real anal- 
ysis. Works through Hilbert-space regularity, 
Schauder estimates and L^p theory to regularity
in both scalar- and vector-valued cases. SM 


Analysis, P. Different Perspectives on Wave- 
lets. Ed: Ingrid Daubechies. Proc. of Symp. in 
Appl. Math., V. 47. AMS, 1993, xi + 205 pp, 
$45. [ISBN 0-8218-5503-4] 8 lectures from 
a 1993 AMS short course in San Antonio. BH 


Algebraic Geometry, T(17-18: 1). Algebraic 
Varieties. George R. Kempf. London Math. 
Soc. Lect. Note Ser., V. 172. Cambridge Univ 
Pr, 1993, x + 163 pp, $37.95 (P). [ISBN 0-521- 
42613-8] Introduces theory of algebraic func- 
tions on varieties from sheaf-theoretic stand- 
point. Sheaves and sheaf cohomology are pre- 
sented from the beginning; curves are used to
illustrate the theory. Excellent background for
study of schemes. DS 


Algebraic Geometry, P. Moduli Spaces of 
Abelian Surfaces: Compactification, Degener- 
ations, and Theta Functions. Klaus Hulek, 
Constantin Kahn, Steven H. Weintraub. Ex- 
pos. in Math., V. 12. Walter de Gruyter, 1993, 
xii + 347 pp, DM 168. [ISBN 3-11-013851-4] 
Main themes: constructing toroidal compactifi- 
cations of moduli spaces; constructing degen- 
erate abelian surfaces using ideas of Mumford; 
geometry of Horrocks-Mumford bundle. RM 


Differential Geometry, P. The Penrose Trans- 
form and Analytic Cohomology in Representa- 
tion Theory. Eds: Michael Eastwood, Joseph 
Wolf, Roger Zierau. Contemp. Math., V. 154. 
AMS, 1993, x + 259 pp, $47 (P). [ISBN 0- 
8218-5176-4] 15 papers from a 1992 AMS- 
IMS-SIAM Summer Research Conference at 
Mt. Holyoke College. 


Differential Geometry, T(18), S, P. Global 
Affine Differential Geometry of Hypersurfaces. 
An-Min Li, Udo Simon, Guosong Zhao. Expos. 
in Math., V. 11. Walter de Gruyter, 1993, xiii 
+ 328 pp, DM 178. [ISBN 3-11-012769-5] 
Equiaffine hypersurface theory, from graduate 
level to latest results. First local theory, then 
global results: affine hyperspheres, rigidity and 
uniqueness theorems, variational problems and 
affine maximal surfaces, geometric inequalities. 
No exercises; large bibliography. SG 


Geometry, P. Finite Geometry and Combina- 
torics: The Second International Conference at 
Deinze. Eds: A. Beutelspacher, et al. London 
Math. Soc. Lect. Note Ser., V. 191. Cambridge 
Univ Pr, 1993, ix + 412 pp, $39.95 (P). [ISBN 
0-521-44850-6] 35 papers from a 1992 con- 
ference in Astene-Deinze, Belgium. 

Algebraic Topology, T(17-18: 1). Fibre Bundles,
Third Edition. Dale Husemoller. Grad.
Texts in Math., V. 20. Springer-Verlag, 1994,
xix + 353 pp, $54. [ISBN 0-387-94087-1]
Complete guide to theory of fibre bundles,
from homotopy theory through characteristic
classes. Essential reference for K-theory,
classical groups, differentiable manifolds, and 
many other subjects from algebraic topology. 
(Second Edition, TR, January 1976.) DS 


Topology, P. Topics in Knot Theory. Ed: M.E. 
Bozhüyük. NATO ASI Ser. C, V. 399. Kluwer
Academic, 1993, xiv + 353 pp, $149. [ISBN 
0-7923-2285-1] 14 papers from the Septem- 
ber 1992 NATO Advanced Study Institute in 
Erzurum, Turkey. 

Optimization, T(16-17). Interior-Point Poly- 
nomial Algorithms in Convex Programming. 
Yurii Nesterov, Arkadii Nemirovskii. Stud. in
Appl. Math., V. 13. SIAM, 1994, ix + 405 pp,
$68.50. [ISBN 0-89871-319-6] Genuinely 
new in setting a body of material (optimiza- 
tion, stressing non-linear convex problems) in 
a unified context. Suggests new methods for 
many traditional problems. AWR 


Stochastic Processes, T(17), S. Lecture Notes 
in Control and Information Sciences-194: Real- 
ization Probabilities: The Dynamics of Queuing 
Systems. Xi-Ren Cao. Springer-Verlag, 1994, 
x + 320 pp, $78 (P). [ISBN 0-387-19872-5] 
Dynamic approach to steady-state sensitivity 
formulas. RWJ 


Elementary Statistics, S*(7-12). From Home 
Runs to Housing Costs: Data Resource for 
Teaching Statistics. Ed: Gail Burrill. Dale Sey- 
mour Pub, 1994, 110 pp, (P). [ISBN 0-86651- 
734-0] Interesting data sets from government, 
media, and other sources, designed for use with 
Dale Seymour’s Quantitative Literacy Series. 
With suggestions for data analysis, simulation 
activities, lessons, student projects. RSK 


Elementary Statistics, T(13), L. Exploring 
Measurements. Peter Barbella, James Kepner, 
Richard Scheaffer. Dale Seymour Pub, 1994, 
vii + 120 pp, (P). [ISBN 0-86651-639-5] Part 
of Quantitative Literacy Series developed by 
NCTM, American Statistical Association for 
school use. Nicely covers measures of cen- 
ter and variability, random sampling, mean and 
standard deviation of sample average, confi- 
dence intervals for population means; uses vari- 
ous small data sets (e.g., sports, weather). RWJ 


Elementary Statistics, S(13-16). Understanding
Probability and Statistics: A Book of Problems.
Ruma Falk. AK Peters, 1993, xiii +
239 pp, $39.95. [ISBN 1-56881-018-0] Tech- 
nically elementary (calculus-free), but concep- 
tually challenging problems. Interesting com- 
ments and solutions to open-ended problems 
(e.g., the “Let’s Make a Deal” controversy 
sparked by “Ask Marilyn” vos Savant). RSK 


Statistics, P. Clinical Trials and Statistics. Na- 
tional Research Council. National Academy Pr, 
1993, vii + 45 pp, (P). Proceedings of a 1992 
symposium at the National Academy of Sci- 
ences, exploring “how statistical research can 
contribute to gathering increased information 
from clinical trials and observational studies.” 


Statistics, T(16-17). Fundamental Concepts 
in the Design of Experiments, Fourth Edition. 
Charles R. Hicks. Saunders College, 1993, 
xii + 509 pp, $49.25. [ISBN 0-03-097710- 
X] Single-factor and several-factor (factorial, 
nested hierarchical, nested factorial) experiments.
New chapter on Taguchi design procedures
in this edition. SAS routines and output
accompany many examples. RWJ 


Programming, P. Migrating to Fortran 90. 
Jim Kerrigan. O’Reilly & Assoc, 1993, xxvi +
361 pp, $27.95 (P). [ISBN 1-56592-049-X] 


Computer Systems, P. sendmail. Bryan 
Costales, Eric Allman, Neil Rickert. O’Reilly 
& Assoc, 1993, xxxvi + 792 pp, $32.95 (P). 
[ISBN 1-56592-056-2] 


Computer Systems, P. Distributing Appli- 
cations Across DCE and Windows NT. Ward 
Rosenberry, Jim Teague. O’Reilly & Assoc, 
1993, xxvi + 274 pp, $24.95 (P). [ISBN 1- 
56592-047-3] 


Computer Science, P. System Software and 
Tools for High Performance Computing Envi- 
ronments. Eds: Paul Messina, Thomas Ster- 
ling. SIAM, 1993, xix + 160 pp, $5 (P). [ISBN 
0-89871-326-9] Report of a 1992 workshop 
on key issues in developing high-performance 
computing systems. 


Computer Science, T(16-17: 1), L. The Formal
Semantics of Programming Languages: An
Introduction. Glynn Winskel. Found. of Comp. 
MIT Pr, 1993, xviii + 361 pp, $40. [ISBN 0- 
262-23169-7] Up-to-date introduction to pro- 
gramming language semantics. Shows how 
operational and denotational approaches mesh, 
and how Gödel’s theorem implies impossibility
of a fully complete axiomatic semantics.
Nicely develops semantics of higher and recur- 
sive types, nondeterminism, parallelism. RM 


Computer Science, P. Algorithms & Architec- 
tures: Proceedings of the Second NEC Research 
Symposium. Ed: T. Ishiguro. SIAM, 1993, 
xx + 282 pp, $37.50. [ISBN 0-89871-312-9]
Highly parallel computing from various points 
of view. 10 technical papers and synopses of 
panel discussions from a 1991 symposium in 
Tsukuba City, Japan. 


Applications (Biological Science), P. Pre- 
dicting Spatial Effects in Ecological Systems. 
Ed: Robert H. Gardner. Lect. on Math. in the 
Life Sci., V. 23. AMS, 1993, v + 168 pp, 
$33 (P). [ISBN 0-8218-1174-6] Proceedings 
of a symposium at the 1991 Annual Meeting of 
the AAAS. 


Applications (Engineering), P. Simulation 
in the Design of Digital Electronic Systems. 
John B. Gosling. Electronics Texts for Eng. 
& Sci. Cambridge Univ Pr, 1993, xv + 273 pp, 
$70; $34.95 (P). [ISBN 0-521-41656-6; 0-521- 
42672-3] 

Applications (Engineering), P. Lecture Notes 
in Control and Information Sciences-190: Experimental
Robotics II. Eds: Raja Chatila, Gerd
Hirzinger. Springer-Verlag, 1993, xv + 560 pp,
$95 (P). [ISBN 0-387-19851-2] Proceedings
of a 1991 symposium in Toulouse, France. 
Applications (Fluid Dynamics), T(17-18: 1). 
Mathematical Theory of Incompressible Non- 
viscous Fluids. Carlo Marchioro, Mario Pul- 
virenti. Appl. Math. Sci., V. 96. Springer- 
Verlag, 1994, xi + 283 pp, $49. [ISBN 0-387- 
94044-8] Develops Euler’s equation for in- 
compressible nonviscous fluids from viewpoint 
of mathematical physics. Underlying physical 
ideas are developed along with rigorous mathe- 
matical concepts. Assumes basics of ODE and 
PDE, measure theory, analytic functions. DS 
Applications (Physics), T(16-17: 1, 2), S, P, 
L. Inverse Problems in Scattering: An Intro- 
duction. G.M.L. Gladwell. Solid Mechanics & 
Its Applic., V. 23. Kluwer Academic, 1993, x + 
361 pp, $129. [ISBN 0-7923-2478-1] Aims 
to display some of the mathematics of one- 
dimensional inverse scattering problems. First 
6 chapters treat layered media; last 4 cover 
quantum mechanical models. Densely written, 
numerous exercises. MU 


Applications, P. Mathematical Research in 
Materials Science: Opportunities and Perspec- 
tives. National Research Council. National 
Academy Pr, 1993, xi + 129 pp, (P). [ISBN 
0-309-04930-X] Surveys past collaborations 
between mathematics and materials science; 
then “indicates which particular areas of mathe- 
matical sciences research hold the most promise 
for advancing materials science.” 


Applications, P. Transportation and the Math- 
ematical Sciences: The Changing Interaction. 
National Research Council. National Academy 
Pr, 1993, vii + 32 pp, (P). Proceedings of a 
1993 symposium at the National Academy of 
Sciences on planning and operation of surface 
and air transportation systems. 


Applications, P. Future Directions of Nonlin- 
ear Dynamics in Physical and Biological Sys- 
tems. Eds: P.L. Christiansen, J.C. Eilbeck, R.D. 
Parmentier. NATO ASI Ser. B, V. 312. Plenum 
Pr, 1993, xv + 557 pp, $139.50. [ISBN 0- 
306-44562-X] Proceedings of a 1992 NATO 
Advanced Study Institute in Lyngby, Denmark. 


Reviewers 


BC: Barry Cipra, St. Olaf; HD: Hung Dinh, Macalester; 
SG: Steven Galovich, Carleton; TH: Tom Halverson, 
Macalester; BH: Bruce Hanson, St. Olaf; DH: Deanna 
Haunsperger, St. Olaf; RWJ: Roger W. Johnson, Carleton; 
SK: Steve Kennedy, St. Olaf; RSK: Richard S. Kleber, 
St. Olaf; SM: Steve McKelvey, St. Olaf; RM: Richard Mol- 
nar, Macalester; AO: Arnold Ostebee, St. Olaf; DP: David 
Peifer, St. Olaf; AWR: A. Wayne Roberts, Macalester; KS: 
Karen Saxe, Macalester; JS: John Schue, Macalester; DS: 
Dan Schwalbe, Macalester; MU: Milton Ulmer, Carleton; 
MW: Martha Wallace, St. Olaf; PZ: Paul Zorn, St. Olaf.

EXCURSIONS IN CALCULUS:
An Interplay of the Continuous
and the Discrete

Robert M. Young


An excellent source of projects for well motivated
students. This list of 463 references is a valuable
aid for those who wish to dig deeper.
—CHOICE


This excellent book belongs in every school library 
and on every mathematics teacher's bookshelf. 
—The Mathematics Teacher 


Any book published in the Dolciani Mathematical 
Expositions series will naturally be measured 
against the remarkably high standards already 
well established by the previously published titles. 
In the present case, however, it is safe to say that 
this latest addition (the thirteenth in the series) not 
only easily meets the previously set standards but 
sets an entirely new standard for future volumes. 

—Mathematical Reviews 


Printed with eight full-color plates. 


The purpose of this book is to explore, within the 
context of elementary calculus, the rich and el- 
egant interplay that exists between the two main 
currents of mathematics, the continuous and the 
discrete. Such fundamental notions in discrete 
mathematics as induction, recursion, combinato- 
rics, number theory, discrete probability, and the 
algorithmic point of view as a unifying principle are 
continually explored as they interact with tradi- 
tional calculus. The interaction enriches both. 


The book is addressed primarily to well-trained 
calculus students and their teachers, but it can 
serve as a supplement in a traditional calculus 
course for anyone who wants to see more. 


Name 
Address 
City 


State Zip Code 




CONTENTS: 


• Infinite Ascent, Infinite Descent: The Principle
of Mathematical Induction

• Patterns, Polynomials, and Primes: Three
Applications of the Binomial Theorem

• Fibonacci Numbers: Function and Form

• On the Average

• Approximation: from Pi to the Prime Number
Theorem

• Infinite Sums: A Potpourri


The problems, taken for the most part from 
probability, analysis and number theory, are an 
integral part of the text. Many point the reader 
toward further excursions. There are over 400 
problems presented in this book. 


408 pp., 1992, Paperbound 

ISBN 0-88385-317-5 

List: $42.00 MAA Member: $34.00 
Catalog Number DOL-13 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-800-331-1622 Fax (202) 265-2384 


Total $ 
Payment [ ] Check [ ] VISA [ ] MASTERCARD


Credit Card No. 


Signature Exp. Date 


MAIL ORDER 
TEXTBOOKS & 
Expert Service-Immediate Ship Worldwide
0% Standard Academic Discount* 


SPECIAL OFFER 


NEW CUSTOMERS WILL RECEIVE 
0% DISCOUNT ON FIRST ORDER** 


To order, call TOLL FREE: 800-621-1220 


Monday to Saturday: 10am-6pm EST
Visa/Mastercard accepted. Catalog available 


FAX orders: (212) 675-4230 


To write: 18 East 16th Street 
New York, NY 10003 


New York residents only add sales tax 
* Minimum two books 
** Order limit: Three books. Offer expires Nov. 30th, 1994 


Just Released! 


THEORIST 


For Windows, Macintosh 
and Power PC 


Waterloo Maple Software 


(519) 747-2373 
>@mapleseft.on.ca 




Symbolic Mathematics and Graphing 2.0 


A FIRST COURSE OF
COLLEGIATE MATHEMATICS


by Joseph B. Dence & Thomas P. Dence 


Orig Ed. 1994 376 pp. 


$49.95 


ISBN 0-89464-592-7 


“This text would be useful in courses such as General College
Mathematics, Introduction to Mathematical Thinking, and Pre-Calculus.”


“It is unjustifiable to make students sit through one more class of the
same kind of drudgery that they have seen since their freshman
year in high school and thereby deny them exposure to intrinsically
more interesting material just because they are not mathematics,
science, or engineering majors.”

- from the Preface


— Call or write for a FREE Brochure and Sample Pages — 


KRIEGER PUBLISHING COMPANY 
P.O. Box 9542, Melbourne, FL 32902-9542
(407) 724-9542 • Direct Order Line (407) 727-7270 • FAX (407) 951-3671


Domestic orders add $5.00 for first book, $1.50 each additional for shipping. Foreign shipping costs available upon request. 


Visualization in Teaching and 
Learning Mathematics 


Walter Zimmermann and 
Steve Cunningham, Editors 


Buy this book. If you can't buy it, have the library order
it. If the library won't order it, ask to borrow a copy from
a friend. But do read this book.

—The Mathematics Teacher 


High school, community college, and university teachers
who use or are interested in using graphics to teach
calculus, deductive reasoning, functions, geometry, or
statistics will find valuable ideas for teaching... A
must for every college or university library with a
mathematics department.—CHOICE


The twenty papers in this book give an overview 
of research, analysis, practical experience, and 
informed opinion about the role of visualization 
in teaching and learning mathematics, espe- 
cially at the undergraduate level. Visualization 
in its broadest sense is as old as mathematics, but
progress in computer graphics has generated a
renaissance of interest in visual representations 
and visual thinking in mathematics. 


230 pp., Paperbound, 1991 
ISBN 0-88385-071-0 

List: $24.00 

Catalog Number NTE-19 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-(800) 331-1622 Fax (202) 265-2384 


No matter how you
express it, it still means
DERIVE is half price.


The DERIVE A Mathematical
Assistant program lets
you express yourself symbolically,
numerically and
graphically, from algebra
through calculus, with vectors
and matrices too—all
displayed with accepted
math notation, or 2D and 3D
plotting. DERIVE is also easy
to use and easy to read,
thanks to a friendly, menu-driven
interface and split or
overlay windows that can
display both algebra and
plotting simultaneously.
Better still, DERIVE has been
praised for the accuracy and
exactness of its solutions.
But, best of all, the suggested
retail price is now only $125.
Which means DERIVE is now
half price, no matter how you
express it.


DERIVE is a
registered trademark of
Soft Warehouse, Inc.

Soft Warehouse


HONOLULU • HAWAII


System requirements 


DERIVE: MS-DOS 2.1 or 
later, 512K RAM, and one 3½"
disk drive. Suggested retail 
price now $125 (Half off!). 


DERIVE ROM card: 

Hewlett Packard 95LX & 
100LX Palmtop, or other PC 
compatible ROM card 
computer. Suggested retail 
price now $125! 


DERIVE XM 

(eXtended Memory): 

386 or 486 PC compatible 
with at least 2MB of extended
memory. Suggested list
price now $250! 


Soft Warehouse, Inc. • 3660 Waialae Ave.
Ste. 304 • Honolulu, HI, USA 96816-3236
Ph: (808) 734-5801 • Fax: (808) 735-1105


Essays in Humanistic 
Mathematics 


Alvin White, Editor 


A dazzling array of essayists reveal humanistic 
mathematics in this volume, and in so doing go 
beyond the facts, formulas, and algorithms that most 
students associate with mathematics to a presenta- 
tion of mathematics as an intellectual discipline with 
a human perspective and a significant history. Hu- 
manistic mathematics challenges dogmatic teach- 
ing styles that expect students to parrot the lecturer. 
It demands creativity from both the teacher and 
student. 

Teaching mathematics humanistically seeks to 
place the student more centrally in the position of 
inquirer than is generally the case, while at the same 
time acknowledging the emotional climate of the 
activity of learning mathematics. This type of 
teaching encourages students to learn from each 
other and to better understand mathematics as socially
constructed knowledge, rather than as an
arbitrary discipline. 

Teaching humanistic mathematics brings the fo- 
cus less upon the nature of the teaching and learning 
environment and more upon the need to reconstruct 
the curriculum and the discipline of mathematics 
itself. This reconstruction relates mathematical 
discoveries to personal courage, discovery to verification,
mathematics to science, truth to utility, and
mathematics to the culture in which it is embedded.

The humanistic mathematics movement, which 
began as the personal vision of a few, has now 
become a major part of mathematical culture. What 
was viewed with skepticism is now accepted and 
expected. Humanistic mathematics is not a new 
discovery. It is a recent rediscovery of ideas that go
back to Plato. It has provided a vocabulary for 
previously unarticulated concepts and approaches. 

The essays in this volume illustrate and help to 
define humanistic mathematics. The variety and 
scope indicate the richness and fruitfulness of the 
concept. Although each essay is independent, a 
sense of unity emerges. A glimpse at the table of 
contents will give you an idea of the excitement and 
range of the ideas presented. 


212 pp., Paperbound, 1993 
ISBN 0-88385-089-3 

List: $24.00 

Catalog Number NTE-32 


Total $ 
Payment [ ] Check [ ] VISA [ ] MASTERCARD
Credit Card No. 
Signature 


Exp. Date 


From Zero to Infinity 


Constance Reid 


Fourth Edition 


In a delightful way, it well serves the purpose of the 
author—to get others interested in the fascinating 
theory of numbers. —The Mathematics Teacher 


This is a book one can read and enjoy from cover to 
cover in one sitting... It is highly recommended for all.
—Crux Mathematicorum 


The 1992 version of a graceful book, justly praised for 
almost 40 years... Any closely attentive reader who 
knows schoolroom arithmetic and the use of exponents 
is prepared for these pages, and few outside the 
profession will find the material too familiar. 
—Scientific American 


From Zero to Infinity has dazzled readers with 
its freshness and clarity since being published in 
1955. This book shows how interesting the ev- 
eryday natural numbers 0, 1, 2, 3, ... have been 
for over two thousand years, and still are today. 
It combines the mathematics and the history of 
number theory with descriptions of the mys- 
tique that has on occasion surrounded numbers 
even among great mathematicians.

In the death of Professor Eliakim Hastings Moore on December 30, 1932, the
University of Chicago lost one of its most inspiring leaders and the world ...of 
mathematicians lost one of its most gifted members. ... Professor Moore was one of 
the youngest of this group of men [at Chicago] but his dauntless spirit, his boundless 
enthusiasm, his scientific integrity, and his unsurpassed logical insight soon 
established an influence which has extended far and wide and will continue to be felt 
wherever students of mathematics are seeking to discover the mainspring of their 
inspiration. 

Looking back over a period of forty-one years ... this writer ... now wishes 
to enumerate certain outstanding characteristics of this truly great man and, if 
possible, to set them forth with sufficient emphasis ... to attract the emulation of the 
on-coming generation of students of mathematics. 

First, and perhaps most important, was Professor Moore's exemplification 
of intellectual honesty. It was impossible for him to accept as established any 
proposition whose proof was in doubt... There was no place for camouflage of any 
sort in his mental make-up. A striking and critical test of this characteristic occurred 
at the opening meeting of the Mathematical Club in one of the early summer quarters. 
Moore was presenting a paper on a highly technical topic to a large gathering of 
faculty and graduate students from all parts of the country. When half way through 
he discovered what seemed to be an error (though probably no one else in the room 
observed it). He stopped and re-examined the doubtful step for several minutes and 
then, convinced of the error, he abruptly dismissed the meeting — to the 
astonishment of most of the audience. It was an evidence of intellectual courage as 
well as honesty and doubtless won for him the supreme admiration of every person 
in the group — an admiration which was in no wise diminished, but rather increased, 
when at a later meeting he announced that after all he had been able to prove the step 
to be correct. 

A second outstanding characteristic was the intellectual freedom which he 
claimed for himself and which he cheerfully granted to all others. He expected every 
man on the staff to do his full duty but he did not follow him up or visit his 
classroom. He trusted, rather, to the freedom based upon individual responsibility 
and, on the whole, the results were far more satisfactory than any system of espionage 
could possibly have been. There were many departmental open discussions where 
every instructor's views could be fully weighed and evaluated in the light of all other 
considerations; but in the end individual freedom was accorded to each one for 
working out the general plan. If in any case the results were unsatisfactory, it was up 
to the instructor to discover the trouble and to find the remedy. To this acid test of 
freedom, Professor Moore subjected himself on equal terms with all his instructors, 
even to the extreme test, on occasion, of admitting his own failure and seeking the 
remedy from the counsel of others. ... 


H. E. Slaught 
Monthly 40(1933), 191-195. 

Georg Cantor and Transcendental
Numbers 


Robert Gray 


1. INTRODUCTION. Conflicting statements have been made about Cantor’s proof 
of the existence of transcendental numbers. For example, consider the following 
statements: 


The contrast between the methods of Liouville and Cantor is striking, and these methods 
provide excellent illustrations of two vastly different approaches toward proving the existence of 
mathematical objects. Liouville’s is purely constructive; Cantor’s is purely existential. 


— Mark Kac and Stanislaw M. Ulam [20, p. 13] 


Sometimes people have gone on to say that Cantor’s method is not constructive, and cannot 
yield an explicit transcendental number. The words of E. T. Bell on page 569 of his Men of 
Mathematics are typical: 


The most remarkable thing about Cantor’s proof is that it provides no means whereby a 
single one of the transcendentals can be constructed. 


This is not true. Cantor’s idea can be used so as to yield an utterly explicit transcendental 
number. 


— I. N. Herstein and I. Kaplansky [19, p. 238]


Kac and Ulam are comparing Cantor’s method with Liouville’s earlier construction 
of transcendentals [24, 25]—they find Cantor’s method to be non-constructive. But 
Herstein and Kaplansky insist that Cantor’s method is constructive, that it can 
produce a transcendental. A few lines later, they assert that this transcendental “is 
as well determined a number as e or π.”

After reading the statements by Kac and Ulam, and Herstein and Kaplansky, 
we decided to study Cantor’s work and how it has been presented. This article 
contains the results of our study. We begin by analyzing Cantor’s original articles, 
his 1874 article that contains his first proof and his 1891 article that contains his 
diagonal proof. Our analysis will show that Cantor’s methods lead to computer 
programs that generate transcendentals, and it will also determine which transcen- 
dentals are generated by the diagonal method. Next we will examine the history 
behind Cantor’s first proof. Finally, we will consider how some commonly-held 
views about mathematics and its history have affected the interpretation of 
Cantor’s work. 


2. CANTOR’S FIRST PROOF. In 1874, Cantor published his first proof of the 
existence of transcendentals in an article titled “On a Property of the Collection of 
All Real Algebraic Numbers” [3, 5]. Cantor begins his article by defining the 
algebraic reals and introducing the notation: (ω) for the collection of all algebraic
reals, and (ν) for the collection of all natural numbers. Next he states the property
mentioned in the article’s title; namely, that the collection (ω) can be placed into a
one-to-one correspondence with the collection (ν), or equivalently:


... the collection (ω) can be thought of in the form of an infinite sequence:

(2.)  ω_1, ω_2, ..., ω_ν, ...

which is ordered by a law and in which all individuals of (ω) appear, each of them being located
at a fixed place in (2.) that is given by the accompanying index.


Cantor states that this property of the algebraic reals will be proved in Section 1 of 
his article, and then he outlines the rest of the article: 


To give an application of this property of the collection of all real algebraic numbers, I
supplement Section 1 with Section 2, in which I show that when given an arbitrary sequence of
real numbers of the form (2.), one can determine, in any given interval (α ... β), numbers η
that are not contained in (2.). Combining the contents of both sections thus gives a new proof of
the theorem first demonstrated by Liouville: In every given interval (α ... β), there are
infinitely many transcendentals, that is, numbers that are not algebraic reals. Furthermore, the
theorem in Section 2 presents itself as the reason why collections of real numbers forming a
so-called continuum (such as, all the real numbers which are ≥ 0 and ≤ 1), cannot correspond
one-to-one with the collection (ν); thus I have found the clear difference between a so-called
continuum and a collection like the totality of all real algebraic numbers.


To appreciate the structure of Cantor’s article, we number his theorems and 
corollaries: 


Theorem 1. The collection of all algebraic reals can be written as an infinite 
sequence. 


Theorem 2. Given any sequence of real numbers and any interval [α, β], one can
determine a number η in [α, β] that does not belong to the sequence. Hence, one can
determine infinitely many such numbers η in [α, β]. (We have used the modern
notation [α, β] rather than Cantor’s notation (α ... β).)


Corollary 1. In any given interval [α, β], there are infinitely many transcendental
reals. 


Corollary 2. The real numbers cannot be written as an infinite sequence. That is, they 
cannot be put into a one-to-one correspondence with the natural numbers. 


Observe the flow of reasoning: Cantor’s second theorem holds for any sequence 
of reals. By applying his theorem to the sequence of algebraic reals, Cantor obtains 
transcendentals. By applying it to any sequence that allegedly enumerates the 
reals, he obtains a contradiction—so no such enumerating sequence can exist. Kac 
and Ulam reason differently [20, p. 12-13]. They prove Theorem 1 and then 
Corollary 2. By combining these results, they obtain a non-constructive proof of the 
existence of transcendentals. 

Cantor’s theorems are worded constructively, but are they proved constructively?
Since Cantor’s original proof of Theorem 2 is not commonly known, we
present it before answering this question.


Proof of Theorem 2: Recall that we have been given an interval [α, β] and a
sequence of real numbers ω_n. We must find an η in [α, β] that does not belong to
this sequence. Cantor assumes that the members of the sequence are distinct; to
handle an arbitrary sequence, we can either eliminate duplicates from the sequence
or modify his proof to handle arbitrary sequences.

Cantor begins his proof by finding the first two numbers in the given sequence
that belong to [α, β]. Denote the smaller of these numbers by α_1 and the larger by
β_1. Now form the interval [α_1, β_1], and locate the first two numbers in the
sequence that belong to [α_1, β_1]. Denote the smaller of these numbers by α_2 and
the larger by β_2. Then form the interval [α_2, β_2], and continue this procedure of
generating intervals.

We have two cases: Cantor’s procedure yields finitely many intervals [α_n, β_n] or
infinitely many such intervals. In the first case, let [α_N, β_N] be the last interval
generated. Since there can be at most one ω_n in [α_N, β_N], any η in the interval
besides this ω_n and the endpoints of the interval will satisfy the conclusion of the
theorem. In the second case, let α_∞ = lim_{n→∞} α_n and β_∞ = lim_{n→∞} β_n. These
limits exist since the α_n’s form an increasing sequence that is bounded from above,
and the β_n’s form a decreasing sequence that is bounded from below.

The second case breaks into two cases: Either α_∞ = β_∞ or α_∞ < β_∞. At this point
in his proof, Cantor notes that α_∞ = β_∞ holds for the sequence of algebraic reals
[3, p. 261; 5, p. 308-309]. So Cantor not only applies his theorem to the sequence
of algebraic reals, but he also analyzes how his proof handles this particular
sequence.

To complete the proof, we must produce a suitable η for the two remaining
cases. In the case where α_∞ = β_∞, let η be this common limit. Note that η cannot
be a member of the given sequence since for every k, ω_k does not belong to
[α_{k+1}, β_{k+1}]. In the case where α_∞ < β_∞, let η be any number in the interval
[α_∞, β_∞].

Cantor’s proof is constructive: he uses the given sequence ω_n to define the
sequences α_n and β_n, breaks his argument into three cases depending on the
behavior of these sequences, and constructs a suitable η for each case.* If
the sequence ω_n contains all the algebraic reals, then α_n and β_n are converging
nested sequences whose common limit η is a transcendental number.

Perhaps the most convincing way to show that Cantor’s argument produces a 
transcendental is by computing one. Using the methods in Cantor’s proof, we have 
written a computer program that generates the digits of a transcendental in the 
interval (0,1). Output from our program is given in Figure 1. Our program
generates the sequence ω_n by enumerating the polynomials with integer coefficients
and approximating their roots. We approximate roots by using Sturm’s
theorem and Horner’s method [31, p. 138-156]. (For a precise description of the
ω_n sequence generated by our program, see the appendix below.)


*We call a proof “constructive” if it constructs an object using methods acceptable to most 
mathematicians. For a proof of Cantor’s Theorem 2 that meets the demands of constructive mathemati- 
cians, see [2, p. 27]. 

[Figure 1: table with the columns “Approximation” and “Associated Polynomial”; the last two approximations listed are .57341146... and .57341183....]



Figure 1. Generating a transcendental using Cantor’s 1874 method. 


While generating the ω_n sequence, our program also generates the sequences
α_n and β_n, which approximate our transcendental η. Figure 1 shows the first seven
members of α_n and β_n. As Cantor points out in his proof, these sequences have a
common limit, so our program will produce closer approximations. In fact, we can
calculate a bound on the number of algebraic reals that need examining in order to
find an α_n and β_n that approximate η to within 1/k. This calculation requires a
look at our enumeration of the algebraic reals.

For ease of programming, we use a different enumeration of the algebraic reals
than Cantor’s. Cantor enumerates the polynomials with integer coefficients by
their height, where the height of the polynomial a_0 x^k + ... + a_k is k − 1 + |a_0|
+ ... + |a_k|, and then he enumerates the roots of these polynomials. We enumerate
polynomials by their size, which for the polynomial just mentioned is
max(k, |a_0|, ..., |a_k|). Polynomials of the same size are ordered by treating them as
(k + 1)-digit numbers whose digits range from −k to k. For example, the
polynomials 20x − 1 and x^20 − 1 are both size 20 and the first precedes the
second in this ordering. Our enumeration of polynomials generates an enumeration
of their roots; when a polynomial has more than one real root, we enumerate
its roots in numerical order.

To obtain an α_n and β_n such that β_n − α_n ≤ 1/k, we first enumerate those
roots of polynomials of size 2k or less that are in the interval (0, 1). Applying the
procedure in Cantor’s proof to this enumeration, we obtain the finite sequence
α_1, β_1, α_2, β_2, ..., α_n, β_n. Now if β_n − α_n > 1/k, then we would have α_n <
j/(2k) < (j + 1)/(2k) < β_n for some j between 1 and 2k − 2. Since j/(2k) and
(j + 1)/(2k) are roots of polynomials of size 2k, we have found two roots of our
enumeration between α_n and β_n. But Cantor’s procedure allows at most one of
these roots to be between α_n and β_n. Hence, we must have β_n − α_n ≤ 1/k. Since
there are (4k + 1)^{2k+1} − (4k + 1) polynomials of size 2k or less, and since each
one has at most 2k roots, we need to examine at most 2k[(4k + 1)^{2k+1} − (4k + 1)]
algebraic reals in order to approximate our transcendental η to within 1/k.
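
The counting step in this argument can be checked numerically. The Python sketch below is only an illustration, not the program described in this article. Its function names are hypothetical, and it assumes that a polynomial of size s or less is identified with its tuple of s + 1 integer coefficients between −s and s, with the constant polynomials excluded.

```python
from itertools import product

def count_polynomials(size):
    """Brute-force count of coefficient tuples (a_0, ..., a_size) with
    |a_i| <= size that describe non-constant polynomials."""
    coeffs = range(-size, size + 1)
    total = 0
    for poly in product(coeffs, repeat=size + 1):
        # The last entry is the constant term; a non-constant polynomial
        # needs at least one other nonzero coefficient.
        if any(poly[:-1]):
            total += 1
    return total

def root_bound(k):
    """The bound 2k[(4k+1)^(2k+1) - (4k+1)] on the algebraic reals examined."""
    return 2 * k * ((4 * k + 1) ** (2 * k + 1) - (4 * k + 1))

print(count_polynomials(2))  # 120, which equals 5**3 - 5 (the case k = 1)
print(root_bound(1))         # 240
```

The brute-force count is feasible only for very small sizes, but it agrees with the closed form (4k + 1)^{2k+1} − (4k + 1) for k = 1.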

This simple argument produces a poor bound. Figure 1 shows that we do not 
need to enumerate polynomials of size 200 to obtain approximations differing by 
less than 1/100. Nevertheless, our computer program generates digits inefficiently:
asymptotically, it takes at least O(2^{∛n}) steps to generate n digits of our
transcendental number [18]. Any program requiring this many steps is regarded as 
inefficient by computer scientists [16, p. 6—9]. 

Cantor’s proof leads to a computer program that generates a transcendental, 
but a program is not necessary for understanding his article. The constructive 
nature of Cantor’s article is clear from the wording and proof of Theorem 2. This 
theorem separates the constructive part of his article from the proof-by-contradic- 
tion needed to establish the uncountability of the set of reals. Since we will be 
referring to Theorem 2 throughout our article, we shall give it a name—Cantor’s 
theorem on real sequences. 

We are far from the first to point out that Cantor’s article is constructive. In 
1930, Fraenkel stated that the method in this article is “a method that incidentally,
contrary to a widespread interpretation, is fundamentally constructive and not
merely existential” [15, p. 237].


Exercise. The sequence 1/2, 1/3, 2/3, 1/4, 2/4, 3/4, ... contains all the
rationals belonging to (0, 1). Apply the algorithm in Cantor’s proof to this sequence
to generate the digits of an irrational. If you are using pencil and paper, just
compute α_1, β_1, α_2, and β_2.
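
The procedure in Cantor’s proof can be tried on the sequence from this exercise. The Python sketch below is an illustration under stated assumptions, not the program described in this article. The names rationals and cantor_intervals are hypothetical, exact arithmetic comes from Python’s fractions module, each search looks strictly inside the current interval, repeated values such as 2/4 are skipped, and the input is assumed to be an infinite iterator.

```python
from fractions import Fraction
from itertools import count

def rationals():
    """Yield 1/2, 1/3, 2/3, 1/4, 2/4, 3/4, ... (repeated values included)."""
    for n in count(2):
        for m in range(1, n):
            yield Fraction(m, n)

def cantor_intervals(seq, lo, hi, steps):
    """Return the first `steps` pairs (alpha_k, beta_k) of nested endpoints."""
    pairs = []
    for _ in range(steps):
        found = []
        # The iterator resumes where the previous step stopped; earlier
        # terms cannot lie strictly inside the new, smaller interval.
        for x in seq:
            if lo < x < hi and x not in found:
                found.append(x)
                if len(found) == 2:
                    break
        lo, hi = min(found), max(found)
        pairs.append((lo, hi))
    return pairs

for alpha, beta in cantor_intervals(rationals(), Fraction(0), Fraction(1), 3):
    print(alpha, beta)
# 1/3 1/2
# 2/5 3/7
# 7/17 5/12
```

Continuing for more steps narrows the interval further; converting the endpoints to decimals gives successive digits of the limit.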


3. CANTOR’S DIAGONAL PROOF. We now turn to Cantor’s 1891 article [9], 
which contains his well-known diagonal proof. Cantor begins by discussing his 1874 
article. He points out that it contains a proof of the theorem: There are infinite 
sets that cannot be put into one-to-one correspondence with the set of positive 
integers. Then he asserts that this theorem has a much simpler proof than the one 
given in 1874. His new proof uses the set M of elements of the form E =
(x_1, x_2, ..., x_ν, ...), where each x_ν is either m or w. Cantor states that M is
uncountable, and notes that this result is implied by the following theorem:


If E_1, E_2, ..., E_ν, ... is any simply infinite sequence of elements of the set M,
then there is always an element E_0 of M which corresponds to no E_ν.


Cantor proves his theorem by using the diagonal method to construct E_0. Note
that, once again, Cantor states a theorem that separates the constructive content 
of his work from the proof-by-contradiction needed to establish uncountability. 
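
The diagonal construction on sequences of the symbols m and w can be sketched in a few lines of Python. This is a minimal illustration, not code from Cantor or from this article. The enumeration example_enumeration is a hypothetical stand-in for a given sequence of elements of M, and each element is modelled as a function from positions 1, 2, 3, ... to a symbol.

```python
from typing import Callable, List

Element = Callable[[int], str]  # maps a position 1, 2, 3, ... to "m" or "w"

def example_enumeration(nu: int) -> Element:
    """Hypothetical E_nu: position p holds "w" when bit p - 1 of nu is 1."""
    return lambda p: "w" if (nu >> (p - 1)) & 1 else "m"

def diagonal_prefix(enumeration: Callable[[int], Element], length: int) -> List[str]:
    """First `length` symbols of the diagonal element: the nu-th symbol
    of the nu-th sequence, swapped."""
    swap = {"m": "w", "w": "m"}
    return [swap[enumeration(nu)(nu)] for nu in range(1, length + 1)]

prefix = diagonal_prefix(example_enumeration, 5)
print("".join(prefix))  # mmwww
```

By construction, the result differs from the nu-th sequence at position nu, so it cannot appear anywhere in the enumeration.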

By introducing sequences of abstract symbols, Cantor shows that the phe- 
nomenon of uncountability does not depend on properties of the real numbers, 
such as the existence of limits for bounded increasing (or decreasing) sequences of 
reals. Thus, Cantor shows that uncountability is a fundamental phenomenon of set 
theory. Also, his 1874 theorem on real sequences follows easily from his new 
theorem. Take any sequence of real numbers and expand its members into their 
binary representations. (A real of the form m/2^n must be expanded into both of
its binary representations.) This gives us a sequence of binary representations. 
Apply Cantor’s new theorem to obtain the binary representation of a real number 
that does not belong to the original sequence. 

Cantor’s diagonal method is simpler than his earlier nesting method, and it 
generates transcendentals much more efficiently. The diagonal method can generate
n digits of a transcendental in O(n^2 log^2 n log log n) steps [18]. Algorithms
requiring less than O(n?) steps are considered practical by computer scientists [16, 
p. 9]. 

Figure 2 contains output from a computer program that uses Cantor’s diagonal
method to generate the digits of a transcendental in (0, 1). Our program generates
the same ω_n sequence as our program in Section 2. It uses this sequence to
generate the digits of a diagonal number as follows: Let d be the nth digit of ω_n.
Our program sets the nth digit of our diagonal number to d + 1 unless d is 9; in
this case, it sets the nth digit to 0. Cantor’s diagonal argument guarantees that the
decimal representation of our diagonal number differs from the representations we
use for the algebraic reals. But as Figure 2 shows, our program does not generate
both representations of fractions such as 1/2. Hence, we cannot conclude that our
diagonal number is transcendental until we show that it differs from all fractions
having two decimal representations. Our diagonal number does differ from these
fractions because its decimal expansion contains infinitely many 2’s: the decimal
expansions of 1/9, 1/90, 1/900, ... generate 2’s on the diagonal.

Note that our diagonal number can be written as:

\[ \sum_{n=1}^{\infty} \frac{\operatorname{rem}(\lfloor 10^{n} \omega_n \rfloor + 1,\ 10)}{10^{n}} \]

where ω_n is the nth member of our sequence of algebraic reals; rem(m, n) is the
remainder left after dividing m by n; and ⌊x⌋ is the floor of x (the largest integer
equal to or less than x).


Exercise. Write the rationals in (0,1) as a sequence: 1/2, 1/3, 2/3, 1/4, 2/4,
3/4, ... . By applying the diagonal method to this sequence, generate an irrational
number in (0,1). Compute the first 10 digits of this number, compute the 25th
digit. Verify that this number can be written as:

\[ \sum_{n=2}^{\infty} \sum_{m=1}^{n-1} \frac{\operatorname{rem}(\lfloor 10^{f(m,n)} \cdot m/n \rfloor + 1,\ 10)}{10^{f(m,n)}} \]

where f(m,n) = (n − 2)(n − 1)/2 + m.


[Figure 2: sample program output; the legible entries include .3660254... and .66666666....]

Figure 2. Generating a transcendental using Cantor’s diagonal method.
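
The digit rule and the index function f(m, n) can be checked with a short Python sketch. It is an illustration only, with hypothetical function names, and it works with the sequence of rationals from the exercise rather than with the algebraic reals, whose enumeration would require approximating roots.

```python
from fractions import Fraction
from itertools import count, islice

def rationals():
    """Yield 1/2, 1/3, 2/3, 1/4, 2/4, 3/4, ..."""
    for n in count(2):
        for m in range(1, n):
            yield Fraction(m, n)

def diagonal_digits(seq, how_many):
    """nth output digit: (nth decimal digit of the nth term) + 1, with 9 -> 0."""
    digits = []
    for n, w in enumerate(islice(seq, how_many), start=1):
        shifted = (10**n * w.numerator) // w.denominator  # floor(10^n * w)
        digits.append((shifted + 1) % 10)
    return digits

def f(m, n):
    """Position of m/n in the sequence 1/2, 1/3, 2/3, 1/4, ..."""
    return (n - 2) * (n - 1) // 2 + m

def digit_from_formula(m, n):
    """Digit contributed by m/n according to the double-sum formula."""
    return ((10 ** f(m, n) * m) // n + 1) % 10

print(diagonal_digits(rationals(), 10))  # [6, 4, 7, 1, 1, 1, 1, 1, 1, 1]
```

Comparing diagonal_digits with digit_from_formula at position f(m, n) confirms, for as many terms as one cares to test, that the double sum lists the same digits.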


4. ALL TRANSCENDENTALS LIVE ON DIAGONALS. As we have seen, Cantor’s 
diagonal method does construct transcendentals—but which ones? To answer this 
question, we first observe that the digits generated by the diagonal method depend 
on the enumeration of algebraic reals we use. Different enumerations usually lead 
to different transcendentals. Since there are 2^{ℵ_0} transcendentals and 2^{ℵ_0} enumerations
of the algebraic reals, perhaps all transcendentals live on diagonals. In a
precise sense, they do. 

Before we can state our theorem about diagonals and transcendentals, we need 
some definitions. Let b(n) denote the nth digit of b, where b is the binary
representation of a real number. We define the diagonal number of the sequence
b_1, b_2, b_3, ... of binary representations to be the real number whose binary
representation d is obtained by the following rule: d(n) = 0 if b_n(n) = 1, and
d(n) = 1 if b_n(n) = 0. (With binary representations, there is only one way to
change a digit, so there is only one diagonal number associated with a sequence.
To work with other representations, we would have to talk about the diagonal
numbers of a sequence.) We say that a sequence consists of all the binary
representations of algebraic reals if it contains all the binary representations of the
algebraic reals (including both representations of the fractions m/2^n) and if it
does not contain any representations of non-algebraic reals. Such a sequence may 
contain the same representation more than once. Using the above definitions, we 
can express the relationship between transcendentals and diagonal numbers: 


Theorem 3. A real number in the interval (0,1) is transcendental if and only if it is 
the diagonal number of a sequence that consists of all the binary representations of 
algebraic reals in (0, 1). 


Proof: By Cantor’s diagonal argument, the diagonal number of such a sequence is 
transcendental. 

Now assume that t is transcendental. Let a_n be any sequence consisting of all
the binary representations of algebraic reals in the interval (0, 1). We will define a
sequence b_n that is a permutation of the sequence a_n and that generates t as its
diagonal number.

Throughout our proof, we use the same notation to denote a real number and
its binary representation. We start by finding the first a_k such that a_k(1) ≠ t(1).
Our search is bounded by the binary representations of 1/2 since one of these
representations starts with 1 and the other starts with 0. After finding our a_k, we
mark it as used and set b_1 = a_k. Now assume that we have found b_1, b_2, ..., b_{n−1}.
To obtain a suitable b_n, we look for the first unused a_k such that a_k(n) ≠ t(n). If
t(n) = 0, then our search is bounded by the binary representations of 1/2^n +
1/2^{n+i} where i = 1, ..., n. The representations of these numbers have a 1 in their
nth place and at least one of them is unused. Similarly, if t(n) = 1, then our search
is bounded by the binary representations of 1/2^{n+i} where i = 1, ..., n. After
finding our a_k, we mark it as used and set b_n = a_k.

To complete our proof, we must show that the sequence b_n is a permutation of
the sequence a_n. Assume that some a_n was not used, and let a_k be the unused one
with the least index. Now each a_i, for i < k, was used to define a b_j. Let N be
greater than the indices of these b_j’s. (If k = 1, then there are no such b_j’s so we
let N be 1.) By our definition of the sequence b_n, the only way for a_k to stay
unused is for the equality a_k(n) = t(n) to hold for all n ≥ N. Hence, t − a_k is
rational. Since t is transcendental, a_k must also be transcendental, but this
contradicts the fact that a_k is an algebraic real. Thus, the sequence b_n is a
permutation of the sequence a_n.

If we apply the method in our proof to a transcendental t whose digits are
computable, then we can compute a sequence of algebraic reals whose diagonal
number is t. For example, we have written a computer program that uses the
binary representation of 1/e to generate a sequence of algebraic reals whose
diagonal number is 1/e. Output from this program is given in Figure 3. The
sequence generated by our program is a permutation of a sequence ω_n that
consists of all the binary representations of algebraic reals in the interval (0, 1).
This ω_n sequence is similar to the ω_n sequence of our previous programs; the key
difference is that our new sequence consists of representations rather than
numbers. Since our new sequence contains both binary representations of the
fractions m/2^n, its numbering differs from that of our previous sequence. For
example, in Figure 3, two members of the new sequence are binary representations of 1/2.


[Figure 3: table with the columns “Approximation to 1/e”, “Binary Representation of Algebraic Real”, and “Associated Polynomial”.]

Figure 3. Generating a sequence whose diagonal number is 1/e.


Exercise. Construct a sequence consisting of all the binary representations of 
rationals in (0, 1) by expanding the rationals 1/2, 1/3, 2/3, 1/4, 2/4, 3/4,... into 
their binary representations. Now use the algorithm in the proof of Theorem 3 to 
generate the first 10 members of a sequence of binary representations whose 
diagonal number is the irrational √2 − 1. (The binary representation of √2 − 1 is
0.0110101000... .)
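
The rearrangement in the proof of Theorem 3 can be sketched in Python for the setting of this exercise. The code is an illustration under stated assumptions, not the program described in this article. All names are hypothetical, each binary representation is modelled as a function returning its ith digit, and the digits of √2 − 1 are obtained from integer square roots using floor(2^n · √2) = isqrt(2 · 4^n).

```python
from fractions import Fraction
from itertools import count
from math import isqrt

def target_digit(n):
    """nth binary digit of sqrt(2) - 1."""
    return (isqrt(2 * 4**n) - 2**n) % 2

def make_digit_fn(q, trailing_ones):
    """Digit function for one binary representation of the rational q."""
    def digit(i):
        scaled = 2**i * q.numerator
        if trailing_ones:
            # Representation ending in 1s: ceil(2^i * q) - 1, taken mod 2.
            return (-(-scaled // q.denominator) - 1) % 2
        return (scaled // q.denominator) % 2
    return digit

def representations():
    """Yield (q, digit_fn) for every binary representation of 1/2, 1/3, 2/3, ..."""
    for n in count(2):
        for m in range(1, n):
            q = Fraction(m, n)
            yield q, make_digit_fn(q, False)
            if q.denominator & (q.denominator - 1) == 0:  # q has the form m/2^j
                yield q, make_digit_fn(q, True)

def rearrange(source, steps):
    """Pick, for n = 1, 2, ..., the first unused term whose nth digit differs
    from the nth target digit."""
    pool, used, chosen = [], set(), []
    for n in range(1, steps + 1):
        k = 0
        while True:
            if k == len(pool):
                pool.append(next(source))
            if k not in used and pool[k][1](n) != target_digit(n):
                break
            k += 1
        used.add(k)
        chosen.append(pool[k])
    return chosen

b = rearrange(representations(), 10)
print([str(q) for q, _ in b[:4]])  # ['1/2', '2/3', '1/3', '1/2']
```

The two occurrences of 1/2 among the first four terms are its two different representations: the terminating one is chosen first, and the one ending in 1s is chosen fourth.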


5. CANTOR’S UNPUBLISHED PROOF. In Section 2, we saw that Cantor’s ideas 
can be used to write either a direct (constructive) proof or an indirect (non-con- 
structive) proof of the existence of transcendentals. We now investigate whether 
Cantor knew that his ideas could produce such different proofs. 

The thinking that led to Cantor’s 1874 article can be found in the correspondence
between Cantor and Dedekind [11, p. 187-191; 28, p. 12-16]. We start with
Cantor’s letter of November 29, 1873, in which he asks Dedekind the following 
question: 


Take the collection of all positive whole numbers n and denote it by (n); further, imagine the
collection of all positive real numbers x and denote it by (x); the question is simply whether (n) 
and (x) can be corresponded so that each individual of one collection corresponds to one and 
only one individual of the other. 


Cantor says that at first glance it appears that no such correspondence could exist 
—after all, one collection is discrete and the other continuous. But then he brings 
out the subtlety of his question by stating that it is easy to construct a one-to-one 
correspondence between the collection of positive integers and the collection of 
rational numbers. Cantor also states that a one-to-one correspondence can be
constructed between the collection of positive integers and general collections of
the form (a_{n_1, n_2, ..., n_ν}), where the indices n_1, n_2, ..., n_ν and ν are positive integers.

Dedekind replies that he is unable to answer the question, but he does give 
Cantor a one-to-one correspondence between the collection of algebraic numbers 
and the collection of positive integers. Dedekind also advises Cantor not to waste 
too much time on his question because it has no “particular practical interest.” 

In his next letter, dated December 2nd, Cantor acknowledges Dedekind’s advice 
but points out that his question is of interest: “It would be nice if it could be 
answered; for example, provided that it were answered no, one would have a new 
proof of Liouville’s theorem that there are transcendental numbers.” 

Cantor’s letter of December 7th contains the result he is seeking. His proof
starts:


Suppose that the positive numbers ω < 1 can be broken up into the sequence:


(I)  ω_1, ω_2, ..., ω_ν, ...


After an involved argument, Cantor obtains a contradiction. 
In his next letter, dated December 9th, Cantor outlines the proof he will 
publish: 


I show directly that if I start with a sequence

(I)  ω_1, ω_2, ..., ω_ν, ...

I can determine, in every given interval (α ... β), a number η that is not included in (I). Hence, it
follows immediately that the collection (x) cannot correspond one-to-one with the collection
(n)...


Taken together, Cantor’s letters of December 2nd and 7th provide an indirect 
proof of the existence of transcendentals. But his letter of December 9th contains 
his theorem on real sequences, which provides a direct construction of transcen- 
dentals. 


6. WHY IS CANTOR’S ARTICLE MISINTERPRETED? Cantor’s correspondence 
with Dedekind, which contains his indirect existence proof, was not published until 
1937 [17, p. 104]. By then, other mathematicians had rediscovered the proof. Klein 
outlined it in 1894 [21, p. 51]. Since Klein (as far as we know) published the proof 
first, we will call it Klein’s proof. In 1907, Osgood presented Klein’s proof, 
but called it “Cantor’s proof for the existence of non-algebraic numbers” 
[29, p. 159-160].* In 1921, Perron presented Klein’s proof, attributed it to Cantor,
and then critiqued it [30, p. 161-162]: 


...Cantor’s proof for the existence of transcendental numbers has, along with all its simplicity 
and elegance, the great disadvantage that it is only an existence proof; it does not enable us to 
actually specify even a single transcendental number. Free from this disadvantage is another—in 
fact, the oldest—existence proof due to Liouville... 


Perron’s critique is similar to the one given by Kac and Ulam (see Section 1). So 
this view of Cantor’s work has been around for many years. Why do some 
mathematicians misinterpret Cantor’s article? Undoubtedly, there are a variety of 
reasons. We will only discuss how these misinterpretations are encouraged by some 
commonly-held views about mathematics and its history. 

One such view states that Cantor’s set theory was initially attacked by many 
mathematicians of his time. The problem with this view is that it fails to distinguish 
the parts of Cantor’s work that were attacked, from those that were not. For 
example, consider the following statement by Birkhoff and MacLane [1, p. 436—437]: 


Cantor’s argument for this result [“Not every real number is algebraic”] was at first rejected by
many mathematicians, since it did not exhibit any specific transcendental number. 


If Cantor’s argument was rejected, then it would be reasonable to suspect that his 
argument is non-constructive and that this was the reason for its rejection. 
However, we have found no evidence indicating that Cantor’s argument was 
rejected. Kronecker—the mathematician most likely to reject it—had a chance to, 
but did not. 

Cantor sent the article containing his existence proof to Crelle’s Journal 
(Journal für die reine und angewandte Mathematik), even though he knew that
Kronecker, as one of the journal’s editors, could reject or delay the article. 
Previously, Kronecker had delayed the publication of an article written by Heine, 
one of Cantor’s colleagues. In fact, Kronecker had even tried to persuade Heine to 
withdraw his article [12, p. 67 and p. 308-309]. Cantor’s experience was different 
—his article was printed quickly. Apparently, Kronecker found it to be no worse 
(from his point of view) than other articles appearing in Crelle’s Journal. 

Cantor’s article does contain revolutionary ideas, but Cantor rephrases these 
ideas using terminology familiar to his contemporaries. For example, he introduces 
the concept of a collection of reals corresponding one-to-one with the collection of 
positive integers, and then he provides an equivalent formulation—namely, that 
such a collection of reals can be written as a sequence. Also, he states the two 
theorems of his article in terms of sequences rather than one-to-one correspon- 
dences. Finally, Cantor incorporates a constructive idea into his article—careful 
reading reveals that his theorems only deal with sequences that are ordered by a 
“law” (see [3, p. 260; 5, p. 308] and our first Cantor quotation in Section 2). Cantor 
may have inserted this restriction to avoid problems with Kronecker. Kronecker 
required that a sequence or series be generated by an arithmetic rule—in fact, 
Kronecker probably objected to Heine’s article because it dealt with arbitrary 
trigonometric series [14, p. 71]. 


*Klein never attributed his proof to Cantor. In 1895, Klein gave an exposition of Cantor’s 1874 
article in which he replaced Cantor’s old nesting method with the newer diagonal method. Klein called 
the resulting constructive proof “Cantor’s proof” [22, p. 49-54]. 

In his next article [4,6], which was published in 1878, Cantor introduces 
concepts that apply to all sets—these concepts cannot be rephrased into the 
mathematical terminology of his time. Cantor begins this article by introducing the 
notion of a one-to-one correspondence between two arbitrary sets, finite or 
infinite. Next he defines the ordering that this correspondence induces—that is, 
what it means for one set to be of lesser, equal, or greater power (cardinality) than 
another. Cantor devotes most of this article to proving that for every positive 
integer n, the set of all n-tuples of reals can be put into one-to-one correspon- 
dence with the set of reals. 

Once again, Cantor submitted his work to Crelle’s Journal—but this time, 
publication was delayed. Cantor blamed Kronecker for the delay and never sent 
another article to Crelle’s Journal [12, p. 69-70; 17, p. 111-113]. We have no 
record of why Kronecker had difficulty with Cantor’s article. However, to under- 
stand this article, one must work with countably infinite and uncountably infinite 
sets. Kronecker, with his constructive philosophy, could never accept Cantor’s 
reasoning. 

As Cantor developed his ideas about infinity further, more mathematicians 
began to criticize his work. For example, Cantor presented his theory of transfinite 
ordinal numbers in his 1883 monograph Foundations of a General Theory of Sets 
[7,8]. After reading Cantor’s monograph, Poincaré made the following comment 
[13, p. 278; 27, p. 95-96]: 


... these numbers in the second, and especially in the third, number-class have the appearance 
of being form without substance, something repugnant to the French mind. 


(Cantor’s first number class consists of the natural numbers. His second number 
class consists of the countable ordinals, which are the ordinals representing the 
well-orderings of the first number class. His third number class consists of the 
ordinals that represent the well-orderings of the second number class.) 

So the popular view of the history of set theory needs to be refined. Criticism of 
Cantor’s theory did not begin with the publication of his 1874 article. It began with 
his 1878 article, which contains arguments that require the use of infinite sets. 
Criticism increased as Cantor introduced new concepts involving the infinite. 

A commonly-held view about existence proofs also needs examining: namely, 
the view stating that most non-constructive existence proofs are simpler than their 
constructive counterparts. Most non-constructive proofs are simpler. However, we 
must recognize those cases in which a construction yields the simplest proof. For 
example, Perron (see quotation above) mentions the “simplicity and elegance” of 
the non-constructive existence proof that we call Klein’s proof. Comparing Klein’s 
proof to Liouville’s constructive proof, it is tempting to conclude that the former is 
simpler because of its non-constructive nature. However, Cantor’s constructive 
approach yields an even simpler proof. 

Klein’s proof requires that we first prove that the set of reals is uncountable. 
How do we prove this? By assuming that there is a sequence that enumerates the 
reals, applying the diagonal method to this sequence, and obtaining a contradic- 
tion. Dissecting this proof, we find that it rests upon two facts: 


(1) If we apply the diagonal method to a sequence of reals, then we obtain a 
real not in the sequence. 

(2) If we assume that the reals can be enumerated by a sequence and obtain a 
contradiction from this assumption, then no such enumerating sequence 
exists. 


Now if we apply the diagonal method to a sequence containing all the algebraic 
reals, we only need fact (1) to guarantee that we have constructed a transcenden- 
tal. So Cantor’s constructive approach is simpler than the non-constructive one 
(and it provides excellent preparation for the uncountability proof). 
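
Fact (1) can be illustrated with a minimal sketch. It assumes decimal expansions and uses a short list of rationals as a hypothetical stand-in for the start of a sequence of algebraic reals; the helper names are illustrative and do not come from the article's programs.

```python
from fractions import Fraction

def decimal_digit(x, k):
    """Return the k-th digit after the decimal point of x, for 0 <= x < 1 and k >= 1."""
    return int(x * 10**k) % 10

def diagonal_digits(sequence, n):
    """First n decimal digits of a real that differs from sequence[k-1] in digit k.

    Using only the digits 5 and 6 avoids the ambiguity of expansions
    that end in all 0s or all 9s.
    """
    return [6 if decimal_digit(sequence[k - 1], k) == 5 else 5
            for k in range(1, n + 1)]

# Hypothetical input: a few rationals standing in for the start of the sequence.
sample = [Fraction(1, 2), Fraction(1, 3), Fraction(5, 9), Fraction(1, 7)]
digits = diagonal_digits(sample, len(sample))
print("0." + "".join(map(str, digits)))
```

Each digit is chosen by looking at one term only, so no assumption about the whole sequence (and no argument by contradiction) is needed to see that the resulting real is absent from it.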

There may be other views about mathematics and its history that lead some 
mathematicians to misinterpret Cantor’s article. We encourage our readers to 
explore this subject further. 


7. CONCLUSION. We started this article with two conflicting statements about 
Cantor’s proof of the existence of transcendentals. We have seen that these 
statements are talking about two different proofs of the existence of transcenden- 
tals, a constructive proof that goes back to Cantor’s original articles and a 
non-constructive proof that appeared later.* 

We advocate teaching either both proofs or just the constructive one. As 
mentioned in our last section, Cantor’s constructive approach is simpler than the 
non-constructive one. Also, while discussing Cantor’s approach, we can give him 
the credit he deserves for presenting his work so constructively. 


APPENDIX. Figures 1 and 2 (see Sections 2 and 3) contain output from computer 
programs that use Cantor’s methods. Both of these programs generate a sequence 
ω_n of algebraic reals. We now define this sequence precisely so that interested 
readers may verify our results. 

As discussed in Section 2, we enumerate the polynomials with integer coeffi- 
cients by their size and we use this enumeration to generate an enumeration of the 
algebraic reals. We only need to enumerate irreducible polynomials; but since 
Cantor’s methods do work with sequences containing duplicates, we did not bother 
to check for irreducibility. (For those who are interested in testing for irreducibil- 
ity, see [23, p. 431-434].) However, to simplify our calculations and avoid many 
duplicate roots, we decided to enumerate polynomials p(x) with the following 
properties: 


(1) p(x) has a root in the interval (0, 1). 

(2) The coefficients of p(x) have no common factor greater than one, and the 
leading coefficient is positive. 

(3) Either p(x) is linear or it has no linear factors. 

(4) p(x) and its derivative p’(x) have no common roots. 

(5) p(x) and its second derivative p”(x) have no common roots. 


The first condition simplifies our calculations. The second and third conditions 
eliminate many duplicate roots. 

The last two conditions are needed by Newton’s method. As Figure 1 shows (see 
Section 2), Cantor’s 1874 nesting method generates digits very slowly. So for this 
method, we only used Sturm’s theorem and Horner’s method to approximate roots 
[31, p. 138-156]. But the diagonal method generates digits much faster than the 
nesting method. To generate a large number of these digits, we need an efficient 
approximation algorithm, such as Newton’s method. Now to guarantee that 
Newton’s method does converge to a root of p(x), we first use Sturm’s theorem 
and Horner’s method to isolate our root to an interval where p’(x) and p”(x) do 


*That is, appeared in print later—our study of Cantor’s correspondence (see Section 5) shows that 
he knew both proofs. 
not vanish. Since the polynomials in our enumeration do not share roots with their 
first or second derivatives, our root does belong to such an interval. After isolating 
our root, we set our initial approximation to an endpoint of this interval, and then 
we use Newton’s method to generate better approximations (see [31, p. 174-177] 
for details—such as, which endpoint to choose for the initial approximation). 
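
The Newton step described above can be sketched as follows. This is only an illustration under stated assumptions: the polynomial x^2 + x - 1, the starting endpoint 1, and the function names are chosen here and are not taken from the article's programs. On [1/2, 1] neither p' nor p'' vanishes, and p and p'' are both positive at x = 1, so the iteration decreases monotonically to the root.

```python
from fractions import Fraction

def horner(coeffs, x):
    """Evaluate a polynomial whose coefficients are listed from the highest degree down."""
    acc = Fraction(0)
    for c in coeffs:
        acc = acc * x + c
    return acc

def derivative(coeffs):
    """Coefficients of the derivative, again from the highest degree down."""
    n = len(coeffs) - 1
    return [c * (n - i) for i, c in enumerate(coeffs[:-1])]

def newton(coeffs, x0, steps):
    """Apply a fixed number of Newton steps in exact rational arithmetic."""
    d = derivative(coeffs)
    x = Fraction(x0)
    for _ in range(steps):
        x = x - horner(coeffs, x) / horner(d, x)
    return x

p = [1, 1, -1]            # x^2 + x - 1, with one root in (0, 1)
root = newton(p, 1, 5)    # start at the right endpoint of the isolating interval [1/2, 1]
print(float(root))
```

Exact fractions are used so that the digits produced are not affected by floating-point rounding; a production program would more likely work with a fixed number of digits per step.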

Added in Proof. The quotation of Poincaré in Section 6 is out of context. It comes 
from a letter explaining why French mathematicians may not appreciate a pro- 
posed translation of Cantor’s 1883 monograph—a monograph that Poincaré twice 
calls “beautiful” [13, p. 278]. Poincaré warns that, unless they are given concrete 
examples, mathematicians unfamiliar with Cantor’s previous work may find his 
ordinals to be “form without substance.” 

A quotation that accurately reflects some of the criticism of the time can be 
found in an 1883 letter from Hermite to Mittag-Leffler [13, p. 209; 27, p. 96]: 


The impression that Cantor’s memoirs makes on us is distressing. Reading them seems, to all of 
us, to be a genuine torture... . While recognizing that he has opened up a new field of research, 
none of us is tempted to pursue it. For us it has been impossible to find, among the results that 
can be understood, a single one having current interest. The correspondence between the points 
of a line and a surface leaves us absolutely indifferent and we think that this result, as long as no 
one has deduced anything from it, stems from such arbitrary methods that the author would have 
done better to withhold it and wait. 


Hermite’s “us” includes Appell, who read a draft of the letter, and probably 
Picard, who was also critical of Cantor’s recent work [13, p. 212]. The criticism at 
the end of the quotation is directed at the major result of Cantor’s 1878 article. 

The 500th Anniversary of the Sharing 
Problem (The Oldest Problem 
in the Theory of Probability) 


Milton Sobel and Krzysztof Frankowski 


1. INTRODUCTION. The problem of sharing (also known as the Problem of 
Points, Problème des Parties, Teilungsproblem) is, according to K. Jordan [2], the 
oldest known problem in probability theory; it was described in a book by the 
famous mathematician Lucas de Burgo Pacioli, published in 1494 [3], about 500 
years ago, in the ancient Italian language. It occupied the attention of many 
illustrious mathematicians including Fermat, Pascal, Huygens, Ja. Bernoulli, Montmort, 
De Moivre, Laplace, Lagrange, Tartaglia and Pacioli, to name a few. The 2- 
and 3-player problems with unequal parameters and unequal cell probabilities 
were solved long ago by the people listed above (cf. K. Jordan [2] and Todhunter 
[6] for details), but to our knowledge no unified treatment of the general case for 
equal or unequal cell probabilities has ever been published. In this paper the 
recently developed Dirichlet integrals [4], [5] are used to give a unified treatment 
of the general case with k players and equal or unequal cell probabilities. In 
addition we extend the problem by assuming that the contest might be continued 
at some future time and, given the past results as initial conditions, we derive 
results: 

i) for the expected waiting time (first moment) to complete the tournament and 
also higher moments, 

ii) for the probability of each possible vector-ranking of the players at the 
completion of the tournament, 

iii) for four different disciplines used in completing the tournament, 

iv) allowing the possibility of a quota-free cell, so that the sum of the cell 
probabilities for the competitive players may be less than unity, and 

v) allowing ties which clearly affect the expected waiting time, but without giving 
any fractional credit for additional games won. 

The problem is quite simple to describe. A tournament with k players consists of 
games with all k players involved in each game. Each game produces 1 winner and 
the games are carried out sequentially until one of the players wins n games. At 
some point when the ith player has won $w_i < n$ games ($i = 1, 2, \ldots, k$), or needs 
$n - w_i = r_i > 0$ games to win the tournament, the contest is suddenly discontinued. 
How should the prize money be shared if we wish to distribute it in 
proportion to each player’s conditional probability of winning the tournament, 
assuming that the contest were continued? The probability $p_i > 0$ that the ith 
player wins in a single game ($i = 1, 2, \ldots, k$) is given and the games are independent 
of each other. 

In the original formulation there is no chance of a tie in any game. We later 
introduce an additional quota-free cell with cell probability $p_0 = 1 - \sum_{i=1}^{k} p_i < 1$, 
which can also be regarded as the probability of a tie; nobody gets any fractional 
credit for a tied game, but it does add to the expected waiting time for completion 
of the tournament. On the other hand the expression “sink” could indicate that we 
are singling out the cell of a particular one of the k players and waiting until he 
reaches a fixed number of wins and hence it may not refer to an additional cell. If 
$r_i$-values are regarded as quotas, then only the cell associated with $p_0$ is quota free. 
If there is only 1 cell (with probability $p_0$) that is used for stopping, we call it a 
counting cell. The quantity b is used in the tables in [4] and [5] and is usually equal 
to k or k − 1; b = k if there are k competitive players and we are not using any 
player as a sink, but if we use one player as a sink for a computation then 
b = k − 1 in that computation, regardless of whether a quota-free cell is present 
or not. Hence the use of the symbol b in a formula (instead of k) gives us more 
flexibility. 


2. BACKGROUND TOOLS AND THEIR NOTATION. The Dirichlet integrals 
consist of two types: Type 1 (includes the I, the J and the IJ) and Type 2 (includes 
the C, the D and the CD); the integrals of Type 2 constitute the main tool used in 
this paper. These integrals are defined, studied and tabulated, with many illustrations 
of usage, in [5]; we review them briefly. 

For any positive integers b and m, with vectors $\mathbf{a} = (a_1, \ldots, a_b)$ (real) and 
$\mathbf{r} = (r_1, \ldots, r_b)$ (natural), we define 

$$C_{\mathbf{a}}^{(b)}(\mathbf{r}, m) = \frac{\Gamma(m+R)}{\Gamma(m)\prod_{i=1}^{b}\Gamma(r_i)} \int_0^{a_1}\!\cdots\int_0^{a_b} \frac{\prod_{i=1}^{b} x_i^{r_i-1}\,dx_i}{\left(1+\sum_{i=1}^{b} x_i\right)^{m+R}}, \tag{2.1}$$

where $R = r_1 + \cdots + r_b$, $a_i = p_i/p_0$ ($i = 1, 2, \ldots, b$), the $p_i$ are the cell probabilities 
and $p_0 = 1 - \sum_{i=1}^{b} p_i < 1$. 

For the probability interpretation of (2.1) we consider a multinomial setting with 
b blue cells, where the ith cell has cell probability $p_i$ and quota $r_i$. In addition we 
have a counting cell with cell probability $p_0$. We sample from this multinomial 
until the counting cell reaches frequency $m \ge 1$ for the first time, referring to this 
as “at stopping time” or ast. Then (2.1) is the probability that for each of the b 
blue cells, the ith cell has its quota of at least $r_i$ ast. 

The dual probability that for each of the b cells, the ith cell has less than $r_i$ 
($i = 1, \ldots, b$) ast is given by: 

$$D_{\mathbf{a}}^{(b)}(\mathbf{r}, m) = \frac{\Gamma(m+R)}{\Gamma(m)\prod_{i=1}^{b}\Gamma(r_i)} \int_{a_1}^{\infty}\!\cdots\int_{a_b}^{\infty} \frac{\prod_{i=1}^{b} x_i^{r_i-1}\,dx_i}{\left(1+\sum_{i=1}^{b} x_i\right)^{m+R}}, \tag{2.2}$$

with the same notation as in (2.1). We need also the mixed integral, where 
$0 \le c \le b$ integrals are from 0 to $a_i$, or of the C-type, while the remaining b − c 
integrals are from $a_i$ to $\infty$, or of the D-type, namely 

$$CD_{\mathbf{a}}^{(c,\,b-c)}(\mathbf{r}, m) = \frac{\Gamma(m+R)}{\Gamma(m)\prod_{i=1}^{b}\Gamma(r_i)} \int_0^{a_1}\!\cdots\int_0^{a_c}\int_{a_{c+1}}^{\infty}\!\cdots\int_{a_b}^{\infty} \frac{\prod_{i=1}^{b} x_i^{r_i-1}\,dx_i}{\left(1+\sum_{i=1}^{b} x_i\right)^{m+R}}. \tag{2.3}$$

The probability interpretation is given in the same multinomial setting, with the 
obvious difference that for each of the c cells the ith cell ($i = 1, \ldots, c$) has its 
quota of at least $r_i$ and for the rest of the b − c cells the ith cell ($i = c+1, \ldots, b$) 
has less than $r_i$. For b = 0 we define $C^{(0)} = D^{(0)} = CD^{(0,0)} = 1$. 

In the homogeneous case we have all $p_i = p$ ($i = 1, 2, \ldots, b$), $p_0 = 1 - bp$ and 
we use the common scalar $a = p/p_0$ instead of $\mathbf{a}$. Whenever we use more than 
one r (or s) value in a Dirichlet function we separate them from the last argument 
with a semicolon. 

For the proof in Section 3 we need a slight generalization of (2.1). For the 
present, $\mathbf{r} = (r_1, r_2, \ldots, r_b)$ and $\mathbf{a} = (a_1, a_2, \ldots, a_b)$ are each b-vectors. Let $X_i$ 
denote the frequency in the ith blue cell ($i = 1, 2, \ldots, b$) and $R = r_1 + r_2 + \cdots + r_b$ 
as before. Let $P = C_{\mathbf{a}}^{(b,j)}(\mathbf{r}; m)$ denote the probability, when the counting cell 
reaches m (for the first time), that the joint event $X_i = r_i$ ($i = 1, 2, \ldots, j$) and 
$X_i \ge r_i$ ($i = j+1, j+2, \ldots, b$) for given j with $0 \le j \le b$ has occurred. Then 
we have 

$$P = \frac{\Gamma(m+R)\prod_{i=1}^{j}\left(a_i^{r_i}/r_i\right)}{\Gamma(m)\prod_{i=1}^{b}\Gamma(r_i)} \int_0^{a_{j+1}}\!\cdots\int_0^{a_b} \frac{\prod_{i=j+1}^{b} x_i^{r_i-1}\,dx_i}{\left(1+\sum_{i=1}^{j} a_i+\sum_{i=j+1}^{b} x_i\right)^{m+R}}. \tag{2.4}$$

Clearly the j cells with equality are taken to be the first j cells only for notational 
convenience. For j = 0 we drop the second superscript as in (2.1) and for j = b we 
get multinomial probabilities as in (2.26) of [5]. The same result (2.4) holds for 
$D_{\mathbf{a}}^{(b,j)}(\mathbf{r}; m)$, where the event changes to $X_i < r_i$ ($i = j+1, \ldots, b$), except that all 
the limits of integration are from $a_i$ to $\infty$. 

We also use in Section 4 (and elsewhere) the multiple multinomial sum 
expression for the Dirichlet D integral as follows: 

$$D_{\mathbf{a}}^{(b)}(\mathbf{r}, m) = \frac{1}{\left(1+\sum_{i=1}^{b} a_i\right)^{m}} \sum_{x_1<r_1}\cdots\sum_{x_b<r_b} \begin{bmatrix} m-1+\sum_{i=1}^{b} x_i \\ m-1,\, x_1, \ldots, x_b \end{bmatrix} \prod_{i=1}^{b}\left(\frac{a_i}{1+\sum_{j=1}^{b} a_j}\right)^{x_i}, \tag{2.5}$$

where the square bracket denotes the multinomial coefficient; for the case of 
common a and r, this result appears as (2.15) in [5]. 
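
Because (2.5) is a finite sum, the D integral can be evaluated exactly without tables. The following is a minimal sketch, assuming the reading of (2.5) as a sum of negative-multinomial terms over $x_i < r_i$; the function name `dirichlet_D` and its argument order are illustrative choices, not notation from [5].

```python
from fractions import Fraction
from itertools import product
from math import factorial

def dirichlet_D(a, r, m):
    """Evaluate D_a^(b)(r, m) exactly through the finite multinomial sum (2.5).

    a: ratios a_i = p_i / p_0, r: quotas r_i, m: target count of the counting cell.
    """
    a = [Fraction(x) for x in a]
    total_a = 1 + sum(a)
    result = Fraction(0)
    # Sum over all (x_1, ..., x_b) with 0 <= x_i < r_i.
    for xs in product(*(range(r_i) for r_i in r)):
        n = m - 1 + sum(xs)
        term = Fraction(factorial(n) // factorial(m - 1))
        for a_i, x_i in zip(a, xs):
            term *= (a_i / total_a) ** x_i / factorial(x_i)
        result += term
    return result / total_a ** m

# Three cells of equal probability: two blue cells with quotas 3 and 4, counting cell to 2.
value = dirichlet_D([1, 1], [3, 4], 2)
print(value, float(value))
```

The number of terms is the product of the quotas, so this direct evaluation is practical only for small quotas; the tables in [5] and the reduction formulas serve the same purpose for larger arguments.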


3. PROBABILITY RESULTS FOR THE SHARING PROBLEM. The D and C 
Dirichlet integrals provide an immediate unified solution to the generalized 
problem with k players with equal or unequal cell probabilities. Assuming the 
tournament is continued, the first player to reach n wins is clearly the winner of 
the tournament; if the contestants continue beyond that, we say that the last player 
to reach n wins is the loser (or the kth rank winner) of the tournament, i.e., he is 
the last player to reach his quota. If we order all k players in the order that they reach 
their quota, we call it a vector-ranking of the players and we are also interested in 
the probability of any particular vector-ranking. 

Let $\mathbf{a}_i = (p_1/p_i, \ldots, p_{i-1}/p_i, p_{i+1}/p_i, \ldots, p_k/p_i)$ and $\mathbf{r}_i = (r_1, \ldots, r_{i-1}, r_{i+1}, \ldots, r_k)$, 
both with b = k − 1 components. [Here we are singling out the ith 
player and getting ready to use him as a sink.] From the probability interpretation 
of the D integral the probability $W_i$ that player i reaches $r_i$ games won before any 
player j reaches $r_j$ games won ($j \ne i$) is (exactly) 

$$W_i = D_{\mathbf{a}_i}^{(k-1)}(\mathbf{r}_i; r_i) \tag{3.1}$$

and this interpretation also gives us the identity 

$$\sum_{i=1}^{k} W_i = \sum_{i=1}^{k} D_{\mathbf{a}_i}^{(k-1)}(\mathbf{r}_i; r_i) = 1, \tag{3.2}$$

since the probability of a tie, namely $p_0$, is less than 1 and hence one of the k 
players has to win the tournament. Both of the above results hold regardless of 
whether quota-free cells are present or not. 

Similarly the probability that player i is the loser of the tournament is 

$$L_i = C_{\mathbf{a}_i}^{(k-1)}(\mathbf{r}_i; r_i) \tag{3.3}$$

and 

$$\sum_{i=1}^{k} L_i = \sum_{i=1}^{k} C_{\mathbf{a}_i}^{(k-1)}(\mathbf{r}_i; r_i) = 1. \tag{3.4}$$


To illustrate (3.2) we repeat one of the seventeen calculations from the short 
but classical table of Huygens [1], republished in [2]. In this example we have 3 
players, with common probability p = 1/3 and quotas $r_1 = 2$, $r_2 = 3$, $r_3 = 4$. From 
the definition (2.2) we have, integrating by parts, 

$$W_1 = D_1^{(2)}(3,4;2) = \frac{\Gamma(9)}{\Gamma(2)\Gamma(3)\Gamma(4)} \int_1^{\infty}\!\!\int_1^{\infty} \frac{x^2 y^3\,dx\,dy}{(1+x+y)^9} = \tfrac{1}{8} D_{1/2}^{(1)}(3,5) + D_1^{(2)}(3,2) \approx .6186557, \tag{3.5}$$

$$W_2 = D_1^{(2)}(2,4;3) = \frac{\Gamma(9)}{\Gamma(2)\Gamma(3)\Gamma(4)} \int_1^{\infty}\!\!\int_1^{\infty} \frac{x y^3\,dx\,dy}{(1+x+y)^9} \approx .2674897, \tag{3.6}$$

$$W_3 = D_1^{(2)}(2,3;4) = \frac{\Gamma(9)}{\Gamma(2)\Gamma(3)\Gamma(4)} \int_1^{\infty}\!\!\int_1^{\infty} \frac{x y^2\,dx\,dy}{(1+x+y)^9} \approx .1138546. \tag{3.7}$$

Each of the answers in (3.5), (3.6), (3.7) was obtained by reducing the D-functions 
to a single common r argument and then using the 8-place table in [5]. One could 
also finish integration and get exact values 451/729, 195/729, and 83/729, but 
since each of the given answers is correct to 8 decimals there is no need for that. 
Note that these answers sum to one, i.e., (3.2) is satisfied. 
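
As an independent check that does not use Dirichlet integrals, the same shares follow from the classical recursion on the number of games each player still needs. The sketch below assumes there is no tie cell (the probabilities sum to one); the function name `win_probabilities` is illustrative.

```python
from fractions import Fraction
from functools import lru_cache

def win_probabilities(needs, probs):
    """Probability that each player is first to reach his quota.

    needs[i] is the number of games player i still needs; probs[i] is his
    chance of winning a single game (assumed here to sum to one).
    """
    probs = tuple(Fraction(p) for p in probs)
    k = len(needs)

    @lru_cache(maxsize=None)
    def solve(state):
        # If some player has reached his quota, he is the tournament winner.
        for i, need in enumerate(state):
            if need == 0:
                return tuple(Fraction(int(j == i)) for j in range(k))
        # Otherwise condition on the winner of the next game.
        total = [Fraction(0)] * k
        for i, p in enumerate(probs):
            nxt = state[:i] + (state[i] - 1,) + state[i + 1:]
            sub = solve(nxt)
            for j in range(k):
                total[j] += p * sub[j]
        return tuple(total)

    return solve(tuple(needs))

shares = win_probabilities((2, 3, 4), (Fraction(1, 3),) * 3)
print(shares)
```

For the Huygens example this reproduces the exact fractions quoted above, which is the proportion in which the prize money would be divided.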

The three C-integrals with exactly the same arguments give us, respectively, the 
probability that player i will lose the tournament (i = 1, 2, 3). Since the limits in this case are 
both from 0 to 1 there is a bit more calculation and the tables become more useful. 
The exact results are respectively 

$$L_1 = \frac{451}{729} - \frac{16}{32} \approx .1186557, \qquad L_2 = \frac{195}{729} + \frac{1}{32} \approx .2987397, \qquad L_3 = \frac{83}{729} + \frac{15}{32} \approx .5826046. \tag{3.8}$$

Since these also sum to one, (3.4) is satisfied. 

Let $P(r_1, r_2, r_3)$ denote the joint probability that the player who needed $r_i$ 
more wins would have rank i if the contest were completed (i = 1, 2, 3). For the 
case of 3 players we can use the probability interpretation of (2.3) and in our 
example this probability is 

$$P(r_1, r_2, r_3) = CD_1^{(1,1)}(2,4;3) = \frac{\Gamma(9)}{\Gamma(2)\Gamma(3)\Gamma(4)} \int_0^{1}\!\!\int_1^{\infty} \frac{x y^3\,dy\,dx}{(1+x+y)^9} \approx .3887603, \tag{3.9}$$

and for the remaining five vector rankings we obtain 

$$\begin{aligned}
P(r_1, r_3, r_2) &= CD_1^{(1,1)}(2,3;4) \approx .2298954,\\
P(r_2, r_1, r_3) &= CD_1^{(1,1)}(3,4;2) \approx .1938443,\\
P(r_2, r_3, r_1) &= CD_1^{(1,1)}(3,2;4) \approx .0736454,\\
P(r_3, r_1, r_2) &= CD_1^{(1,1)}(4,3;2) \approx .0688443,\\
P(r_3, r_2, r_1) &= CD_1^{(1,1)}(4,2;3) \approx .0450103,
\end{aligned} \tag{3.10}$$

all of which also sum to one. By combining the appropriate pairs we can also 
obtain the vector results for any one player. Thus if $P_1(R_j)$ denotes the probability 
that player 1 ends up with rank j (j = 1, 2, 3) we have from the above results for 
player 1 

$$\bigl(P_1(R_1), P_1(R_2), P_1(R_3)\bigr) = \left(\frac{902}{1458}, \frac{383}{1458}, \frac{173}{1458}\right), \tag{3.11}$$

which also add to one. 
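
The six vector-ranking probabilities can also be obtained by direct recursion on the game states. This is a sketch under one stated assumption: a game won by a player who has already met his quota does not change the order in which the others finish, so such games are skipped by renormalizing over the players still short of their quotas. The function name `ranking_probabilities` is illustrative.

```python
from fractions import Fraction
from functools import lru_cache

def ranking_probabilities(needs, probs):
    """Map each finishing order (tuple of player numbers) to its probability."""
    probs = tuple(Fraction(p) for p in probs)
    k = len(needs)

    @lru_cache(maxsize=None)
    def solve(state):
        active = [i for i in range(k) if state[i] > 0]
        if not active:
            return {(): Fraction(1)}
        # Condition on which still-active player wins next.
        mass = sum(probs[i] for i in active)
        out = {}
        for i in active:
            nxt = state[:i] + (state[i] - 1,) + state[i + 1:]
            for order, pr in solve(nxt).items():
                # Player i is the next to finish only if this win met his quota.
                key = ((i + 1,) + order) if nxt[i] == 0 else order
                out[key] = out.get(key, Fraction(0)) + probs[i] / mass * pr
        return out

    return solve(tuple(needs))

rankings = ranking_probabilities((2, 3, 4), (Fraction(1, 3),) * 3)
for order, pr in sorted(rankings.items()):
    print(order, float(pr))
```

Summing the entries that start with a given player recovers his probability of winning the tournament, and summing those that end with him recovers his probability of losing it.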

In the case of 4 or more players the probability of any particular vector-ranking 
is not nearly as simple as in the case of 3 players. For notational convenience 
below we replace $(r_1, r_2, r_3, r_4)$ by (f, s, t, r) respectively, and use A to denote 
$p_2 + p_3 + p_4$. 

Theorem. The probability $P(\mathbf{v}^0)$ of the vector ranking $\mathbf{v}^0 = (1, 2, 3, 4)$ of 4 players 
with quotas f, s, t, and r and cell probabilities $p_i$ (i = 1, 2, 3, 4) with $p_1 + A \le 1$ is 
given by the double summation with tr terms 

$$P(\mathbf{v}^0) = \sum_{i=0}^{t-1}\sum_{j=0}^{r-1} \begin{bmatrix} s-1+i+j \\ s-1,\, i,\, j \end{bmatrix} \left(\frac{p_2}{A}\right)^{s}\left(\frac{p_3}{A}\right)^{i}\left(\frac{p_4}{A}\right)^{j} C_{p_1/A}^{(1)}(f, s+i+j)\, D_{p_4/p_3}^{(1)}(r-j, t-i). \tag{3.12}$$

For common p the five ratios in (3.12) are all 1/3 or 1 and the result does not 
depend on p, only on the quotas. If the quotas are also common then the value is 
1/4! = 1/24 for any vector $\mathbf{v}$. 
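
The last remark can be checked by hand in the simplest case. The worked equation below is a sketch that assumes common p = 1/4 and unit quotas f = s = t = r = 1, and reads the theorem's summand as the product of a multinomial coefficient, powers of the ratios $p_2/A$, $p_3/A$, $p_4/A$, one C integral and one D integral. Only the term i = j = 0 survives, the multinomial coefficient is 1, and every ratio $p_i/A$ equals 1/3:

$$P(\mathbf{v}^0) = \frac{p_2}{A}\, C_{1/3}^{(1)}(1,1)\, D_{1}^{(1)}(1,1) = \frac{1}{3}\int_0^{1/3}\frac{dx}{(1+x)^2}\int_1^{\infty}\frac{dy}{(1+y)^2} = \frac{1}{3}\cdot\frac{1}{4}\cdot\frac{1}{2} = \frac{1}{24}.$$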


Sketch of the Proof: We simply condition on the number of games won by the third 
and the fourth player at the time when the second player reaches his quota s; call 
these i and j respectively. Then we can use the result (2.4) with j = 2 and b = 3 
with player 1 having attained his quota f, player 2 just reaching s and players 3 and 
4 temporarily at i and j wins respectively. In the ensuing games player 3 has to 
reach t − i before player 4 reaches r − j. Thus we have 

$$P(\mathbf{v}^0) = \sum_{i=0}^{t-1}\sum_{j=0}^{r-1} C_{(p_1/p_2,\,p_3/p_2,\,p_4/p_2)}^{(3,2)}(f, i, j; s)\; D_{p_4/p_3}^{(1)}(r-j, t-i). \tag{3.13}$$

Substituting only the integral expression for C from (2.4) into (3.13), we factor out 
from the integrand the quantity $1 + p_3/p_2 + p_4/p_2$ and with an obvious change of 
variable we obtain the result (3.12). 

We will illustrate this theorem using the case of four players with quotas 
$r_i = i$ (i = 1, 2, 3, 4) and common p = 1/4. From (3.1) using (2.2), we obtain 

$$W_1 = D_1^{(3)}(2,3,4;1) \approx .6921387, \qquad W_2 = D_1^{(3)}(1,3,4;2) \approx .2150879, \tag{3.14}$$

$$W_3 = D_1^{(3)}(1,2,4;3) \approx .0700684, \qquad W_4 = D_1^{(3)}(1,2,3;4) \approx .0227051. \tag{3.15}$$

Using (3.12) we obtain the desired numerical probabilities for the 4! = 24 vector 
rankings $\mathbf{v}^0$, which we arrange in the following table. 

Vector-ranking probabilities for r = (1, 2, 3, 4); common p = 1/4 

v^0    P(v^0)      v^0    P(v^0)      v^0    P(v^0)      v^0    P(v^0) 
1234   .2508208    2134   .0955250    3124   .0292612    4123   .0070258 
1243   .1527470    2143   .0573295    3142   .0111104    4132   .0045392 
1324   .1409879    2314   .0334248    3214   .0187104    4213   .0049145 
1342   .0564335    2341   .0089897    3241   .0048848    4231   .0022199 
1423   .0546842    2413   .0135274    3412   .0035737    4312   .0023510 
1432   .0364654    2431   .0062916    3421   .0025278    4321   .0016547 

sums   .6921388           .2150880           .0700683           .0227051 

As the last row indicates, summing up the columns gives us the values for $W_i$, 
i = 1, 2, 3, 4, again as given in (3.14) and (3.15), except for rounding errors. 

The basic difficulty in developing the expression (3.13) for the probability of 
vector rankings was in getting from 3 to 4 players. The method used for 4 players 
can be generalized in a straightforward manner to any number of players, but not 
without increasing the number of summations, the complexity of the analysis and 
the space needed to write down k! numerical solutions. Thus for 5 players in the 
analogue of (3.12) and (3.13) we would have 3 summations on the product of $C^{(4,3)}$ 
and $CD^{(1,1)}$, i.e., we first use (2.4) with j = 3, b = 4 with player 1 having attained 
his quota $r_1$, player 2 just reaching $r_2$ and players 3, 4 and 5 temporarily at h, i 
and j wins respectively. In the ensuing games when player 4 reaches $r_4 - i$ wins, 
player 3 must already have reached $r_3 - h$ wins and player 5 must not yet have 
reached $r_5 - j$ wins; this is easily expressed by a $CD^{(1,1)}$ function. Hence we only 
need 1 extra summation to write the analogue of (3.13) for 5 players, namely for 
the vector ranking $\mathbf{v}^0 = (1, 2, 3, 4, 5)$ of 5 players with quotas $r_1, r_2, r_3, r_4$ and $r_5$ 
and cell probabilities $p_i$ (i = 1, 2, 3, 4, 5). Letting $A = p_2 + p_3 + p_4 + p_5$, with 
$p_1 + A \le 1$, we have 

$$\begin{aligned}
P(\mathbf{v}^0) &= \sum_{h=0}^{r_3-1}\sum_{i=0}^{r_4-1}\sum_{j=0}^{r_5-1} C_{(p_1/p_2,\,p_3/p_2,\,p_4/p_2,\,p_5/p_2)}^{(4,3)}(r_1, h, i, j; r_2)\; CD_{(p_3/p_4,\,p_5/p_4)}^{(1,1)}(r_3-h, r_5-j; r_4-i)\\
&= \sum_{h=0}^{r_3-1}\sum_{i=0}^{r_4-1}\sum_{j=0}^{r_5-1} \begin{bmatrix} r_2-1+h+i+j \\ r_2-1,\, h,\, i,\, j \end{bmatrix} \left(\frac{p_2}{A}\right)^{r_2}\left(\frac{p_3}{A}\right)^{h}\left(\frac{p_4}{A}\right)^{i}\left(\frac{p_5}{A}\right)^{j}\\
&\qquad \times\, C_{p_1/A}^{(1)}(r_1, r_2+h+i+j)\; CD_{(p_3/p_4,\,p_5/p_4)}^{(1,1)}(r_3-h, r_5-j; r_4-i),
\end{aligned} \tag{3.16}$$

which was obtained by a method analogous to that used for (3.12). 


4. EXPECTATION (WAITING TIME) RESULTS FOR THE SHARING PROBLEM. 
We refer to the time point when the contest is stopped and player i still 
needs $r_i$ games to win the tournament (i = 1, 2, ..., k) as the ‘time of abortion’. A 
related interesting problem (that appears not to have been considered) is to find 
the expectation and/or distribution of the additional number of games 
(after abortion) needed to complete the tournament. We also consider continuing 
beyond that until a complete ranking or some other preassigned partial ranking is 
established. For the complete ranking we consider 4 different disciplines called 
$D_0$, $D_1$, $D_2$ and $D_3$. Under discipline $D_1$ early winners stop playing as soon as 
they reach their quota (or r-value). If player i drops out we divide all the other cell 
probabilities by $1 - p_i$ (i = 1, 2, ..., k), so that the sum is still one and the ratios 
are unchanged. Under discipline $D_2$ we stop the entire contest after the player 
with the rank k − 1 reaches his quota, so that the last one is determined. Under 
discipline $D_3$ (resp., $D_0$) we include both (resp., neither) of these two features. 
Our principal interest for more than one player reaching their quotas is in 
discipline $D_0$, where we continue with all players until a desired goal is reached. 
Again we do not assume equal p-values. As in [5] the symbol $\mu_i^{[\gamma]}$ will denote the 
γth ascending factorial moment of the additional waiting time needed under 
the discipline $D_i$ (γ = 0, 1, ...; i = 0, 1, 2, 3) and $x^{[\gamma]}$ will denote the product 
$x(x+1)\cdots(x+\gamma-1)$. If we wait only for the first winner of the tournament 
then we omit the subscript indicating the discipline. We first carry out the solution 
for k = 3 players and then write the result for k (= b + 1) players since the 
argument is essentially the same. Our development is based partly on the expression 
in (2.5) of the D-function as a (multiple) multinomial sum. 

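For k = 3 players the probability that player j has fewer than $r_j$ (say, $\alpha_j$) won games 
when player i wins the tournament with $r_i$ games won (i = 1, 2, 3) is obtained by 
summing (over $0 \le \alpha_j < r_j$, $j \ne i$) the expression for i = 1 

$$\begin{bmatrix} r_1-1+\alpha_2+\alpha_3 \\ r_1-1,\, \alpha_2,\, \alpha_3 \end{bmatrix} p_1^{r_1} p_2^{\alpha_2} p_3^{\alpha_3} + \text{2 similar terms for } i = 2 \text{ and } 3. \tag{4.1}$$

Multiplying these 3 terms by $(r_1+\alpha_2+\alpha_3)^{[\gamma]}$, $(r_2+\alpha_1+\alpha_3)^{[\gamma]}$ and 
$(r_3+\alpha_1+\alpha_2)^{[\gamma]}$ respectively and summing over the α’s gives us the desired γth ascending 
factorial moment until the first (cf. argument 1) tournament winner as 

$$\begin{aligned}
\mu^{[\gamma]}(1) &= \frac{r_1^{[\gamma]}}{p_1^{\gamma}} \sum_{\alpha_2<r_2}\sum_{\alpha_3<r_3} \begin{bmatrix} r_1+\gamma-1+\alpha_2+\alpha_3 \\ r_1+\gamma-1,\, \alpha_2,\, \alpha_3 \end{bmatrix} p_1^{r_1+\gamma} p_2^{\alpha_2} p_3^{\alpha_3} + \text{2 similar terms}\\
&= \frac{r_1^{[\gamma]}}{p_1^{\gamma}}\, D_{(p_2/p_1,\,p_3/p_1)}^{(2)}(r_2, r_3; r_1+\gamma) + \text{2 similar terms}.
\end{aligned} \tag{4.2}$$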


Using the notation $\mathbf{a}_i$ and $\mathbf{r}_i$ of Section 3 above (see (3.1)) the general result for 
k = b + 1 players can be written succinctly as 

$$\mu^{[\gamma]}(1) = \sum_{i=1}^{k} \frac{\Gamma(r_i+\gamma)}{p_i^{\gamma}\,\Gamma(r_i)}\, D_{\mathbf{a}_i}^{(b)}(\mathbf{r}_i; r_i+\gamma); \tag{4.3}$$

this should be compared with equation (5.8) of [5]. The same formula (4.3) holds 
even if there is a dealer connected with the contest who has a p-value for each 
game but no quota, provided he is not included in the value of k which represents 
the number of active competitors. In other words the same result holds if 
$p_1 + p_2 + \cdots + p_k < 1$ and we have a quota-free cell present. The use of b for 
k − 1 in (4.3) is to avoid any confusion since b is the parameter used in the tables 
in [4] and [5] and is also used in the definitions in Section 2 above. 

In a similar manner we can obtain for any given s ($1 \le s \le k$) the additional 
expected number of games until the player with rank s meets his quota, i.e., until 
the first time that any s of the k players meet their quota. For fixed i and s we 
define $\mathbf{a}_{i;\beta}$ as a permutation of $\mathbf{a}_i$ where the parameter β specifies combinations 
of s − 1 of the components of $\mathbf{a}_i$, so that β runs from 1 to $\binom{k-1}{s-1}$. This result needs 
a double summation: i specifies the player of rank s ($1 \le i \le k$) and β selects 
s − 1 out of k − 1 for the first s − 1 tournament winners. The result under 
discipline $D_0$ is 

$$\mu_0^{[\gamma]}(s) = \sum_{i=1}^{k} \frac{\Gamma(r_i+\gamma)}{p_i^{\gamma}\,\Gamma(r_i)} \sum_{\beta=1}^{\binom{k-1}{s-1}} CD_{\mathbf{a}_{i;\beta}}^{(s-1,\,k-s)}(\mathbf{r}_{i;\beta}; r_i+\gamma), \tag{4.4}$$

where $\mathbf{r}_{i;\beta}$ selects the same subset from $\mathbf{r}_i$ that $\mathbf{a}_{i;\beta}$ does from $\mathbf{a}_i$. Note that the 
sum of the superscripts is again k − 1 = b. 

For s = 1 the result (4.4) reduces to the previous result (4.3) and for s = k we 
get the same as (4.3) with D replaced by C. This is the expectation under 
discipline $D_0$ if we wait for every player to reach his quota. 


Illustrations. For the example of Huygens with k = 3, common p = 1/3 and 
$\mathbf{r} = (2, 3, 4)$ we obtain from (4.3) for the first winner (s = 1) 

$$\mu(1) = 6D_1^{(2)}(3,4;3) + 9D_1^{(2)}(2,4;4) + 12D_1^{(2)}(2,3;5) \approx 4.3909465, \tag{4.5}$$

$$\mu^{[2]}(1) = 54D_1^{(2)}(3,4;4) + 108D_1^{(2)}(2,4;5) + 180D_1^{(2)}(2,3;6) \approx 25.7695168, \tag{4.6}$$

$$\sigma^2(1) = \mu^{[2]}(1) - \mu(1)\bigl(1 + \mu(1)\bigr) \approx 2.0981590. \tag{4.7}$$


In another example with k = 3 players having common p = 1/3 and $\mathbf{r} = (2, 1, 1)$, 
we wait for 2 players to reach their quota, i.e., we take s = 2 and use discipline 
$D_0$. Then from (4.4) for γ = 1 because of symmetry the 6 values reduce to 3 and 
we obtain 

$$\begin{aligned}
\mu_0(2) &= 12\,CD_1^{(1,1)}(1,1;3) + 6\,CD_1^{(1,1)}(1,2;2) + 6\,CD_1^{(1,1)}(2,1;2)\\
&= 12\bigl[D_1^{(1)}(1;3) - D_1^{(2)}(1,1;3)\bigr] + 6D_1^{(1)}(2;2) + 6D_1^{(1)}(1;2) - 12D_1^{(2)}(1,2;2)\\
&= 12\left(\tfrac{1}{8} - \tfrac{1}{27}\right) + 3 + \tfrac{3}{2} - 12\left(\tfrac{5}{27}\right) = \tfrac{10}{3}.
\end{aligned} \tag{4.8}$$

In the same illustration for s = 1 we obtain from (4.3) for γ = 1 

$$\mu(1) = 6D_1^{(2)}(1,1;3) + 6D_1^{(2)}(1,2;2) = 2/9 + 10/9 = 4/3. \tag{4.9}$$

The variance for this example from (4.3) is 

$$\sigma^2 = \mu^{[2]}(1) - \mu(1)\bigl(1 + \mu(1)\bigr) = 54D_1^{(2)}(1,1;4) + 36D_1^{(2)}(1,2;3) - \left(\tfrac{4}{3}\right)\left(\tfrac{7}{3}\right) = \tfrac{2}{9}. \tag{4.10}$$

The results in (4.8), (4.9) and (4.10) can also be checked by a complete enumeration, 
which we omit. 
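
A short program can stand in for the omitted enumeration of the two expectations. The sketch below is an assumption-laden illustration rather than the authors' check: it treats the case in which every player keeps playing after meeting his quota, so a game won by a finished player only adds one game to the count, and that self-loop is solved algebraically. The function name `expected_games` is illustrative.

```python
from fractions import Fraction
from functools import lru_cache

def expected_games(needs, probs, s):
    """Expected number of games until s players have met their quotas,
    when all players keep playing throughout."""
    probs = tuple(Fraction(p) for p in probs)

    @lru_cache(maxsize=None)
    def solve(state):
        if sum(1 for need in state if need == 0) >= s:
            return Fraction(0)
        active = [i for i, need in enumerate(state) if need > 0]
        mass = sum(probs[i] for i in active)
        # E = 1 + sum_active p_i E(next_i) + (1 - mass) E, solved for E.
        acc = Fraction(1)
        for i in active:
            nxt = state[:i] + (state[i] - 1,) + state[i + 1:]
            acc += probs[i] * solve(nxt)
        return acc / mass

    return solve(tuple(needs))

third = Fraction(1, 3)
first_winner = expected_games((2, 1, 1), (third,) * 3, 1)
second_winner = expected_games((2, 1, 1), (third,) * 3, 2)
print(first_winner, second_winner)
```

The two printed values are the expectations for s = 1 and s = 2 in the example with quotas (2, 1, 1); the same function applied to quotas (2, 3, 4) with s = 1 gives the mean waiting time in the Huygens example.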

In the next section we consider the effect of having early winners withdraw from 
the contest when $s \ge 2$, i.e., when the discipline is $D_1$ rather than $D_0$ and we wait 
for at least 2 players to reach their quota. 


5. EARLY WINNERS WITHDRAW FROM THE TOURNAMENT (DISCIPLINE 
$D_1$). It should be noted that the probabilities of vector ranking are not affected by 
early withdrawal of winners. Actually we carry out the expected waiting time 
calculation only for s = 2 and γ = 1 but k remains arbitrary; for the case k = 3 
this already gives us the expected minimum number of games needed to determine 
the ranking of all 3 players. 

Let $\mathbf{r}_{i,j}$ and $\mathbf{a}_{i,j}$ denote vectors as before but with both the i and j components 
removed and using the common denominator $p_j$ in the $\mathbf{a}_{i,j}$ vector. Then the 
required expectation $\mu_1(2)$ under discipline $D_1$ for s = 2 and γ = 1 is given by 

$$\mu_1(2) = \sum_{i\ne j} (r_i + r_j)\, CD_{(p_i/p_j,\,\mathbf{a}_{i,j})}^{(1,\,k-2)}(r_i, \mathbf{r}_{i,j}; r_j) + \sum_{i\ne j} \frac{r_j}{p_j} \sum_{\alpha\ne i,j} p_\alpha\, CD_{(p_i/p_j,\,\mathbf{a}_{i,j})}^{(1,\,k-2)}(r_i, \mathbf{r}^{*}_{i,j}; r_j+1), \tag{5.1}$$

where $\mathbf{r}^{*}_{i,j} = \mathbf{r}_{i,j}$ with $r_\alpha$ replaced by $r_\alpha - 1$ for some $\alpha \ne i, j$ associated with a 
D-type integral and each outside summation has k(k − 1) terms with i denoting 
the first winner and j the second winner, i.e., the pairs (i, j) are ordered. Here α 
sums over all k − 2 indices associated with D-type integrals and if in any term 
$r_\alpha = 1$ then $r_\alpha - 1 = 0$ and that term vanishes. In particular for common r = 1 
and common p = 1/3 from (5.1) we get $\mu_1(2) = 6(2)(1/6) + 6(0) = 2$, which is 
clearly the correct answer. Although (5.1) contains our desired result for s = 2, 
γ = 1 and arbitrary k, the extension of (5.1) to any s and any γ has yet to be done. 


Proof of (5.1): Assume that $r_1$ corresponds to the first tournament winner and $r_2$
to the second; all remaining terms can be obtained by simply permuting the
arguments. The basic idea is already present for k = 3 and we restrict our proof to
this case. Consider the expression TS given by

$$TS = \sum_{\substack{\alpha_2 < r_2 \\ \alpha_3 < r_3}} \binom{r_1 - 1 + \alpha_2 + \alpha_3}{r_1 - 1,\ \alpha_2,\ \alpha_3} p_1^{r_1} p_2^{\alpha_2} p_3^{\alpha_3} \sum_{\beta_3 < r_3 - \alpha_3} Q \binom{r_2 - \alpha_2 - 1 + \beta_3}{\beta_3} \left(\frac{p_2}{p_2 + p_3}\right)^{r_2 - \alpha_2} \left(\frac{p_3}{p_2 + p_3}\right)^{\beta_3}, \tag{5.2}$$

which gives us the probability of the event if we take Q = 1 and gives us the
desired expectation E if we take $Q = r_1 + r_2 + \alpha_3 + \beta_3$; here $\alpha_3$ and $\beta_3$ refer to
the wins of the last tournament winner (or 3rd contestant) before the first tournament
win and between the first and second tournament wins, respectively. For
$Q = (r_1 + r_2) + \alpha_3 + \beta_3$ we separate E into 3 parts ($T_1$, $T_2$ and $T_3$) where $T_1$
corresponds to $r_1 + r_2$, $T_2$ to $\alpha_3$ and $T_3$ to $\beta_3$. Then the first two T's are easily
seen to be (using the linearity of the expectation operator)

$$T_1 = (r_1 + r_2)\, CD^{(1,1)}_{(p_1/p_2,\ p_3/p_2)}(r_1, r_3; r_2), \qquad T_2 = \frac{r_1 p_3}{p_1}\, CD^{(1,1)}_{(p_1/p_2,\ p_3/p_2)}(r_1 + 1, r_3 - 1; r_2). \tag{5.3}$$

The third term $T_3$ can be written (letting $\gamma_3 = \beta_3 - 1$) as

$$T_3 = \frac{p_3}{p_2} \sum_{\substack{\alpha_2 < r_2 \\ \alpha_3 < r_3}} (r_2 - \alpha_2) \binom{r_1 - 1 + \alpha_2 + \alpha_3}{r_1 - 1,\ \alpha_2,\ \alpha_3} p_1^{r_1} p_2^{\alpha_2} p_3^{\alpha_3} \sum_{\gamma_3 < r_3 - 1 - \alpha_3} \binom{r_2 - \alpha_2 + \gamma_3}{\gamma_3} \left(\frac{p_2}{p_2 + p_3}\right)^{r_2 - \alpha_2 + 1} \left(\frac{p_3}{p_2 + p_3}\right)^{\gamma_3} \tag{5.4}$$

and we can now replace the limits of summation (for the two outer sums) by
$\alpha_2 < r_2 + 1$ and $\alpha_3 < r_3 - 1$ since the terms added and the terms deleted by this
are both zero. Separating $T_3$ into two terms corresponding to $r_2$ and $-\alpha_2$, we
obtain

$$T_{3,1} = \frac{r_2 p_3}{p_2}\, CD^{(1,1)}_{(p_1/p_2,\ p_3/p_2)}(r_1, r_3 - 1; r_2 + 1), \qquad T_{3,2} = -\frac{r_1 p_3}{p_1}\, CD^{(1,1)}_{(p_1/p_2,\ p_3/p_2)}(r_1 + 1, r_3 - 1; r_2). \tag{5.5}$$

Since $T_2 = -T_{3,2}$ the final answer is $T_1 + T_{3,1}$. Hence our result for $\mu_1(2)$ for k = 3
and $\gamma = 1$ is the sum of 12 CD-functions and this result is already given in (5.1)
except that the result contains an arbitrary k instead of k = 3.

For the 3 player example with $r_1 = 2$, $r_2 = r_3 = 1$ with a general p-vector with
$p_1 + p_2 + p_3 = 1$ the value of $\mu_1(2)$ is

$$\mu_1(2) = 3 - \frac{p_2 p_3 (2 - p_2 - p_3)}{(1 - p_2)(1 - p_3)}, \tag{5.6}$$

which reduces to 8/3 for $p_1 = p_2 = p_3 = 1/3$. Since the corresponding value was
seen to be 10/3 for discipline $D_0$ in (4.8), we see that the savings by using
discipline $D_1$ is 2/3 of a game on the average.

It should be possible to generalize formula (5.1) so that it holds for any s
(2 ≤ s ≤ k), but this has not been done. One minor difficulty is that considerably
more notation is required. In both sums the superscripts would be (s − 1, k − s)
on the CD-function. The first sum would have $k\binom{k-1}{s-1}$ terms since we sum first on
j from 1 to k and then on combinations of s − 1 components out of k − 1. The
second (or inner) summation has in addition to this, a sum over the remaining
k − s components of each $\mathbf{a}$ (or $\mathbf{r}$) vector, i.e., it would have $k\binom{k-1}{s-1}(k - s)$ terms.
Hence altogether there would be $k\binom{k-1}{s-1}(k - s + 1)$ terms, each with a CD-integral.
With the help of TS in (5.2), the derivation would be straightforward but
not easy to read.
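
The value 8/3 and the closed form (5.6) for the 3 player example can be checked by a short recursion. The Python sketch below is not the CD-function derivation of this section; it assumes that under discipline D_1 a player who reaches the quota leaves, and that the remaining players then win each game with their probabilities renormalized to sum to 1 (as the factors p_2/(p_2 + p_3) in (5.2) suggest). The names `expected_games_with_withdrawal` and `formula_5_6` are illustrative only.

```python
from fractions import Fraction
from functools import lru_cache


def expected_games_with_withdrawal(r, p, s):
    """Expected number of games until s players have reached their quotas,
    when each player withdraws on reaching the quota (assumed D_1)."""
    k = len(r)

    @lru_cache(maxsize=None)
    def expect(state):
        active = [i for i in range(k) if state[i] < r[i]]
        if k - len(active) >= s:
            return Fraction(0)
        total_p = sum(p[i] for i in active)      # renormalizing constant
        value = Fraction(1)                      # the game being played now
        for i in active:
            nxt = state[:i] + (state[i] + 1,) + state[i + 1:]
            value += p[i] / total_p * expect(nxt)
        return value

    return expect((0,) * k)


def formula_5_6(p2, p3):
    """Right-hand side of (5.6) for r = (2, 1, 1) and s = 2."""
    return 3 - p2 * p3 * (2 - p2 - p3) / ((1 - p2) * (1 - p3))


third = Fraction(1, 3)
print(expected_games_with_withdrawal((2, 1, 1), (third,) * 3, 2))   # 8/3
p = (Fraction(1, 2), Fraction(1, 3), Fraction(1, 6))
print(expected_games_with_withdrawal((2, 1, 1), p, 2), formula_5_6(p[1], p[2]))
```

For p = (1/2, 1/3, 1/6) both the recursion and (5.6) give 57/20, so the two agree for at least one unequal p-vector as well as for the common p = 1/3.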


6. A USEFUL IDENTITY FOR NEGATIVE MULTINOMIAL SAMPLING. In
(5.2) we used a triple sum (TS) as a starting point (with Q = 1) instead of the
double sum (DS) given by

$$DS = \sum_{\substack{\alpha_1 \ge r_1 \\ \alpha_3 < r_3}} \binom{r_2 - 1 + \alpha_1 + \alpha_3}{r_2 - 1,\ \alpha_1,\ \alpha_3} p_1^{\alpha_1} p_2^{r_2} p_3^{\alpha_3}. \tag{6.1}$$

In this section we show the identity between DS and TS without assuming the
discipline $D_1$, by first showing that DS is equal to a quadruple sum (QS) and then
summing out one of the indices in QS to obtain the desired TS. To prove this
identity we note that $\alpha_1 \ge r_1$ indicates that the first player has already reached his
quota $r_1$ when the second player reaches his quota $r_2$. Hence we have a common
point to break up every sequence into 2 disjoint parts, where the first part goes
until the first player reaches $r_1$ and in that part $\alpha_2 < r_2$ and $\alpha_3 < r_3$. In the
second part player 2 gets exactly $r_2 - \alpha_2$ wins (including the last win), player 3 gets
$\beta_3 < r_3 - \alpha_3$ and player 1 gets $\beta_1 \ge 0$ wins. Hence we have the result that
DS = QS, i.e., that

$$DS = \sum_{\substack{\alpha_2 < r_2 \\ \alpha_3 < r_3}} \binom{r_1 - 1 + \alpha_2 + \alpha_3}{r_1 - 1,\ \alpha_2,\ \alpha_3} p_1^{r_1} p_2^{\alpha_2} p_3^{\alpha_3} \times \sum_{\substack{\beta_1 \ge 0 \\ \beta_3 < r_3 - \alpha_3}} \binom{r_2 - \alpha_2 - 1 + \beta_1 + \beta_3}{r_2 - \alpha_2 - 1,\ \beta_1,\ \beta_3} p_1^{\beta_1} p_2^{r_2 - \alpha_2} p_3^{\beta_3}. \tag{6.2}$$

Using the negative binomial identity $(1 - p)^{-m} = \sum_{\nu \ge 0} \binom{m + \nu - 1}{\nu} p^{\nu}$ with
$m = r_2 - \alpha_2 + \beta_3$, $p = p_1$ and $\nu = \beta_1$ we sum in (6.2) and easily obtain the
desired TS in (5.2). This establishes the identity for those sequences where player
1 reaches $r_1$ first and player 2 reaches $r_2$ second; a corresponding result holds for
each of the 3(2) = 6 permutations. This shows that the value of $\beta_1$ does not affect
any computations for Q = 1 in (5.2). The extension to k players is straightforward.
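
A numerical check of the identity DS = TS for k = 3 may make the argument more concrete. The Python sketch below rests on a reading of the two sums that is an assumption here: DS is taken as the sum over α_1 ≥ r_1 and α_3 < r_3 of the negative multinomial term for player 2 reaching r_2, and TS (with Q = 1) as the first-winner term for player 1 times the renormalized two-player term for player 2 beating player 3. The infinite sum over α_1 is handled by the same negative binomial identity used in the text. The function names are illustrative only.

```python
from fractions import Fraction
from math import comb, factorial


def multinomial(*parts):
    """(sum of parts)! / (product of part!)."""
    out = factorial(sum(parts))
    for m in parts:
        out //= factorial(m)
    return out


def double_sum(r, p):
    """DS: player 1 already has >= r1 wins and player 3 has < r3 wins
    at the moment player 2 reaches r2 (all three keep playing)."""
    r1, r2, r3 = r
    p1, p2, p3 = p
    total = Fraction(0)
    for a3 in range(r3):
        # sum over all a1 >= 0 in closed form, then remove the terms a1 < r1
        all_a1 = comb(r2 - 1 + a3, a3) * p2**r2 * p3**a3 / (1 - p1)**(r2 + a3)
        below = sum(multinomial(r2 - 1, a1, a3) * p1**a1 * p2**r2 * p3**a3
                    for a1 in range(r1))
        total += all_a1 - below
    return total


def triple_sum(r, p):
    """TS with Q = 1: player 1 reaches r1 first, then player 2 reaches r2
    before player 3 reaches r3, with player 1's later games ignored."""
    r1, r2, r3 = r
    p1, p2, p3 = p
    q2, q3 = p2 / (p2 + p3), p3 / (p2 + p3)
    total = Fraction(0)
    for a2 in range(r2):
        for a3 in range(r3):
            first = multinomial(r1 - 1, a2, a3) * p1**r1 * p2**a2 * p3**a3
            second = sum(comb(r2 - a2 - 1 + b3, b3) * q2**(r2 - a2) * q3**b3
                         for b3 in range(r3 - a3))
            total += first * second
    return total


r = (2, 3, 4)
p = (Fraction(1, 2), Fraction(1, 3), Fraction(1, 6))
print(double_sum(r, p) == triple_sum(r, p))
```

Because the arithmetic is done with exact fractions, equality of the two functions is an exact check rather than a floating-point comparison; summing `triple_sum` over the 6 orderings of the players should also give 1.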


7. THE EXPECTED TOTAL NUMBER OF WINS BY A SPECIFIED PLAYER. If
we wait only for the first tournament winner then the expected number of wins by
player i, denoted by $E_i(W)$, is

$$E_i(W) = r_i D^{(b)}(\mathbf{r}_i; r_i) + p_i \sum_{j \ne i} \frac{r_j}{p_j} D^{(b)}(\mathbf{r}_j^{(i)}; r_j + 1), \tag{7.1}$$

where the first term corresponds to player i winning and otherwise (in the
summation) player j (j ≠ i) is the winner. As before $\mathbf{r}_j^{(i)}$ indicates that $r_i$ is
replaced by $r_i - 1$ and if $r_i = 1$ in any term then that term is zero. As before
k = b + 1. Thus if we have a common r = 1 then the first D in (7.1) is the answer
and if in addition there is a common p then the answer $1/k = (b + 1)^{-1}$ is clearly
correct. The derivation of (7.1) is straightforward and we omit it.

As an example we calculate $E_i(W)$ for i = 1, 2, 3 for Huygens' problem with
r = (2, 3, 4) and common p = 1/3. For the first term in (7.1), we can also use (3.5),
(3.6) and (3.7). For i = 1, 2, 3 we obtain

$$E_1(W) = 2(.61865569) + .15500686 + .07133059 = 1.4636488, \tag{7.2}$$
$$E_2(W) = 3(.26748971) + .53497943 + .12620028 = 1.4636488, \tag{7.3}$$
$$E_3(W) = 4(.11385460) + .66666667 + .34156379 = 1.4636488, \tag{7.4}$$

all the same and the sum of these three is indeed the same as in (4.5) namely
4.390947. Since the expected number of wins is $\mu/3$ for each player where
$\mu = \mu(1)$ is given by (4.5), they must all be equal. More generally for the ith player
the expected number of wins is $\mu p_i$, regardless of the r-values (i = 1, 2, 3).

The second-moment formula analogous to (7.1) is easily shown to be

$$E_i\{W(W - 1)\} = r_i(r_i - 1) D^{(b)}(\mathbf{r}_i; r_i) + p_i^2 \sum_{j \ne i} \frac{r_j(r_j + 1)}{p_j^2} D^{(b)}(\mathbf{r}_j^{(i,i)}; r_j + 2), \tag{7.5}$$

where $\mathbf{r}_j^{(i,i)}$ is $\mathbf{r}_j$ with the component $r_i$ replaced by $r_i - 2$. Then if $r_i = 1$ or 2 in
any term, that term vanishes. We use (7.5) to get $\sigma_i^2$ for i = 1, 2 and 3 in Huygens'
example above, where $r_1 = 2$, $r_2 = 3$ and $r_3 = 4$. The three answers are

$$E_1\{W(W - 1)\} = 1.2373114, \quad E_2\{W(W - 1)\} = 1.9972565, \quad E_3\{W(W - 1)\} = 2.4279836. \tag{7.6}$$

From these we obtain the three variances $\sigma_i^2$ (i = 1, 2, 3) as

$$\sigma_1^2 = .5586923, \quad \sigma_2^2 = 1.3186374, \quad \sigma_3^2 = 1.7493645. \tag{7.7}$$

These are increasing with $r_i$ (i = 1, 2, 3) as one might suspect.
In the k = 3 example with $r_1 = r_2 = r_3 = 1$, s = 1 and common p, the second
moments are all zero and hence the three variances are

$$\sigma_i^2 = E W_i - [E W_i]^2 = \tfrac{1}{3} - \tfrac{1}{9} = \tfrac{2}{9} \quad (i = 1, 2, 3). \tag{7.8}$$
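
The means in (7.2) to (7.4) and the variances in (7.7) can be cross-checked by pushing probability mass forward through the win-count states until some player reaches the quota. The Python sketch below is a direct enumeration, not the Dirichlet-integral formulas (7.1) and (7.5); the name `win_moments` is illustrative only.

```python
from collections import defaultdict
from fractions import Fraction


def win_moments(r, p):
    """Mean and variance of each player's number of wins at the moment
    the first player reaches the quota r[i]."""
    k = len(r)
    mean = [Fraction(0)] * k     # E[W_j]
    fact2 = [Fraction(0)] * k    # E[W_j (W_j - 1)]
    layer = {(0,) * k: Fraction(1)}          # states after n games, with probabilities
    while layer:
        next_layer = defaultdict(Fraction)
        for state, prob in layer.items():
            for i in range(k):
                new = state[:i] + (state[i] + 1,) + state[i + 1:]
                q = prob * p[i]
                if new[i] == r[i]:
                    # player i wins the tournament: record every player's wins
                    for j in range(k):
                        mean[j] += q * new[j]
                        fact2[j] += q * new[j] * (new[j] - 1)
                else:
                    next_layer[new] += q
        layer = next_layer
    var = [f + m - m * m for f, m in zip(fact2, mean)]
    return mean, var


third = Fraction(1, 3)
mean, var = win_moments((2, 3, 4), (third,) * 3)
print([float(m) for m in mean])   # compare (7.2)-(7.4)
print([float(v) for v in var])    # compare (7.7)
```

For common quota r = 1 the same function returns mean 1/3 and variance 2/9 for each of the 3 players, as in (7.8).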


8. THE EXPECTED WAITING TIME: SPECIAL CASE OF COMMON ARGUMENTS.
The expected waiting time for k players with a common p-value of 1/k
and with equal or unequal r-values has not been heretofore studied or tabulated,
although for equal r-values it is given simply in our notation by a single Dirichlet
integral, namely by

$$E\{WT\} = k^2 r\, D_1^{(b)}(r; r + 1), \tag{8.1}$$

where b = k − 1 as before and the scalar arguments (1 and r) are used when the
b components of the vectors $\mathbf{a}$ and $\mathbf{r}$, respectively, are all equal. The numerical
values (cf. Table 8.1 below) are remarkably smooth and monotonic both for increasing
r and increasing k. The increase with respect to r is obvious (the first differences
are actually increasing with r), but the same property also holds with respect to k
and this is less obvious (the first differences are actually decreasing with k). A
short table for k = 2, 3, 4, 5 and 10 is given below for r = 1(1)10(5)100.

Although the simple upper bound kr for (8.1) is easy to obtain, it appears to be 
challenging to get a good asymptotic approximation for r and k both large. If we 
use kr as a rough approximation for that purpose, the error decreases with r but 
increases with k. Thus for r = 100 the error in using kr = 200 for k = 2 is 6% but 
the error in using kr = 1000 for k = 10 is 15%. We consider a better approxima- 
tion below for E{WT} based on statistical considerations. 

Using the result in (2.345) of [5] with j = 0, a = 1, m = r + 1, namely

$$D_1^{(b)}(r, r + 1) = \int_0^\infty [1 - G_r(x)]^b \, dG_{r+1}(x), \tag{8.2}$$

where $G_r(x)$ is the incomplete gamma function given by

$$G_r(x) = \frac{1}{\Gamma(r)} \int_0^x t^{r-1} e^{-t} \, dt, \tag{8.3}$$

it can easily be shown that (8.1) above is k E(k, r) where E(k, r) is the expected
minimum of k random variables which are iid, each of which has the gamma
distribution with parameter r. Hence the expected minimum of these can be
approximated by the 1/(k + 1) quantile of the common distribution $G_r(x)$, i.e., by
the solution in y of

$$G_r(y) = 1 - \sum_{i=0}^{r-1} \frac{y^i}{i!} e^{-y} = \frac{1}{k + 1}, \tag{8.4}$$

which we denote by $G_r^{-1}(1/(k + 1))$. Then our first approximation is

$$k\, G_r^{-1}\!\left(\frac{1}{k + 1}\right) \approx E\{WT\}. \tag{8.5}$$

For r = 100, k = 2 this gives 190.859 with error of 1.1% and for r = 100, k = 10 it
gives 869.284 with error of about 2%; clearly kr gives the upper bounds 200 for
k = 2 and 1000 for k = 10. Inversion of the series in (8.4) would give essentially
the same result in the form of an explicit expression, but the result is not simple
and the convergence is slow; hence we omit further development along this line.

TABLE 8.1. Expected Waiting Time (common r, p = 1/k)

    r        k = 2        k = 3        k = 4        k = 5       k = 10
    1     1.000000     1.000000     1.000000     1.000000     1.000000
    2     2.500000     2.888889     3.218750     3.510400     4.660216
    3     4.125000     5.049383     5.864136     6.606313     9.711140
    4     5.812500     7.348270     8.730541    10.010009    15.533842
    5     7.539063     9.734204    11.736762    13.609403    21.856845
    6     9.292969    12.181254    14.841424    17.346933    28.536383
    7    11.067383    14.674221    18.020165    21.188563    35.486123
    8    12.857910    17.203304    21.257233    25.112238    42.649598
    9    14.661530    19.761754    24.541762    29.102717    49.987546
   10    16.476059    22.344688    27.865883    33.148950    57.471384
   15    25.666067    35.522923    44.912981    53.981108    96.467168
   20    34.985173    48.991265    62.429827    75.475620   137.204701
   25    44.386241    62.643880    80.245210    97.391287   179.051910
   30    53.845310    76.426923    98.271953   119.605204   221.685783
   35    63.348217    90.308563   116.458502   142.044658   264.914914
   40    72.885770   104.268132   134.771370   164.662406   308.614824
   45    82.451599   118.291305   153.187347   187.425703   352.699167
   50    92.041076   132.367675   171.689565   210.310745   397.105149
   55   101.650713   146.489395   190.265309   233.299584   441.785404
   60   111.277802   160.650378   208.904709   256.378292   486.703164
   65   120.920194   174.845785   227.599924   279.535801   531.829205
   70   130.576146   189.071699   246.344603   302.763146   577.139847
   75   140.244223   203.324892   265.133513   326.052940   622.615582
   80   149.923227   217.602669   283.962284   349.399015   668.240112
   85   159.612146   231.902751   302.827223   372.796156   713.999660
   90   169.310113   246.223190   321.725172   396.239906   759.882447
   95   179.016382   260.562308   340.653409   419.726422   805.878316
  100   188.730304   274.918644   359.609565   443.252360   851.978433
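
Entries of Table 8.1 can be reproduced exactly without Dirichlet integrals. The Python sketch below uses a different but equivalent route, the tail-sum formula E{WT} = Σ_n P(WT > n): with common p = 1/k, P(WT > n) is the number of length-n win sequences in which no player has r wins, divided by k^n. The function name `expected_waiting_time` is illustrative only.

```python
from fractions import Fraction
from math import comb


def expected_waiting_time(k, r):
    """E{WT} for k players with common quota r and common p = 1/k,
    computed as the sum over n of P(no player has r wins after n games)."""
    max_len = k * (r - 1)            # longest sequence with nobody at quota
    # counts[n]: number of length-n win sequences in which each of the
    # players handled so far has fewer than r wins (start with zero players).
    counts = [1] + [0] * max_len
    for _ in range(k):
        # add one more player, who takes j < r of the n positions
        counts = [
            sum(comb(n, j) * counts[n - j] for j in range(min(n, r - 1) + 1))
            for n in range(max_len + 1)
        ]
    return sum(Fraction(c, k ** n) for n, c in enumerate(counts))


for k in (2, 3, 4, 5, 10):
    print(k, [round(float(expected_waiting_time(k, r)), 6) for r in (1, 2, 3)])
```

For example, k = 3 and r = 2 gives exactly 26/9, which rounds to the tabulated 2.888889.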
We illustrate the applications of the table 8.1 with the following two problems: 


Problem 1. Suppose k = 3 players enter into a multinomial tournament consisting
of 3-player games, where the first one to win r = 10 games wins the tournament.
Each player has probability 1/10 of winning a game and the games are independent
of each other; the remaining 7/10 probability corresponds to draws, i.e.,
games with no winner. What is the expected number of games needed to obtain a
first tournament winner?

We use the fact that 3 players have equal probability and condition on the
non-drawn games in which the common conditional p = 1/3. Then by using the
table for k = 3 and r = 10 and dividing by $1 - p_0$, where $p_0 = .7$ is the probability
of a draw in each game, we obtain

$$E\{WT\} = \frac{1}{.3}(22.344688) = 74.482293, \tag{8.6}$$

the large answer being due to the high frequency of draws.


Problem 2. If k = 5 players are assigned to the faces 1, 2, 3, 4 and 5 of a single
fair die and the side 6 corresponds to a draw, what is the expected number of
tosses needed to obtain a first tournament winner if 100 wins are needed to win
the tournament?

Using the same method as in Problem 1, we take 443.252360 from the table for
r = 100 and k = 5 and obtain

$$E\{WT\} = \frac{1}{5/6}(443.252360) = 531.903 \tag{8.7}$$

as the expected number of tosses required.

If we make use of the fact that $G_r(X)$ as a transformation yields the uniform
U(0, 1) distribution, we can obtain a correction term to the approximation in (8.5).
Using a standard asymptotic method based on the Taylor series expansion for
$G_r^{-1}(x)$, we obtain

$$E\{G_r^{-1}(X)\} \approx G_r^{-1}\!\left(\frac{1}{k + 1}\right) - \frac{k}{2(k + 1)^2 (k + 2)} \, \frac{g_1(y)}{[g(y)]^3}, \tag{8.8}$$

where

$$g(y) = \frac{y^{r-1} e^{-y}}{\Gamma(r)} \quad \text{and} \quad g_1(y) = \frac{d}{dy}\, g(y) \tag{8.9}$$

and y is the solution of (8.4).
To illustrate the numerical results for this asymptotic analysis we give the results
for r = 100 (the last row of Table 8.1) and k = 2, 3, 4, 5 and 10. The respective
approximations are

189.4, 276.0, 361.1, 444.9 and 854.5. (8.10)

The maximum percentage error for these five approximations is 0.4%.
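
The two approximations can be reproduced numerically. The Python sketch below solves (8.4) by bisection and then applies the first approximation (8.5) and a second-order Taylor correction. The correction is written here under the assumption that X is the minimum of k uniform variables, with mean 1/(k + 1) and variance k/((k + 1)^2 (k + 2)), so that the correction to the quantile is −k g'(y) / (2 (k + 1)^2 (k + 2) g(y)^3). The function names and the bisection bracket are illustrative choices.

```python
import math


def gamma_cdf(r, y):
    """G_r(y) for integer r, computed as 1 - exp(-y) * sum_{i<r} y^i / i!."""
    term, total = 1.0, 1.0
    for i in range(1, r):
        term *= y / i
        total += term
    return 1.0 - math.exp(-y) * total


def gamma_quantile(r, q):
    """Solve G_r(y) = q by bisection; the bracket [0, 2r + 10] is an
    assumption that is adequate for q <= 1/2."""
    lo, hi = 0.0, 2.0 * r + 10.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if gamma_cdf(r, mid) < q:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def approximations(k, r):
    """Return (first approximation, corrected approximation) to E{WT}."""
    y = gamma_quantile(r, 1.0 / (k + 1))
    g = math.exp((r - 1) * math.log(y) - y - math.lgamma(r))   # gamma density at y
    g1 = g * ((r - 1) / y - 1.0)                               # its derivative at y
    first = k * y
    corrected = k * (y - k * g1 / (2 * (k + 1) ** 2 * (k + 2) * g ** 3))
    return first, corrected


for k in (2, 3, 4, 5, 10):
    first, corrected = approximations(k, 100)
    print(k, round(first, 3), round(corrected, 1))
```

Comparing the printed values with the r = 100 row of Table 8.1 shows how much of the error of the first approximation the correction term removes.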


9. CONCLUDING REMARKS. The authors have already noted a massive analogy 
between multinomial problems (sampling from an infinite population or with 
replacement) and hypergeometric problems (sampling from a finite population 
without replacement). All aspects of the sharing problem above can be extended to
analogous problems with a finite population when sampling without replacement. 
For purposes of brevity none of these problems are included in this paper. 

There are interesting analogies to be pointed out even when both problems are 
multinomial. Thus the sharing problem has many analogies with the following 
so-called “Banach match box” problem [5, p. 69]: A smoker starts with r; matches 
in the ith box and selects a box at random (i.e., with equal probabilities for each 
remaining box) every time he (or she) lights up. There are different disciplines 
depending on what we do with the emptied match boxes. The discard of the empty 
match box corresponds exactly to the withdrawal of the early winners from the 
tournament. Moreover each box emptied corresponds to some player reaching his 
or her quota and hence if we match up the parameters properly the problems 
become identical. However some of the questions asked may be quite different.
Thus there is little interest in the vector-ranking of the match boxes. On the other 
hand, some of our formulas above can be directly applied to the Banach match box 
problem and the development of variance formulas for the waiting time seems
more important there than in the sharing problem. 

Finally the sharing problem is analogous to the following version of the birthday 
(or birthmonth) problem: What is the expected number of people that have to be 
polled to find r people with the same birthday (or birthmonth or born under the 
same zodiac sign)? In this case the “contestants” are the days of the year (or the 
month) and the winner of the tournament is the day (or month) which is the first 
one to come up with r births in it. 
...an understanding in depth of the
mathematics of any given period is hardly
ever to be achieved without knowledge
extending far beyond its ostensible subject
matter. More often than not, what makes
it interesting is precisely the early occurrence
of concepts and methods destined to
emerge only later into the conscious mind
of mathematicians; the historian’s task is
to disengage them and trace their influence
or lack of influence on subsequent
developments.

—A. Weil
What is Teaching?*


Paul R. Halmos 


Do you remember the first time you ever taught? The first day I taught was 
September 18, 1935—which was slightly over 58 years ago, or, if you want to be 
very precise, exactly 21,303 days ago today. Does that, I wonder, make me the 
person with the longest teaching experience in this room? Being aware of the 
length of my servitude, the authorities in charge of our meeting today deduced that 
I know, or, in any event, I ought to know what teaching is, and they instructed me 
to tell you. 

The course I taught in 1935 was called freshman algebra; its purpose was to 
reveal the secrets of quadratic equations (for which there was a formula) and 
parentheses (which were abominable entities and had to be eliminated at the drop 
of a hat). The course met at 8:00 in the morning, five days a week—yes, five days, 
Monday through Friday, inclusive; my pay was $45.00 a month. Incidentally, I was
living at the time in an old-fashioned, comfortable, large, 5-room apartment, within 
five minutes walk of the campus; the rent was $45.00 a month. 

I had no fear of teaching. Stage fright, yes; fear, no. Stage fright, in the sense of
being keyed up and slightly nervous is something that has always been with me six 
minutes for each new class: five minutes before it first meets and one minute after 
it starts. The same is true of colloquium talks and all other kinds of public 
appearances. 

Even though there was still a lot I had to learn about teaching, I thought I could 
cope with it. I have always been surprised by beginners who say they don’t know 
how to teach yet. Haven’t they already spent almost twenty years under the 
influence of teachers, and haven’t they noticed that some techniques seem to work 
well and others are just annoying, and haven’t they ever muttered to themselves “I 
could explain that a lot better”? I had had some bad teachers along with the good
ones, and I thought I knew what not to do—I marched to my first class confidently 
with my head held high and with the stage fright barely noticeable under my 
eagerness to get on with it. 

The ages of my students were around 17 and 18; I was a wise old graduate 
student aged 19. They believed what I told them. Some of them were good and 
some of them were hopeless. The only one whose name I’ll never forget (it was not
Drossin, but let’s call him that) was one of the hopeless ones. Attendance spotty, 
homework missing or weak, midterm exams around D minus, the final exam might 
have helped him pass the course, but I didn’t have much faith and neither did he. 
The Saturday evening before the final a small party at my house was interrupted by 
the doorbell—it was Drossin. Could he speak to me a minute, privately? Somewhat
surprised, I took him to an unused room, and asked him what’s up. I was a


*Talk presented at the annual meeting of The Mathematical Association of America, Cincinnati, 
Ohio, 14 January 1994. 
graduate student, wasn’t I?, and perhaps I wasn’t too well off, was I?, and this
course was important for him, so he’d appreciate it if I would help him pass it, and 
he’d make it worth my while—he’d give me five dollars. Half a week’s rent, a 
week’s food! I was too surprised to be angry, but I told him to go away. I returned 
to the party and told my friends that I had just learned how much I was worth. 
Next Monday, Drossin’s paper was the first I graded. He flunked, with no margin 
for error. 


Paul Halmos engaged in one of his favorite activities. 


Well, what about telling you what teaching is? The more I tried to think what to 
say, the more I was being led to the conclusion that nobody cain’t never teach 
nobody nuttin’ nohow. I don’t really mean that, but I mean it a lot more than its 
opposite, and I thought I’d get your attention more quickly by making a crisp 
statement. I’ll spend some of the rest of my time telling you which part of that
provocative crispness I really mean. 

There are three types of knowledge that we commonly speak of as subjects for 
teaching or learning; they can be most effectively identified as what, how, and why. 

To be educated means to remember something, to be able to use it, and to 
understand it. Frequently these three kinds of education are thought of as 
belonging to altogether different kinds of human activity, but ideally they are all 
present every time. Our memory knows that Napoleon was defeated at Waterloo, 
our muscles know that certain stretchings and bendings will cause our feet to 
alternate suitably so as to take us from the office to lunch, and our mind knows 
why three times five is the same as five times three. 

Many students confuse education with memorization. They tend to think that if 
we know the boiling point of beer, the gestation period of elephants, the conjuga- 
tion of French irregular verbs, and the population of Burma, together with many 
other such goodies about the moon, whales, protons, synapses, schizophrenia, and 
interest rates, then we are educated. A walking encyclopedia is, however, rarely an 
educated person. To a historian, history is not just a collection of facts but an
organized understanding of how we got to be what we are; Waterloo is not just a 
fact, but, possibly, a tool to be used to avoid catastrophes in the future. To a 
chemist, chemistry is not just purple liquids in test tubes, but a scheme for 
prediction and a way of understanding the world—and the same sort of thing is 
true of the physicist, the astronomer, the psychologist, and the economist. 

Sometimes education is oriented toward application, only application, and the 
result is just as wide of the mark. A linguist is not one who can speak strange 
languages, certainly not only that, and a cellist is not one who knows where to put 
the fingers of his left hand and at what angle to pull his right elbow back. 
Etymology and syntax for the one, and musicology and musicianship for the other 
play the role of the vitally necessary components of remembering and understand- 
ing. 

The third sin, that of identifying education with ratiocination is rarer, but not 
unknown: the philosophers who, allegedly, don’t want to be burdened with or 
confused by mere facts, and the pure logicians and mathematicians who not only 
don’t want to put their thoughts to work but are even inclined to deny that that can 
be done, they are the ones guilty of the sin of giving short shrift to matter and 
muscle. 

How then does one learn, and, more to the point here and now, how do we 
teach the what, the how, and the why? I have my prejudices about all of them, but 
I can claim professional training and experience about one of them only. I’ll wave
at the other two, quickly, in passing. 

As far as facts go, I am pretty much stumped. How can I learn Napoleon’s 
dates, the meaning of the Hungarian word “mell”, the number of rings around
Saturn, or the percentage of hydrogen and oxygen in tap water? I can ask an 
expert, a teacher, by way of a book or a class room, or, when it is physically 
possible to do so, I myself can look. One trouble is that I might not know enough 
to look: my teacher has to tell me not only about Napoleon’s dates, but also about 
telescopes. The teaching of “what” abuts on the teaching of “how”—how to learn,
how to look, how to perform experiments. The way to teach “what” splits into 
(1) tell ’em the facts, and (2) tell ’em how to get the facts.

How do we teach the how? How do we teach someone to swim, to play a 
musical instrument, or to speak a foreign language? One possible answer is: 
nohow. Don’t do nuttin’: just wait. Throw the kid into the water, put him on the 
piano bench, or abandon him in France, and go away. After all, humanity has 
fumbled its way to these things without any external guidance, and, arguably, the 
best way for an individual to learn them is to rediscover them for himself. 
(Incidentally, as far as language goes, is this idea related to Chomsky’s innate
grammar?)

A somewhat different attitude to how-teaching is to regard the role of the 
teacher as that of a coach. To be sure, nobody can swim for me, nobody can play 
the piano with my fingers, and nobody can speak French for me, but somebody can 
save me an awful lot of time by showing me the right way right away. Once I have 
seen the crawl, heard the difference that fingering can make to the sound of a 
piece, or pronounced “an” while holding my nose, I have made hundreds of years 
of progress. 

Does it help the student of swimming, of piano, of French to “understand” 
what he is doing? Some argue that it does not, that it hurts. (Once you start 
thinking about how you swim, how fast you should play a passage, or whether you 
should use the subjunctive, you are lost.) Others argue that all knowledge helps: 
the swimmer should understand the pertinent principles of physics, the pianist
should see what the theory of harmony has to do with what he is doing, and the 
speaker should know grammar. You see where the latter view leads, don’t you? It 
says, in effect (and I am inclined to agree), that the teaching of how abuts on the 
why. 

How, finally, do we teach why? How do we teach logic and mathematics, how do 
we teach abstract concepts and the relations among them, how do we teach 
intuition, recognition, understanding? How do we teach these things so that when 
we are done our ex-student can not only pass an examination by naming the 
concepts and listing the relations, but he can also get pleasure from his insight, 
and, if he is talented and lucky, be vouchsafed the discovery of a new one? The 
only possible answer that I can see is: nohow. Don’t do nuttin’; just wait. The only 
way I know of for an individual to share in humanity’s slowly acquired understand- 
ing is to retrace the steps. Some old ideas were in error, of course, and some might 
have become irrelevant to the world of today, and therefore no longer fashionable, 
but on balance every student must repeat all the steps—ontogeny must recapitulate
phylogeny every time.

What then can we do to earn a living? Can a mathematician of today, for 
instance, be of any use to the budding mathematicians of tomorrow? My answer is 
yes. What we can do is to point a student in the right directions, challenge him 
with problems, and thus make it possible for him to “remember” the solutions. 
Once the solutions start being produced, we can comment on them, we can 
connect them with others, and we can encourage their generalizations. The worst 
we can do is to give polished lectures crammed full of the latest news from fat and 
expensive scholarly journals and books—that is, I am convinced, a waste of time. 

You recognize, of course, that I seem to be advocating what is sometimes called 
the Socratic or do-it-yourself or discovery method, or, especially in Texas, the 
Moore method. The method is not to tell students but to ask them, and, better yet, 
to inspire them to ask themselves—make students solve problems, and better yet, 
train students, by example, encouragement, and generous reinforcement, to con- 
struct problems of their own. Problem solving—that is the most highly touted 
current shibboleth, and that is the flag that I too want to wave. The flag should be 
kept waving; the important ideas deserve to be emphasized over and over again. 

The most effective way to teach mathematics by problem solving is to keep 
challenging students with problems that are just barely within their reach. One way 
to inculcate the historical attitude, for instance, is to ask a question that Archimedes 
didn’t have the most efficient tools to answer, and dare students to rediscover 
Archimedes’s research. The best meaning to give to the phrase “undergraduate 
research” in mathematics is to guide an undergraduate to re-do the research of 
Leibniz (or Lefschetz).

Everybody loves a puzzle. The ones that appear in the Sunday supplement of 
the local newspaper or that the telephone company sometimes sends along with 
their bill, get read and discussed almost as much as the comics and the sports page. 
The most popular and most widely read part of the American Mathematical 
Monthly is its problem department. Problems is the way to go. 

People much prefer stimulation to inundation. Don’t snow them; tease them. 
Puzzles, yes—preachments, no. The problem method of teaching is the best for 
students, and, once its technical difficulties are overcome, offers the greatest 
stimulation and reward to the faculty also. 

Have students sometimes asked you, when you whizzed through the derivation 
of the quadratic formula (or the quotient rule for derivatives, or the triangularization
of complex matrices), possibly in a tone of grudging admiration, “How do you
remember all that stuff?” The answer, of course, is that you don’t remember it: 
you understand it. If students were guided through research that leads them to 
discover that completing the square does something useful to the equation 
2x² + 9x + 10 = 0, they’d have a chance to understand it much better than if they
were just shown the technique and then made to practice it a hundred times. The 
problem method is, I am convinced, the way to teach everything. It teaches 
technique and understanding, it teaches research and problem solving, it teaches 
the way nature taught us (about fire and carpentry and the stars and weaving) 
before we invented teachers. 


Paul Halmos. 


The method doesn’t begin by proving Theorem 1. It begins with questions: what 
is true?, what do the examples we can look at suggest? It doesn’t say “look, here’s 
how it’s done”, it asks “how can it be done?”. It teaches the right attitude toward
the solution of all problems. The problems that we came here to solve can be 
solved—all problems can be solved—and the teaching of problem solving is the 
way to set about solving them. If we could teach every teacher to teach every 
course as a problem course, then one generation from now, in twenty five years, 
say, the need for talks such as this one would no longer exist. All we have to do is 
find out how to do that, and we can adjourn. 

Let me emphasize one thing I casually dropped along the way: I spoke of the 
way to begin. The way to begin all teaching is with a question. I try to remember 
that precept every time I begin to teach a course, and I try even to remember it 
every time I stand up to give a lecture—and, you may recall, I remembered it and 
acted on it today. 

Another part of the idea of the method is to concentrate attention on the 
definite, the concrete, the specific. Once a student understands, really and truly 
understands, why 3 × 5 is the same as 5 × 3, then he quickly gets the automatic
but nevertheless exciting and obvious conviction that “it goes the same way” for all
other numbers. We all seem to have an innate ability to generalize (shades of 
Chomsky again?). The teacher’s function is to call attention to a concrete special 
case that hides (and, we hope, ultimately reveals) the germ of the conceptual 
difficulty. 

One time I used the so-called Moore method in an honors class in linear 
algebra with about 15 students. The first day of class I handed each student a set 
of 19 pages stapled together and I told them that they now held the course in their
hands. Those 19 pages contained the statements of fifty theorems, and nothing 
else. There were no definitions, there was no motivation, there were no explana- 
tions—nothing but fifty theorems, stated correctly but brutally, with no expository 
niceties. That, I told them, is the course. If you can understand, state, prove, 
exemplify, and apply those fifty theorems, you know the course, you know every- 
thing that this course is intended to teach you. 

I will not, I told them, lecture to you, and I will not prove the theorems for you. 
I'll tell you, bit by bit as we go along, what the words mean, and I might from time 
to time indicate what this subject has to do with other parts of mathematics, but 
most of the classroom work will have to be done by you. I am challenging you to 
discover the proofs for yourselves, I am putting you on your honor not to look 
them up in a book or get outside help in any other way, and then I’ll ask you to 
present in class the proofs you have discovered. The rest of you, the ones who are 
not doing the presenting, are supposed to stay on your toes mercilessly—make 
sure that the speaker gives a correct and complete proof, and demand from the 
speaker whatever else is appropriate for understanding (such as examples and 
counterexamples).

They stared at me, bewildered and upset—perhaps even hostile. They had 
never heard of such a thing. They came here to learn something and now they 
didn’t believe they would. They suspected that I was trying to get away with 
something, that I was trying to get out of the work I was paid to do. I told them 
about R. L. Moore, and they liked that, that was interesting. Then I gave them the 
basic definitions they needed to understand the statements of the first two or three 
theorems, and said “class dismissed”.

It worked. At the second meeting of class I said, “O.K., Mr. Jones, let’s see you
prove Theorem 1”, and I had to push and drag them along before they got off the 
ground. After a couple of weeks they were flying. They liked it, they learned from 
it, and they entered into the spirit of research—competition, discouragement, 
glory, and all. 

If you are a teacher and a possible convert to the Moore method, don’t make 
the mistake that my students made: don’t think that you, the teacher, will do less 
work that way. It takes me a couple of months of hard work to prepare for a 
Moore course, to prepare the fifty theorems, or whatever takes their place. I have 
to chop the material into bite-sized pieces, I have to arrange it so that it becomes 
accessible, and I must visualize the course as a whole—what can I hope that they 
will have learned when it’s over? As the course goes along, I must keep preparing 
for each meeting: to stay on top of what goes on in class, I myself must be able to 
prove everything. In class I must stay on my toes every second. I must not only be 
the moderator of what can easily turn into an unruly debate, but I must understand
what is being presented, and when something fishy goes on I must interrupt
with a firm but gentle “Would you explain that please? —I don’t understand.” 

Let me conclude by calling attention to a curious aspect of what I am recom- 
mending, an aspect visible in my urging attention to the concrete special case in 
order to understand the sweeping broad generalization. In effect I am saying that 
we do not, we cannot understand a vacuum—what we understand is always, in a 
sense, a fact—and, therefore, just as what cannot be taught without how, and how 
cannot be taught without why, the question has come full circle around to its start, 
and it turns out that why cannot be taught without what. 

Facts, methods, and insights—all are essential to all of us, all enter all our 
subjects, and our principal job as teachers is to sort out the what, the how, and the 
why, point the student in the right direction, and then, especially when it comes to
the why, stay out of his way so that he may proceed full steam ahead. 

Having expressed my strong views about why we should teach problem courses, 
I call your attention to my major omission: I haven’t said a single word about how 
to do that. How, exactly, does one go about teaching a problem course in freshman 
calculus, or, for that matter, in freshman rhetoric, or junior history, or graduate 
astronomy? I don’t pretend to know all the answers, but I have been working at 
finding them for many years. If you extended my lecture time by another hour or 
so, or, to be more realistic, by another month or so, I could try to tell you about 
some of the techniques that I have luckily blundered into. My presentation today 
was intended to touch briefly on the “why” of such teaching, not the “how”. That 
will have to wait for our next meeting; just tell me when and where it is, and I'll
start packing my bag right away. 


Department of Mathematics 
Santa Clara University 

Santa Clara, CA 95053-2999 
Phalmos@SCUACC.SCU.EDU 


An Exchange of Letters 


Dear Ms. vos Savant: 


This refers to your piece on the Fermat problem. 

Mathematicians do not reject a hyperbolic method of squaring the circle, but accept 
it. Although squaring the circle is not possible in euclidean geometry, it is possible in 
hyperbolic geometry. Both these statements are true: Truth in mathematics is relative 
to the axiom system. An (imperfect) analogy: although the French say “la lune” where
we say “the moon”, both terms are correct. Correctness in diction is relative to the
language. 

Other weak spots in your piece are the suggestions that Wiles’s proof is 
“hyperbolic” and that elliptic geometry might somehow be “in error.” Given that the
appreciation of mathematics by our citizenry is already fragile and tottering, there is 
no need to advertise your own misconceptions. Is it truly amazing that a conclusion 
about ordinary numbers...is derived from arguments in geometry? A hint as to how 
this can possibly be would help readers see that mathematics is a vibrant subject with 
an elegant and beautiful internal structure. 

Sincerely yours, 
Leonard Gillman 


Dear Dr. Gillman:

I’ve read your letter and quite a few others that essentially agree with your
conclusion, albeit through very different chains of reasoning, many of them inherently
contradictory. I’m sure my mathematician friends would be surprised to read them.
They are an education in themselves, and I feel fortunate to be a witness to what has
unexpectedly become a fascinating look into the private world of the modern 
mathematical minds. 

I appreciate them all, I learn from them all, and I thank you all for taking the time 
to write. 

Sincerely, 
Marilyn vos Savant 
What Is Wrong With the Definition
of dy/dx?


Hugh Thurston 


We shall use the notations dy/dx and f’(x) freely and interchangeably. [1] 
The fact that dy/dx and f’(x) are not interchangeable is evident when you consider that one
does not write dy/d3 for f’(3). [2]


For a start, the definition is incomplete. It is always a good idea to know what we 
are talking about, but definitions of dy/dx do not say what the x and y are; in 
contrast, definitions of f’(x) make it clear that f is a function and x a number. 

Secondly, the definition is ambiguous. Most texts describe dy/dx as another
notation for f’(x) where y = f(x). For this to be valid they should prove that if
f(x) = g(x) then f’(x) = g’(x), but they don’t. Indeed, it is hard to prove
anything about dy/dx without knowing what x and y are. All we can say for
certain is that they are not numbers: f’(3) cannot be denoted by dy/d3.

Ambiguity breeds paradox. Whatever x may be, it is something that has
values: in the familiar formula $dy/dx|_{x=c}$, c is a value of x. Moreover, y can be
constant; we all know that if y is constant then dy/dx is zero. The formula

\[ \frac{dx}{dy} = 1 \Big/ \frac{dy}{dx} \]

shows that x and y are the same kind of entity, so in principle x can be constant.
Suppose that x is constant with value 1. Then x² = x³, dx²/dx = dx³/dx, and
2x = 3x². But 2x has value 2 and 3x² has value 3. The obvious objection is that it
is nonsense to differentiate with respect to a constant: we cannot have a rate of
change with respect to something that is not changing. This objection may be
obvious, but it is not valid; if x cannot be constant in dy/dx there should be
something in the definition that implies this.

So what are x and y? We can take a hint from the fact that they have values. 
There is a familiar entity that has values—the function. Or we can make the 
reasonable suggestion that the x and y in dy/dx are the same as in dx and dy. In 
the modern (Fréchet) theory of differentials, x and y are functions. We don’t 
apply Fréchet theory to elementary calculus, but if we did the x and y would be 
the familiar type of function whose values and arguments are real numbers. From 
now on by “function” I shall mean this type of function. 

Now what is dy/dx? First, look to Leibniz. His dx was an infinitesimal
increment in x. An increment in x is x(c + h) − x(c) and the corresponding
increment in y is y(c + h) − y(c). Then the value of dy/dx at c is

\[ \frac{y(c+h) - y(c)}{x(c+h) - x(c)} \tag{1} \]

where h is infinitesimal. Nowadays instead of having h infinitesimal we have it
approach zero. Then (1) becomes y′(c)/x′(c) if x and y are differentiable at c and
x′(c) ≠ 0.

We find the same result if we consider tangents: the slope of a secant of the 
graph of y against x is (1), and the slope of the tangent is its limit. Rates of 
change lead to the same result: if x and y represent two quantities that vary with 
time the average rate of change of the second quantity with respect to the first 
between times c and c +h is (1) and the instantaneous rate of change is its limit. 

All this suggests the following definition. 


Definition. If x and y are functions, dy/dx is y'/x’. 


From this definition we can prove the familiar rules of differentiation, no longer 
as mere rules but as properly-stated theorems. It is convenient to use the 
not-uncommon figures of speech “f exists at c” for “f(c) exists” and “f = g at c” 
for “f(c) = g(c)”. 

For example:

Theorem (chain rule). If x, y and z are functions,

\[ \frac{dz}{dx} = \frac{dz}{dy}\,\frac{dy}{dx} \]

wherever the right-hand side exists.

Proof: If the right-hand side exists at c, then

\[ \left(\frac{dz}{dy}\,\frac{dy}{dx}\right)(c) = \frac{dz}{dy}(c)\,\frac{dy}{dx}(c) = \frac{z'(c)\,y'(c)}{y'(c)\,x'(c)} = \frac{z'(c)}{x'(c)} = \frac{dz}{dx}(c). \]

We can also say clearly and definitely:

Theorem. If x is constant, dy/dx does not exist anywhere.
Proof: x′ has the value 0.

Theorem. If x is increasing on an interval I and dy/dx is negative on I then y is
decreasing on I.

There are analogous results if x is decreasing or dy/dx is positive or both. The
proofs are obvious.

Our definition legitimizes the use of parametric and implicit differentiation. For
example,

\[ x(t) = 2\cos t, \qquad y(t) = \sin t \]

is a parametrization of the ellipse x² + 4y² = 4. We have

\[ \left.\frac{dy}{dx}\right|_{t} = \frac{y'(t)}{x'(t)} = -\frac{\cos t}{2\sin t}, \]

giving the slope at (2 cos t, sin t). This calculation, and the corresponding implicit
one, occur in texts, but in any text which defines dy/dx only where y is a function
of x they are necessarily invalid.
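
As a numerical sanity check (a sketch, not part of the original argument), the following Python fragment evaluates dy/dx = y′/x′ for the ellipse parametrization above and compares it with the slope −x/(4y) obtained by differentiating x² + 4y² = 4 implicitly. The function names are illustrative choices, not notation from the article.

```python
import math

def x(t): return 2.0 * math.cos(t)
def y(t): return math.sin(t)
def x_prime(t): return -2.0 * math.sin(t)
def y_prime(t): return math.cos(t)

def leibniz_derivative(t):
    """dy/dx at parameter value t, defined as y'(t)/x'(t)."""
    xp = x_prime(t)
    if xp == 0.0:
        # Matches the theorem that dy/dx does not exist where x' vanishes.
        raise ZeroDivisionError("dy/dx does not exist where x' = 0")
    return y_prime(t) / xp

def implicit_slope(px, py):
    """Slope from x^2 + 4y^2 = 4: 2x + 8y*y' = 0, so y' = -x/(4y)."""
    return -px / (4.0 * py)

if __name__ == "__main__":
    for t in (0.3, 1.0, 2.5):
        print(t, leibniz_derivative(t), implicit_slope(x(t), y(t)))
```

The two columns printed should agree to rounding error at every parameter value where sin t ≠ 0.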
Finally how does our definition relate to the traditional one? First, what is f(x)
if x is a function? It is the function t → f(x(t)). For example, the speed of a body
moving with positive acceleration is a function of the distance covered: if y denotes
the speed and x the distance, then y = f(x) for some f. (If the acceleration a is
constant, f(x) = √(2ax).) At time t, when the distance is x(t), the speed is
f(x(t)), so that y(t) = f(x(t)). 

It follows that if y = f(x) then y’ = f’(x)x’ wherever the right-hand side exists, 
and so dy/dx = f'(x) wherever the right-hand side exists and x’ has a non-zero 
value. 
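
The constant-acceleration example can be checked the same way. The sketch below assumes a body starting from rest with a hypothetical acceleration A = 3 (any positive value would do), so that x(t) = At²/2 and y(t) = At; it then compares y′/x′ with f′(x) for f(x) = √(2Ax).

```python
import math

A = 3.0  # hypothetical constant acceleration, chosen only for illustration

def x(t): return 0.5 * A * t * t   # distance covered, starting from rest
def y(t): return A * t             # speed

def f(u):
    """Speed as a function of distance: f(x) = sqrt(2*A*x)."""
    return math.sqrt(2.0 * A * u)

def f_prime(u):
    """Ordinary derivative of f."""
    return A / math.sqrt(2.0 * A * u)

def leibniz_derivative(t):
    """dy/dx at time t, defined as y'(t)/x'(t) with y' = A and x' = A*t."""
    return A / (A * t)

if __name__ == "__main__":
    for t in (0.5, 2.0, 7.0):
        print(t, leibniz_derivative(t), f_prime(x(t)))
```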

Adopting the definition suggested here would not alter the well-known and 
universally-accepted formulas involving the Leibnizian derivative, but it would give 
them a sound basis. 
PICTURE PUZZLE
(from the collection of Paul Halmos) 


They frequently collaborated, but these photos were taken far apart: 
the first in 1986 and the second in 1938. 
(see page 923.) 
the possibility of a Chinese origin for the Hindu-Arabic numeral system (Wylie,
1897). The conjecture was echoed by early 20th century scholars (Fellows, 1921). In 
most recent times, this claim has been made most succinctly in the works of Wang 
Ling and Joseph Needham. In a 1958 paper delivered in Adelaide, Wang presented
a detailed case for a Sino origin of the “Hindu-Arabic” numerals and
pointed to the strong possibility of westward transmission to India. Wang’s theory 
was further amplified in his collaborative work with Joseph Needham. Science and 
Civilization in China, vol. 3, devotes several pages (pp. 146-150) to this very issue 
and the phenomena of “stimulus diffusion”. Needham’s work clearly indicates the
need for further research clarification as to the status of early Hindu mathematics 
and the possibility of cultural transmissions. It is exactly this research that must be 
undertaken to strengthen the claim for a Chinese genesis of our numeral system 
and, unfortunately, it is exactly this research that is lacking in Fleeting Footsteps. 
What was the status of ancient Indian mathematics during the Warring States 
period of Chinese history? How were the numerals used in ancient India? Could 
the Chinese have obtained their mathematical knowledge from India?—after all, 
Buddhism was an intellectual import from China’s western neighbor. These are 
some of the issues and questions that must be addressed in positing a claim of a 
Chinese origin for the “Hindu-Arabic” numeral system and they remain missing
footsteps in the path this book has taken. 

Despite the inability to develop and strengthen its major premise, Fleeting 
Footsteps is a valuable resource for understanding early Chinese mathematics. Its 
sections devoted to the Sun Zi suanjing and its problems and to rod calculations 
provide historical information and insights previously unknown to a large audience. 
Although the “Fleeting Footsteps” themselves remain elusive, the book that bears 
this name is highly recommended for general reading and library acquisition. 
Philip Hartman and Aurel Wintner
A Stochastic Approach
to the Gamma Function 


Louis Gordon 


1. INTRODUCTION. A measure of the gamma function’s importance and ubiquity
is that Erdelyi et al. (1981) give it pride of place in their encyclopedic
treatment of special functions. Davis (1959) places the gamma function in a 
masterful historical context. Artin (1964) develops its elementary theory as a 
function on the real line in a beautiful exposition whose unifying theme is the 
solution of functional equations. Whittaker and Watson (1927) provide an exten- 
sive treatment of the gamma function of complex argument. 

Our purpose in this study is to provide a rigorous and unified exposition of the 
theory of the gamma function by elementary probabilistic means. Because an 
understanding of the gamma function is crucial to understanding much of statistical
theory, such a treatment should be useful. In the process, we also illustrate a
useful style of argument—to obtain a result about distributions, realize random 
variables in the most convenient fashion, evaluate probabilities for these variates, 
then express the results in terms of distributions, where the variates’ provenance is
irrelevant. We also hope that some of the bounds we develop in discussing the 
Stirling formula may be of interest in their own right. 

We start by defining the gamma function for t > 0 as Euler’s second integral:
$\Gamma(t) = \int_0^\infty x^{t-1} e^{-x}\,dx$. That $\Gamma(t)$ satisfies the fundamental functional
equation $\Gamma(t+1) = t\,\Gamma(t)$ follows immediately from integration by parts. That the
normal probability density $(2\pi)^{-1/2} e^{-x^2/2}$ integrates to 1 allows us to evaluate
$\Gamma(1/2) = \sqrt{\pi}$. From this probabilistic foundation will rapidly follow the Legendre
duplication formula, Stirling’s formula, Gauss’ multiplication theorem, the
Newman product, the reflection formula and Euler’s sine product.


2. THE GAMMA FUNCTION FOR POSITIVE t. We write $X_t$ for a gamma-distributed
random variable with shape parameter t > 0. Its density is
$x^{t-1}e^{-x}/\Gamma(t)$ for x > 0. We assume we are given an unlimited supply of independent
unit exponential random variables $Y_j$, each having density $e^{-x}$ on the positive
half line. Given random variables S and T, we write $S \stackrel{\text{stoch}}{=} T$ for stochastic
equality, namely that the two random variables have the same distribution.

We formulate many results in terms of the logarithms of gamma variates. The 
following technical lemma is useful in dealing with various remainder terms. We 
then give the identity on which our entire presentation is based. 


Lemma 1. If $X_t$ is gamma-distributed with shape t > 0 then $E\{\log(X_t/t)\} < 0$. If
t > 1 then $E\{\log(X_t/t)\} > -1/(t-1)$. In addition,

\[ E\{\log^2(X_t/t)\} < 2t/(t-2)^2 \quad \text{if } t > 2. \tag{1} \]

Proof: For the first two inequalities, apply $1 - x^{-1} \le \log x \le x - 1$, valid
for x > 0, to $\log(X_t/t)$, and take expectations. For the last, use
$\log^2 x \le (1 - x^{-1})^2 + (x - 1)^2$. □


We now present the fundamental representation in Theorem 2. The second 
identity (3) is too pretty and suggestive to pass up. In reality, when we have to 
rearrange series or compute expectations, we always go back to (2), where the real 
dirty work gets done. 


Theorem 2. The logarithm of a gamma-distributed variate $X_t$ with shape parameter t
can be decomposed into a sum of independent unit exponential variates $Y_j$ and an
independent remainder term. Specifically, for n ≥ 1,

\[ \log(X_t) \stackrel{\text{stoch}}{=} -\gamma + \sum_{j=0}^{n-1}\left[\frac{1}{j+1} - \frac{Y_j}{j+t}\right] + \log\bigl(X_{t+n}/(t+n)\bigr) + d_{n,t} \tag{2} \]

\[ \log(X_t) \stackrel{\text{stoch}}{=} -\gamma + \sum_{j=0}^{\infty}\left[\frac{1}{j+1} - \frac{Y_j}{j+t}\right] \tag{3} \]

where $X_{t+n}$ is gamma-distributed with shape t + n, independent of the other summands
in the representation, γ is the Euler constant, and the scalar remainder satisfies
$|d_{n,t}| \le (t+1)/n$.


Proof: We use the elementary fact that if $X_t$ is gamma-distributed and Y is
independently exponential then $X_t + Y$ is gamma-distributed with shape t + 1,
independent of the ratio $R = X_t/(X_t + Y)$, which has density $f_t(r) = t\,r^{t-1}$
concentrated on the interval (0, 1). Directly evaluate the distribution function to show
that $R^t$ has a uniform distribution.

Let $S_0 = 0$ and $S_n = \sum_{j=1}^{n} Y_j$. For any n ≥ 1,

\[ \log(X_t) = \log\prod_{j=1}^{n}\frac{X_t + S_{j-1}}{X_t + S_j} + \log(X_t + S_n) \stackrel{\text{stoch}}{=} \sum_{j=1}^{n}\log\bigl(U_j^{1/(t+j-1)}\bigr) + \log(X_{t+n}), \]

where $U_1, U_2, \ldots$ stand for independent uniform variates on (0, 1). Denote by $h_n$
the sum of the first n terms in the harmonic series. Observe that $-\log(U_j) \stackrel{\text{stoch}}{=} Y_j$,
so that

\[ \log(X_t) \stackrel{\text{stoch}}{=} \log(t+n) - h_n + \sum_{j=0}^{n-1}\left[(j+1)^{-1} - Y_j\,(j+t)^{-1}\right] + \log\bigl(X_{t+n}/(t+n)\bigr). \]

Let $d_{n,t} = \gamma + \log(t+n) - h_n$, with limit 0 as n → ∞. Note
$|d_{n+1,t} - d_{n,t}| \le (t+1)/(n(n+1))$. The bound is a term in a telescoping series, proving (2).

The remainder $\log(X_{t+n}/(t+n))$ is negligible in mean-square because of (1).
Assertion (3) follows from the Chebychev inequality. □
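
A quick numerical illustration of (2), offered as a sketch rather than as part of the proof: taking expectations in (2) gives E{log X_t} = log(t + n) − Σ_{j=0}^{n−1} 1/(j + t) + E{log(X_{t+n}/(t + n))}, and Lemma 1 traps the last term between −1/(t + n − 1) and 0. The Python below computes that bracket and compares it with a finite-difference derivative of `math.lgamma`; the helper names and the step size h are assumptions made for the example.

```python
import math

def digamma_bracket(t, n):
    """(lower, upper) bounds for E{log X_t} from (2) combined with Lemma 1.

    Requires t + n > 1 so that the lower bound of Lemma 1 applies.
    """
    partial = sum(1.0 / (j + t) for j in range(n))
    upper = math.log(t + n) - partial      # remainder expectation is < 0
    lower = upper - 1.0 / (t + n - 1.0)    # remainder expectation is > -1/(t+n-1)
    return lower, upper

def digamma_numeric(t, h=1e-5):
    """Central-difference approximation to (d/dt) log Gamma(t)."""
    return (math.lgamma(t + h) - math.lgamma(t - h)) / (2.0 * h)

if __name__ == "__main__":
    for t in (1.0, 2.5, 7.0):
        print(t, digamma_bracket(t, 1000), digamma_numeric(t))
```

The bracket has width 1/(t + n − 1), so it tightens at the rate 1/n, consistent with the remainder bound stated in Theorem 2.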


We now use the representation to derive many of the important properties of 
the gamma function. Start by observing that the first and second logarithmic 
derivatives of the gamma function—the digamma and trigamma functions—are 
respectively the expectation and variance of $\log X_t$:

\[ E\{\log X_t\} = \int_0^\infty \log(x)\,x^{t-1}e^{-x}\,dx \,\Big/\, \Gamma(t) = \frac{d}{dt}\log\Gamma(t) = \psi(t) \tag{4} \]

and

\[ \operatorname{Var}\{\log X_t\} = \frac{\int_0^\infty \log^2(x)\,x^{t-1}e^{-x}\,dx}{\Gamma(t)} - \left[\frac{\int_0^\infty \log(x)\,x^{t-1}e^{-x}\,dx}{\Gamma(t)}\right]^2 = \psi'(t). \tag{5} \]

The following identities are of fundamental importance. The first is due to
Wilks (1932), the second is the Legendre duplication formula.


Theorem 3. For t > 0, let $X_t$ and $X_{t+1/2}$ be independent gamma variates. Then

\[ 2^2\,X_t\,X_{t+1/2} \stackrel{\text{stoch}}{=} X_{2t}^2 \tag{6} \]

\[ 2^{2t-1}\,\Gamma(t)\,\Gamma(t+1/2) = \Gamma(2t)\,\Gamma(1/2). \tag{7} \]


Proof: As in the classical proofs, break (2) into a stochastic piece of odd-numbered
terms, a stochastic piece of even-numbered terms, and a scalar remainder. Next
take limits to obtain

\[ 2\log(X_s) \stackrel{\text{stoch}}{=} \log(X_{s/2}) + \log(X_{(s+1)/2}) + 2\log(2). \]

Wilks’ theorem follows by the substitution 2t = s. Finally, set t = 1/2, raise both
sides of (6) to the r power, and take expectations to obtain

\[ \frac{\Gamma(2r+1)}{\Gamma(1)} = E\{X_1^{2r}\} = 2^{2r}\,E\{X_{1/2}^{r}\}\,E\{X_1^{r}\} = 2^{2r}\,\frac{\Gamma(r+1/2)}{\Gamma(1/2)}\,\frac{\Gamma(r+1)}{\Gamma(1)}. \]

The assertion follows because Γ(r + 1) = rΓ(r). □
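
The duplication formula (7) is easy to spot-check numerically. The sketch below uses only `math.gamma` from the Python standard library; the function name `duplication_gap` is an illustrative choice.

```python
import math

def duplication_gap(t):
    """Relative difference between the two sides of the duplication formula (7)."""
    lhs = 2.0 ** (2.0 * t - 1.0) * math.gamma(t) * math.gamma(t + 0.5)
    rhs = math.gamma(2.0 * t) * math.gamma(0.5)
    return abs(lhs - rhs) / rhs

if __name__ == "__main__":
    for t in (0.3, 1.0, 2.7, 10.0):
        print(t, duplication_gap(t))
```

The printed gaps should be at the level of floating-point rounding.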


Next is Stirling’s formula. With little effort we provide inequalities instead of 
the usual asymptotics. 


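Theorem 4. Let $X_t$ be gamma-distributed with shape parameter t. For all 0 < t < ∞,
$\psi'(t) = \operatorname{Var}\{\log(X_t)\} = \sum_{j=0}^{\infty}(t+j)^{-2}$ and $\psi(t) - \log(t) \to 0$ as t → ∞. In addition,

\[ \frac{1}{t} + \frac{1}{2t^2} + \frac{1}{6(t+1/14)^3} < \psi'(t) < \frac{1}{t} + \frac{1}{2t^2} + \frac{1}{6t^3}, \]

\[ \frac{1}{t} + \frac{1}{2t^2} + \frac{1}{6t^3} - \frac{1}{30t^5} < \psi'(t) < \frac{1}{t} + \frac{1}{2t^2} + \frac{1}{6t^3} - \frac{1}{30(t+1/8)^5}. \tag{8} \]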


Proof: Compute the moments of $\log(X_t)$ from (2), (4), (5), Lemma 1 and Theorem
2, noting the latter involves independent variates, each with variance 1 and an
independent remainder term with negligible variance. The proof of (8) is motivated
by the desire to approximate the variance with an integral by means of the
trapezoid rule. Formally, write

\[ \psi'(t) = \frac{1}{t} + \frac{1}{2t^2} + \frac{1}{2}\sum_{j=0}^{\infty}\left[\frac{1}{(t+j)^2} + \frac{1}{(t+j+1)^2} - 2\left(\frac{1}{t+j} - \frac{1}{t+j+1}\right)\right] \]
\[ \phantom{\psi'(t)} = \frac{1}{t} + \frac{1}{2t^2} + \frac{1}{2}\sum_{j=0}^{\infty}\frac{1}{(t+j)^2\,(t+j+1)^2}. \tag{9} \]

Take ξ > 0 and verify by subtraction the two inequalities

\[ \bigl((\xi+1/14)^{-3} - (\xi+15/14)^{-3}\bigr)/3 < \xi^{-2}(\xi+1)^{-2} < \bigl(\xi^{-3} - (\xi+1)^{-3}\bigr)/3, \]

the first by showing $10218656\,\xi^3 + 1240288\,\xi^2 - 2087596\,\xi + 453600 > 0$. The top
pair of inequalities in (8) follow from (9). The bottom ones follow similarly from

\[ -\bigl(\xi^{-5} - (\xi+1)^{-5}\bigr)/15 < \xi^{-2}(\xi+1)^{-2} - \bigl(\xi^{-3} - (\xi+1)^{-3}\bigr)/3 < -\bigl((\xi+1/8)^{-5} - (\xi+9/8)^{-5}\bigr)/15. \quad\Box \]
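
The elementary inequalities used in this proof can be checked in exact rational arithmetic. The following sketch (not a proof, only a grid check) uses Python's `fractions.Fraction` so that no rounding enters; the grid of test points and the function names are assumptions made for the illustration.

```python
from fractions import Fraction

def inv_pow(x, k):
    """x**(-k) as an exact Fraction."""
    return 1 / x ** k

def top_pair_holds(xi):
    """Both inequalities used for the top pair of (8)."""
    middle = inv_pow(xi, 2) * inv_pow(xi + 1, 2)
    lower = (inv_pow(xi + Fraction(1, 14), 3) - inv_pow(xi + Fraction(15, 14), 3)) / 3
    upper = (inv_pow(xi, 3) - inv_pow(xi + 1, 3)) / 3
    return lower < middle < upper

def bottom_pair_holds(xi):
    """Both inequalities used for the bottom pair of (8)."""
    middle = inv_pow(xi, 2) * inv_pow(xi + 1, 2) - (inv_pow(xi, 3) - inv_pow(xi + 1, 3)) / 3
    lower = -(inv_pow(xi, 5) - inv_pow(xi + 1, 5)) / 15
    upper = -(inv_pow(xi + Fraction(1, 8), 5) - inv_pow(xi + Fraction(9, 8), 5)) / 15
    return lower < middle < upper

def cubic(xi):
    """The cubic whose positivity the proof invokes."""
    return 10218656 * xi ** 3 + 1240288 * xi ** 2 - 2087596 * xi + 453600

if __name__ == "__main__":
    grid = [Fraction(k, 10) for k in range(1, 301)]
    print(all(top_pair_holds(xi) and bottom_pair_holds(xi) and cubic(xi) > 0 for xi in grid))
```

Passing on a grid does not replace the algebraic verification, but it is a useful guard against transcription errors in the constants 1/14 and 1/8.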
We now prove bounds which include Stirling’s formula, (11), as a special case. 


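Theorem 5. For all t > 0,

\[ \log(t) - \frac{1}{2t} - \frac{1}{12t^2} < \psi(t) < \log(t) - \frac{1}{2t} - \frac{1}{12(t+1/14)^2}, \]

\[ \log(t) - \frac{1}{2t} - \frac{1}{12t^2} + \frac{1}{120(t+1/8)^4} < \psi(t) < \log(t) - \frac{1}{2t} - \frac{1}{12t^2} + \frac{1}{120t^4}. \tag{10} \]

In addition,

\[ \frac{1}{12t + 6/7} < \log\left(\frac{\Gamma(t)}{\sqrt{2\pi}\,t^{t-1/2}e^{-t}}\right) < \frac{1}{12t}, \]

\[ \frac{1}{12t} - \frac{1}{360t^3} < \log\left(\frac{\Gamma(t)}{\sqrt{2\pi}\,t^{t-1/2}e^{-t}}\right) < \frac{1}{12t} - \frac{1}{360(t+1/8)^3}. \tag{11} \]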


Proof: From Theorem 4,

\[ \int_t^\infty\left[\frac{1}{2x^2} + \frac{1}{6(x+1/14)^3}\right]dx < \int_t^\infty\left[\psi'(x) - \frac{1}{x}\right]dx < \int_t^\infty\left[\frac{1}{2x^2} + \frac{1}{6x^3}\right]dx, \]

proving the upper pair of inequalities in (10). The lower pair are proved similarly.
To prove (11), we integrate again. We may conclude from (10) that for fixed t
the integral $\int_t^y \{\psi(x) - [\log(x) - (2x)^{-1}]\}\,dx$ converges monotonically to a finite
limit, say c, as y → ∞. Hence, the upper pair of inequalities in (10) yields

\[ c + \frac{1}{12t + 6/7} < \log\left(\frac{\Gamma(t)}{t^{t-1/2}e^{-t}}\right) < c + \frac{1}{12t}. \tag{12} \]

Similarly integrating the lower pair of inequalities in (10) yields the lower pair in
(11).
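
The Stirling bounds (11) can be tested directly against `math.lgamma`. The sketch below assumes the constant in Stirling's formula is √(2π), as the text goes on to show; the function names are illustrative.

```python
import math

def stirling_log_ratio(t):
    """log( Gamma(t) / (sqrt(2*pi) * t**(t - 1/2) * exp(-t)) )."""
    return math.lgamma(t) - (0.5 * math.log(2.0 * math.pi) + (t - 0.5) * math.log(t) - t)

def bounds_11(t):
    """The two (lower, upper) pairs of (11)."""
    first = (1.0 / (12.0 * t + 6.0 / 7.0), 1.0 / (12.0 * t))
    second = (1.0 / (12.0 * t) - 1.0 / (360.0 * t ** 3),
              1.0 / (12.0 * t) - 1.0 / (360.0 * (t + 0.125) ** 3))
    return first, second

if __name__ == "__main__":
    for t in (0.5, 1.0, 2.0, 5.0, 10.0):
        print(t, stirling_log_ratio(t), bounds_11(t))
```

For large t the second pair becomes very tight (its width is of order t⁻⁴), so a floating-point check is only meaningful for moderate t.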
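We now use the Legendre duplication formula (7) to show $e^c = \sqrt{2\pi}$. From
(12),

\[ e^c = \lim_{t\to\infty}\frac{\Gamma(2t)}{(2t)^{2t-1/2}\,e^{-2t}} = \lim_{t\to\infty}\frac{\Gamma(t)\,\Gamma(t+1/2)\,2^{2t-1}\,\pi^{-1/2}}{(2t)^{2t-1/2}\,e^{-2t}} \]
\[ \phantom{e^c} = \lim_{t\to\infty}\frac{1}{\sqrt{2\pi}}\;\frac{\Gamma(t)}{t^{t-1/2}e^{-t}}\;\frac{\Gamma(t+1/2)}{(t+1/2)^{t}e^{-(t+1/2)}}\;\bigl(1 + 1/(2t)\bigr)^{t}e^{-1/2} = \frac{e^{2c}}{\sqrt{2\pi}}. \]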


We next generalize Wilks’ identity (6) from which Gauss’ multiplication formula 
will follow. Note that Gauss’ formula (14) specializes to the Legendre duplication 
formula (7). 


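Theorem 6. Let p ≥ 2 be an integer and t > 0. Let $X_t, X_{t+1/p}, \ldots, X_{t+(p-1)/p}$ be
independent gamma variates. Then

\[ p^p\prod_{i=0}^{p-1}X_{t+i/p} \stackrel{\text{stoch}}{=} X_{pt}^{p} \tag{13} \]

\[ p^{pt-1/2}\prod_{i=0}^{p-1}\Gamma(t+i/p) = (2\pi)^{(p-1)/2}\,\Gamma(pt). \tag{14} \]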


Proof: Argue by sieving as in the proof of (7) to show there exists a constant c for
which

\[ p\log(X_{pt}) \stackrel{\text{stoch}}{=} -p\gamma + c + \sum_{i=0}^{p-1}\sum_{j=0}^{\infty}\left[\frac{1}{j+1} - \frac{Y_{pj+i}}{j+t+i/p}\right] \stackrel{\text{stoch}}{=} c + \sum_{i=0}^{p-1}\log(X_{t+i/p}) \]

with independent summands on the right. We take expectations to show $e^c = p^p$.
For t > 0,

\[ e^c\prod_{i=0}^{p-1}(t+i/p) = e^c\,E\Bigl\{\prod_{i=0}^{p-1}X_{t+i/p}\Bigr\} = E\{X_{pt}^{p}\} = \Gamma(pt+p)/\Gamma(pt) = \prod_{i=0}^{p-1}(pt+i). \]

We prove (14) by applying Stirling’s formula to (13). Set t = 1/p. Take expectations
to obtain $E\{[p^p\prod_{i=0}^{p-1}X_{(i+1)/p}]^r\} = E\{X_1^{pr}\}$, giving
$p^{pr}\prod_{i=1}^{p}\Gamma(r+i/p) = b\,\Gamma(pr+1)$ for all r > 0 and some constant, say b.
Take the limit as r → ∞ to show $b = (2\pi)^{(p-1)/2}p^{-1/2}$. Finally apply
Γ(x + 1) = xΓ(x) to each side of the equation, proving (14). □
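
Gauss' multiplication formula (14) can be spot-checked in the same spirit as the duplication formula. The sketch below relies on `math.gamma` and `math.prod` (Python 3.8 or later); `gauss_gap` is a name invented for the example.

```python
import math

def gauss_gap(p, t):
    """Relative difference between the two sides of (14) for integer p >= 2, t > 0."""
    lhs = p ** (p * t - 0.5) * math.prod(math.gamma(t + i / p) for i in range(p))
    rhs = (2.0 * math.pi) ** ((p - 1) / 2.0) * math.gamma(p * t)
    return abs(lhs - rhs) / rhs

if __name__ == "__main__":
    for p in (2, 3, 4, 5):
        for t in (0.4, 1.3, 3.0):
            print(p, t, gauss_gap(p, t))
```

With p = 2 the check reduces to the duplication formula (7), since Γ(1/2) = √π.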


The Newman product, used by Weierstrass as his definition of the gamma
function, follows from the fundamental representation as the moment generating
function of log(Y). It is of interest that −log(Y) has a Type I extreme value, or
Gumbel, distribution.
Theorem 7. For t > −1, we have

\[ E\{e^{t\log(Y)}\} = \Gamma(1+t) = e^{-\gamma t}\prod_{j=1}^{\infty}\frac{e^{t/j}}{1+t/j} \]

when Y has the unit exponential distribution.


Proof: Note that $E\{e^{-tY}\} = (1+t)^{-1}$. Use (2), with remainders given there, to
write

\[ \Gamma(t+1) = \int_0^\infty y^{t}e^{-y}\,dy = E\{e^{t\log(Y)}\} = e^{-\gamma t}\prod_{j=0}^{n-1}\frac{e^{t/(j+1)}}{1+t/(j+1)}\;e^{t\,d_{n,1}}\;E\left\{\left[\frac{X_{1+n}}{1+n}\right]^{t}\right\}. \]

We now deal with the stochastic remainder term. From the Stirling formula,
$E\{[X_m/m]^t\} = \Gamma(m+t)/(m^{t}\,\Gamma(m)) \to 1$ as m → ∞, proving the theorem. □
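
The finite-n identity in this proof is exact, which makes it convenient to test. In the sketch below, `newman_partial` is the truncated Newman product and `remainder_factor` is the product of the scalar term exp(t·d_{n,1}) with the moment Γ(1 + n + t)/((1 + n)^t Γ(1 + n)); both names, and the hard-coded value of Euler's constant, are assumptions of the example.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler's constant, to double precision

def newman_partial(t, n):
    """exp(-gamma*t) * prod_{j=1}^{n} exp(t/j) / (1 + t/j), for t > -1."""
    log_prod = -EULER_GAMMA * t + sum(t / j - math.log1p(t / j) for j in range(1, n + 1))
    return math.exp(log_prod)

def remainder_factor(t, n):
    """exp(t * d_{n,1}) * E{[X_{1+n}/(1+n)]**t}, with d_{n,1} = gamma + log(1+n) - h_n."""
    h_n = sum(1.0 / j for j in range(1, n + 1))
    d = EULER_GAMMA + math.log(1.0 + n) - h_n
    log_moment = math.lgamma(1.0 + n + t) - math.lgamma(1.0 + n) - t * math.log(1.0 + n)
    return math.exp(t * d + log_moment)

if __name__ == "__main__":
    t = 0.5
    for n in (10, 100, 10000):
        print(n, newman_partial(t, n), remainder_factor(t, n), math.gamma(1.0 + t))
```

The product of the two factors reproduces Γ(1 + t) for every n, while the remainder factor alone drifts to 1 as n grows, which is the content of the theorem.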


The logistic density $e^{-x}(1+e^{-x})^{-2}$ arises, for example, when studying stochastic
models of choice. Direct calculation shows that the random variable
$L = \log(U/(1-U))$ is logistic if U is uniformly distributed on [0, 1]. From the
representation $U \stackrel{\text{stoch}}{=} Y_1/(Y_1+Y_2)$, it follows that $\log(Y_1) - \log(Y_2)$ has a logistic
distribution. We are therefore led to study the moment generating function for a logistic
variate by means of the Newman product. The result is the reflection formula for
the gamma function.


Theorem 8. Let $Y_1$ and $Y_2$ be independent unit exponential variates. For 0 < t < 1,

\[ E\{e^{t[\log(Y_1)-\log(Y_2)]}\} = \Gamma(1+t)\,\Gamma(1-t) = \prod_{j=1}^{\infty}\frac{1}{1-t^2/j^2} = \frac{\pi t}{\sin(\pi t)}. \]


Proof: Independence and our definition imply
$E\{e^{t(\log(Y_1)-\log(Y_2))}\} = \Gamma(1+t)\,\Gamma(1-t)$. The representation as an infinite
product is immediate from the Newman product, Theorem 7. The last equality is the
Euler sine product, the next theorem. □


We conclude the proof of the reflection formula by establishing Euler’s sine 
product. 


Theorem 9. For all real t, $\prod_{j=1}^{\infty}[1 - t^2/(\pi^2 j^2)] = \sin(t)/t$.


Proof: First observe that to pick a number at random from [0, 1], we might as well
pick the first m binary random digits uniformly at random, and then finish the job
by picking a number uniformly at random in the interval of length $2^{-m}$ to the right
of the chosen dyadic rational. Formally, let $B_j$ be independent Bernoulli variates
taking the values 0 or 1 with probability 1/2 each. For U uniform on [0, 1], we
have $U \stackrel{\text{stoch}}{=} \sum_{j=1}^{n}2^{-j}B_j + 2^{-n}U$. Now symmetrize about 0 to obtain
$2U - 1 \stackrel{\text{stoch}}{=} \sum_{j=1}^{n}2^{-j}(2B_j - 1) + 2^{-n}(2U - 1)$. The symmetrization makes it easy
to compute the characteristic functions of left and right sides, yielding

\[ \frac{\sin(t)}{t} = \frac{\sin(2^{-n}t)}{2^{-n}t}\prod_{j=1}^{n}\cos(2^{-j}t), \tag{15} \]

a truncated form of Vieta’s formula. Of course one could just as soon pull (15) out
of the air, beginning with sin(x)/x and iterating with sin(x) = 2 sin(x/2) cos(x/2).
Now rearrange (15) and apply the double-angle formula for the cosine to obtain

\[ \frac{\sin(t)}{2^{n}\sin(2^{-n}t)\cos(2^{-n}t)} = \prod_{j=1}^{n-1}\cos(2^{-j}t) = \prod_{k=1}^{n-1}Q_k\bigl(\sin^2(2^{-n}t)\bigr), \tag{16} \]

where the functions $Q_k(x)$ are polynomials in x of degree $2^{k-1}$. Hence the
product is a polynomial in $\sin^2(2^{-n}t)$ of degree $2^{n-1} - 1$. The first $2^{n-1} - 1$
positive zeros of the left-hand side occur at t = kπ for k = 1, 2, ..., $2^{n-1} - 1$.
Because the left-hand side of (16) is 1 when t = 0, and because sin(·) is strictly
increasing in (0, π/2), we see that

\[ \frac{\sin(t)}{2^{n}\sin(2^{-n}t)\cos(2^{-n}t)} = \prod_{k=1}^{2^{n-1}-1}\left(1 - \frac{\sin^2(2^{-n}t)}{\sin^2(k\pi 2^{-n})}\right). \]

The latter product converges to $\prod_{j=1}^{\infty}(1 - t^2/(j\pi)^2)$ because
$x > \sin(x) > x - x^3/6 > 0$ for x ∈ (0, π/2) implies $\sin^2(2^{-n}t)/\sin^2(k\pi 2^{-n})$ is bounded above by
$t^2/k^2$ when $k < 2^{n-1}$. □
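
The three finite identities in this proof lend themselves to a direct numerical check. The sketch below evaluates the right side of the truncated Vieta formula (15), the left side of (16), and the finite product over k that follows it; the function names are illustrative and `math.prod` requires Python 3.8 or later.

```python
import math

def vieta_rhs(t, n):
    """Right side of (15): sin(2^-n t)/(2^-n t) * prod_{j=1}^{n} cos(2^-j t), t != 0."""
    x = t / 2 ** n
    return (math.sin(x) / x) * math.prod(math.cos(t / 2 ** j) for j in range(1, n + 1))

def lhs_16(t, n):
    """Left side of (16): sin(t) / (2^n sin(2^-n t) cos(2^-n t))."""
    x = t / 2 ** n
    return math.sin(t) / (2 ** n * math.sin(x) * math.cos(x))

def finite_sine_product(t, n):
    """prod_{k=1}^{2^(n-1)-1} (1 - sin^2(2^-n t) / sin^2(k pi 2^-n))."""
    s2 = math.sin(t / 2 ** n) ** 2
    return math.prod(1.0 - s2 / math.sin(k * math.pi / 2 ** n) ** 2
                     for k in range(1, 2 ** (n - 1)))

if __name__ == "__main__":
    t = 1.3
    for n in (2, 6, 12):
        print(n, vieta_rhs(t, n), lhs_16(t, n), finite_sine_product(t, n), math.sin(t) / t)
```

For n = 2 the finite product has the single factor 1 − 2 sin²(t/4) = cos(t/2), which is a convenient hand check; as n grows the last two columns approach sin(t)/t.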


3. BIBLIOGRAPHIC NOTES. It is curious that the development presented here 
tends to flow in a backwards direction from customary presentations. For example, 
Wilks (1932) uses the Legendre duplication formula (7) to prove (6) by calculating 
moments and then appealing to deep results that tell when moments determine 
uniquely their distribution. Theorems 2, 3 and 6, with similar proofs, appear in 
Gordon (1989). Bondesson (1978) obtains (3) using characteristic functions. Our 
proof is closely related to Olshen and Savage’s (1968) theory of generalized 
unimodality. 

The first lower bound of (11) slightly improves Robbins’ (1955) useful and pretty 
lower bound 1/(12t+ 1). Robbins’ proof is for integers only. Diaconis and 
Freedman (1986) remark that the bound is difficult to generalize to non-integer 
values. We believe the first lower bound and second upper bound of (11) are new. 
The second lower bound and first upper bound of (8) are implicit in Fichtenholz 
(1964), Section 540. The corresponding inequalities in (10) and (11) are there given 
explicitly in Section 541. The use of the Legendre duplication formula to compute 
the constant in Stirling’s formula appears in Henrici (1991), Section 8.5. 

Related is work of Blyth and Pathak (1986) whose proof of the Stirling formula 
uses gamma variates and the central limit theorem. Other approaches to the 
Stirling formula are those of Namias (1986) by recursion based on the Legendre 
duplication formula, and of Berndt (1986) who relates log(Γ(x)) to the Hurwitz
zeta function; he also proves the Gauss multiplication and reflection formulas. 

Elementary proofs of Euler’s sine product are in Artin (1964) and Eberlein 
(1977). Artin analyzes the periodic function Γ(x)Γ(1 − x) sin(πx), after extending
the domain of Γ(·). Feller (1967) uses Artin’s proof to find the constant in
Stirling’s formula. Eberlein’s (1977) proof, whose roots he traces to Euler, uses a
clever approximation of sin(x)/x by a polynomial whose zeros can be explicitly 
determined. Our proof is a variant of Eberlein’s, using a polynomial whose study is 
naturally motivated in a probabilistic context. Vieta’s formula is the starting point 
for Kac’s (1959) monograph. 
Some time ago, in Riemann’s day,
Calculus was penned in a cumbersome way; 
seas of partials, indices too, 
Jacobian determinants spiced the stew. 
But now we have another way; 
linear maps have joined the fray. 


Though some feel wrath, me thinks that Math
hath truly felt a “breath of Fréchet.”


John A. Baker 
Arrangements and Topological Planes


Jacob E. Goodman, Richard Pollack, Rephael Wenger, 
Tudor Zamfirescu 


1. INTRODUCTION. Let Γ be a finite family of simple curves in the plane. When
is there a homeomorphism of the plane to itself that takes all the curves in Γ to
straight lines?

In the Euclidean plane, E², we are faced with the fact that two non-intersecting
curves in our family must map to two parallel lines. This introduces extraneous 
technical complications that only distract from the essence of the problem. As with 
many other geometric questions, it is much simpler to avoid the special cases 
caused by parallel lines by moving to the projective plane. The real projective 
plane P² is the Euclidean plane E² with an extra “line at infinity” adjoined, each
point of which represents a parallel direction in E². P² has the virtue of simplicity:
every pair of points determines a unique line which is topologically a circle (i.e., a 
simple closed curve), and every two lines meet at a unique point. Thus our 
question becomes: When is there a homeomorphism of P? to itself that simultane- 
ously straightens all the members of a finite family I’ of simple closed curves? 

Certainly a necessary condition is that each of the curves be “nicely” embedded 
in the plane. More precisely, for each curve there must be some homeomorphism 
that takes P² to itself and maps the curve to a straight line. In addition, every two
of our curves must meet exactly once, and cross at their point of intersection, just 
as straight lines do. Are these two conditions sufficient? The answer is no, and a 
counterexample can easily be constructed using Desargues’ theorem. 

Desargues’ theorem, one of the basic theorems of projective geometry, asserts 
that if the corresponding sides of two triangles meet at three collinear points, then 
the three lines joining corresponding vertices are concurrent: see Figure 1a. On the
other hand, Figure 1b is an example of an arrangement for which Desargues’ 
theorem fails: any homeomorphism of the plane to itself that mapped the ten 
curves in Figure 1b to straight lines would yield an arrangement of lines that 
violated Desargues’ theorem. Hence there is no homeomorphism of the plane to 
itself that simultaneously straightens the ten curves of Figure 1b. 
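
The statement of Desargues’ theorem used above can be checked numerically for straight lines. The following is only a minimal sketch in exact integer arithmetic on homogeneous coordinates; the triangle, the axis, and the helper names (cross, det3, desargues_check) are illustrative choices and do not come from the paper. It builds two triangles whose corresponding sides meet on a common line and confirms that the three lines joining corresponding vertices are concurrent.

```python
def cross(u, v):
    """Cross product of homogeneous triples: the line through two points,
    or dually the point common to two lines."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def det3(u, v, w):
    """Zero exactly when three points are collinear (or three lines concurrent)."""
    c = cross(v, w)
    return u[0] * c[0] + u[1] * c[1] + u[2] * c[2]


def desargues_check():
    # Triangle ABC and an axis line; all coordinates are illustrative.
    A, B, C = (0, 0, 1), (4, 0, 1), (1, 3, 1)
    axis = (1, 2, -12)                    # the line x + 2y = 12

    # Points where the sides of ABC meet the axis.
    X = cross(cross(A, B), axis)
    Y = cross(cross(B, C), axis)
    Z = cross(cross(C, A), axis)

    # Second triangle whose sides pass through X, Y, and Z respectively.
    A2 = (2, 1, 1)
    B2 = tuple(4 * a - x for a, x in zip(A2, X))   # another point on line A2 X
    C2 = cross(cross(B2, Y), cross(A2, Z))         # B2C2 through Y, C2A2 through Z

    # Lines joining corresponding vertices; Desargues says they are concurrent.
    joins = [cross(A, A2), cross(B, B2), cross(C, C2)]
    return joins, det3(*joins)
```

Because the arithmetic is exact, the returned determinant is exactly zero for any non-degenerate choice of straight lines. No such guarantee exists for pseudolines, which is the point of Figure 1b.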

Let us look for a moment at our two necessary conditions. A straight line l in P²
does not separate P², since any two points in P² \ {l} are connected by some path,
perhaps one crossing the line at infinity. (In contrast to this, a “small” circle does
separate P².) Thus if there is to be a homeomorphism of P² to itself which maps
some simple closed curve l′ to a straight line, then l′ must also not separate P²,
i.e., P² \ {l′} must be connected. It follows from Schoenflies’ Theorem (see [12], for
example) that the converse is true as well: If l′ is a simple curve that does not
separate P², then there is a homeomorphism taking P² to itself that maps l′ to a
straight line. Such a simple closed curve is called a pseudoline. A finite family of
pseudolines in P², with the property that any two meet exactly once (and necessarily
cross), is known as an arrangement of pseudolines.

Figure 1a. Desargues’ Theorem. Figure 1b. A non-stretchable arrangement.

Some of the main results of this paper were presented at the Eighth Annual ACM Symposium on
Computational Geometry in Berlin on June 11, 1992 [5].

To visualize an arrangement 𝒜 of pseudolines in P², one can model the
projective plane as a circular disk with opposite points identified. For this purpose,
remove some pseudoline l* ∈ 𝒜 from the projective plane. The remaining points
then form a space (the Euclidean plane!) homeomorphic to an open circular disk
in E². Call the closure of the disk Δ. Each point on l* corresponds to an antipodal
pair of points on the circle ∂Δ. A pseudoline in 𝒜 other than l* becomes a curve
connecting antipodal points in Δ. Thus an arrangement of pseudolines in P²
corresponds to a family of Jordan arcs connecting antipodal points on a circle,
every pair of arcs intersecting exactly once (or possibly at their endpoints); see
Figure 2.


Figure 2. An arrangement of pseudolines in the disk model of P².


Arrangements of pseudolines have been studied since the work of F. Levi [8], 
who first pointed out that, in spite of their resemblance to arrangements of straight 
lines, they are topologically more general objects. B. Grünbaum [6] published an
extensive monograph on arrangements in 1971, in which he answered many
questions about line- and pseudoline-arrangements and posed many others. It 
turns out that the study of arrangements of pseudolines is equivalent to that of 
oriented matroids of rank 3; see [1] for a definition and good introduction to 
oriented matroids, and in particular for a discussion of their relationship to 
pseudoline arrangements. 

As in any field of mathematics, we classify arrangements, i.e., we partition them 
into classes of similar or “isomorphic” ones. This is done by considering their 
topology: Two arrangements are called isomorphic if there is some homeomorphism
of P² to itself that maps the pseudolines in one arrangement to those in the
other. 

An arrangement 𝒜 of pseudolines, like an arrangement of straight lines,
induces a decomposition of the projective plane into a cell complex 𝒞(𝒜),
consisting of cells of dimension 2 (“faces”), 1 (“edges”), and 0 (“vertices”). An
isomorphism between two arrangements induces a one-to-one correspondence
between their cell complexes that preserves incidence, i.e., neighboring faces in
one arrangement map to neighboring faces in the other. The converse is also true:
Suppose 𝒜 and 𝒜′ are arrangements and there is a one-to-one correspondence
between 𝒞(𝒜) and 𝒞(𝒜′) that preserves incidence. Patching together homeomorphisms
between corresponding faces gives a homeomorphism of the plane to
itself that maps the pseudolines of one arrangement to the pseudolines of the
other. Thus isomorphism between arrangements is really just a combinatorial
relationship that can be defined solely in terms of the cells and their incidences.

The question we posed at the beginning can now be restated as follows: Is every 
arrangement of pseudolines isomorphic to some arrangement of straight lines? 
The example above based on Desargues’ theorem shows that the answer is “no”.
(G. Ringel showed [15] that even if we consider only arrangements with no 
multiple points, the answer still remains “no”.) An arrangement of pseudolines 
that is isomorphic to some arrangement of straight lines is called stretchable. 
Unfortunately, determining whether an arrangement of pseudolines is stretchable 
turns out to be quite a hard problem (in fact, NP-hard; see [10, 11, 17]).

While not every arrangement of pseudolines is stretchable, arrangements of 
pseudolines nevertheless share many properties with arrangements of straight 
lines. Given any arrangement of n straight lines we can always add another line 
through any two given points not both on the same line to form an arrangement of 
n + 1 straight lines. The same property holds for arrangements of pseudolines [8]. 

This result, known as the Levi Enlargement Lemma, implies that any arrangement
of n pseudolines can be extended to an arrangement of n + 1 pseudolines.
However, an arrangement of n straight lines can be extended to a much larger and
richer structure, the topological space consisting of all the lines in the plane. Each
line is a “point” in this space, and a neighborhood of the line through two points 
consists of all the lines through nearby pairs of points. 

Can every finite arrangement of pseudolines be embedded in some continuous 
family of pseudolines analogous to the set of all lines in the plane? Grünbaum
posed this question in [6]. What should be the properties of such a family of 
pseudolines? First, this family should form a topological space analogous to the 
space of all lines in the plane. Second, this set of pseudolines, together with the set 
of points in the projective plane, should obey the basic incidence axioms of 
geometry: any two points should lie on a unique pseudoline, and any two pseudo- 
lines should intersect in a unique point. Third, the geometric and topological 
properties should be linked as they are for lines: as two pseudolines vary continuously,
their point of intersection should vary continuously; as two points vary
continuously, the unique pseudoline they define should vary continuously as well. 

Such structures are known to exist, and have in fact been studied for nearly a 
century. They are known as topological projective planes. A topological projective 
plane Q, in the sense we will use the phrase, consists of P² as its underlying point
set, and a second topological space L(Q) consisting of simple closed curves in P²
as its set of “lines”; these satisfy the following conditions: 


1. for every two distinct points p, q ∈ Q there is a unique curve l(p, q) ∈ L(Q)
containing p and q;

2. every two distinct curves l, l′ ∈ L(Q) intersect in exactly one point at which
they cross;

3. l(p, q) varies continuously as a function of p and q;

4. l ∩ l′ varies continuously as a function of l and l′.


As before, we use the term pseudolines for the curves in L(Q). 

Just as the neighborhood of a straight line through two points consists of all the 
lines through nearby pairs of points, the neighborhood of a pseudoline consists of 
all pseudolines through nearby pairs of points. Since P² is compact, this is
equivalent to the topology induced by the Hausdorff metric on pseudolines: two
pseudolines are within distance ε of each other if each point of each pseudoline is
within ε of the other pseudoline; here the metric on P² is the one coming from the
standard metric on the sphere S² when antipodal points are identified.
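
The Hausdorff condition just described can be illustrated on finite samples of two curves. This is a rough sketch under simplifying assumptions: the curves are represented by finitely many sample points, and the distance used is the ordinary Euclidean one in a coordinate chart rather than the metric on P² induced from S². The function name hausdorff is a hypothetical helper.

```python
import math


def hausdorff(curve_a, curve_b):
    """Discrete Hausdorff distance between two finite point samples.

    Two sampled curves are within distance eps of each other exactly when
    every sample of each lies within eps of some sample of the other.
    """
    def directed(src, dst):
        # Largest distance from a point of src to its nearest point of dst.
        return max(min(math.dist(p, q) for q in dst) for p in src)

    return max(directed(curve_a, curve_b), directed(curve_b, curve_a))
```

Taking the maximum of the two directed distances is what makes the measure symmetric: a short arc lying on a long curve is close to it in one direction only.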

Topological planes were discussed by Hilbert in his seminal book Foundations 
of Geometry [7]. There, he gave the first example of a topological Euclidean plane 
in which Desargues’ theorem failed to hold. F. R. Moulton subsequently gave a 
simpler example, now known as the Moulton plane [13], which was incorporated 
into later versions of Hilbert’s book. 

More recently, H. R. Salzmann [16] studied topological planes and their various 
axiomatizations. Among other results, he proved that, with hypotheses even 
weaker than the above conditions, L(Q) will always be homeomorphic to the space 
of points of the projective plane P². He also showed that the fourth condition, that
l ∩ l′ vary continuously as a function of l and l′, is a consequence of the third
condition, that l(p, q) vary continuously as a function of p and q, and vice versa.
Many other interconnections among properties of topological planes are given in 
Salzmann’s paper. 

Grünbaum’s question can now be reformulated as follows: Given an arrangement
𝒜 consisting of a finite number of pseudolines in the projective plane, is
there some topological projective plane Q containing 𝒜, i.e., a plane such that
𝒜 ⊂ L(Q)? If 𝒜 were stretchable, this would be trivially true: take an isomorphic
arrangement of straight lines, consider it as embedded in P², and use the
homeomorphism of P² that straightens the pseudolines of 𝒜 to define a new
topological plane structure on P² in which the “lines” are simply the inverse
images of the straight lines of P². But in general, if 𝒜 is not stretchable, no such
argument is available. Nevertheless, we will answer the question affirmatively in 
Section 2 below, by showing how to extend any arrangement of pseudolines to a 
topological projective plane; the solution will turn out, in fact, to be surprisingly 
simple. 

Grünbaum asked yet another, more sweeping, question in “Arrangements and
Spreads”. Assuming that every finite arrangement of pseudolines can be embedded
in some topological projective plane, is there a single topological plane that
contains every finite arrangement of pseudolines up to isomorphism? Another way
of posing this question is to extend our notion of stretchability. Recall that an
arrangement of pseudolines is stretchable if it is isomorphic to some arrangement
of lines in P². For a topological projective plane Q, let us call an arrangement 𝒜
of pseudolines in the projective plane stretchable in Q if it is isomorphic to some
arrangement of pseudolines 𝒜′ ⊂ L(Q). Then Grünbaum’s question becomes: Is
there some topological plane Q such that every arrangement of pseudolines is 
stretchable in Q? 

We will show in Section 3, using the embedding theorem of Section 2, that the 
answer is again “yes”. Grünbaum called such a topological plane, whose existence
he conjectured, a universal topological plane. 


2. FROM ARRANGEMENTS TO TOPOLOGICAL PLANES. One obvious ap- 
proach to extending arrangements to topological planes is the repeated use of the 
Levi Enlargement Lemma, adding new pseudolines one at a time in an infinite 
process. This would only generate a countably infinite family of pseudolines, 
however, so one must then “complete” this set by taking some sort of limit. The
problem with such an approach is that one may unwittingly introduce discontinu- 
ities in taking this limit. While it is conceivable that such a technique may work, no 
one, to our knowledge, has been able to give a construction along these lines. 

Instead of adding pseudolines one at a time, our method will be to define 
pseudolines piecewise in different regions of the plane, and then to link the pieces 
together. As previously noted, an arrangement 𝒜 of pseudolines splits the
projective plane into faces. We will construct the topological plane by defining all 
the “pseudoline segments” traversing a given face and then showing how to join 
these segments to form pseudolines with the desired properties. 

To carry out our construction, we need a simple fact about stretchability, proved 
first by J. Richter-Gebert [14] in the “uniform” case, i.e., where no three pseudolines
are concurrent:


Lemma 1. Let 𝒜 be an arrangement of n pseudolines. If some face of 𝒜 is
bounded by at least n − 1 pseudolines, then 𝒜 is stretchable, i.e., isomorphic to an
arrangement of straight lines.


The lemma is proved, without any assumption of uniformity, at the end of this 
section. 

We now proceed with our construction. Let 𝒜 be any arrangement of n
pseudolines in P². Fix some distinguished pseudoline l* ∈ 𝒜. This pseudoline will
play a role in the topological projective plane similar to the role of the line at
infinity in the standard model of P².

For each face f of the arrangement 𝒜, let L_f be the set of pseudolines
bounding f. By Lemma 1, L_f ∪ {l*} is stretchable. Let h_f be a homeomorphism of
the projective plane to itself that maps the pseudolines in L_f ∪ {l*} to straight
lines (notice that it is possible that L_f ∪ {l*} = L_f). For each pair of distinct
points p and q lying on different segments of ∂f, there is a straight line segment α
in h_f(f) connecting h_f(p) to h_f(q). h_f⁻¹(α) is then an arc in f with endpoints p
and q; see Figure 3. (As previously described, the arrangement 𝒜 can be
visualized as a set of Jordan arcs connecting antipodal pairs of points in a disk
where l* maps to the circle bounding the disk. Of course the “straight” lines
shown in Figure 3b are not really straight in the disk model; they can, however, be
taken to be arcs of circles joining antipodal pairs on the disk boundary; see [6].)

Figure 3a. Original face f. Figure 3b. A straightening of f.


Let Γ be the set of all such arcs over all faces of 𝒜; the members of Γ form the
“pieces” of the pseudolines we are going to construct.

Let γ be some arc of Γ lying in face f. h_f(γ) is a line segment which is part of
some straight line l meeting h_f(l*) at a point r “at infinity”. Let s ∈ l* be h_f⁻¹(r),
and label γ with the point s. (One can think of s as the “slope” of arc γ even
though γ may be far from linear, and even though this “slope” depends on the
chosen straightening homeomorphism h_f.) Label every arc γ ∈ Γ in this manner,
and notice that if p is a point on l*, then every arc with endpoint p has label p. If
p is a point on the interior of some edge of f, where p ∈ l and l ∈ L_f and p ∉ l*,
then every point of l* \ l occurs exactly once as a label of an arc in f with
endpoint p. Finally, if p is a vertex of f, where p ∈ l ∩ l′ and l, l′ ∈ L_f and
p ∉ l*, then the labels of arcs in f with endpoint p form an arc on l* between
l ∩ l* and l′ ∩ l*.

The arcs in Γ can now be linked together to form pseudolines. For every point
p lying on some pseudoline l ∈ 𝒜 \ {l*}, let Γ_p be the set of arcs in Γ with
endpoint p. Then for every arc γ ∈ Γ_p with label s there is exactly one other arc
γ′ ∈ Γ_p on the other side of l, with the same label. Join these two arcs to form a
longer arc, and continue. Repeating this for every point p lying on some pseudoline
other than l*, we get a set Γ′ of arcs. We claim that L(Q) = Γ′ ∪ 𝒜 is the set
of pseudolines of a topological plane Q.

For the proof, it is useful to refer again to the disk model described above, in
which each point s ∈ l* is replaced by an antipodal pair s⁺, s⁻ on the circle ∂Δ
bounding a disk Δ, and a pseudoline that intersects l* at s becomes a curve with
endpoints s⁺ and s⁻.

We first show that in this disk model the endpoints of every l ∈ Γ′ constitute an
antipodal pair in ∂Δ. Start with any arc γ_1 ∈ Γ that forms a segment of l. γ_1 has
some label s. Arc γ_1 is separated from s⁺ by some k < n pseudolines of 𝒜. One
of the two arcs joined to γ_1 is therefore separated from s⁺ by only k − 1
pseudolines; let this arc be γ_2. γ_2 also has label s. Repeating this argument k
times gives an arc γ_{k+1} with label s which has endpoint s⁺; thus l has s⁺ as one
endpoint. A similar argument shows that s⁻ is the other endpoint of l.

If two arcs have the same endpoints on ∂Δ, i.e., the same labels, then it is clear
that they can never intersect in the interior of Δ. On the other hand, it is also clear
that two arcs in L(Q) cannot intersect infinitely often, since they can meet at most
once inside each face of 𝒜. To prove that they intersect exactly once, we first
establish a general lemma about arcs intersecting in the disk (cf. [4]). 
Let l and l′ be any two arcs in Δ connecting distinct antipodal endpoints s_1, s_3
and s_2, s_4, respectively; l and l′ may intersect at more than one point. Let p be an
isolated point of intersection of l and l′ at which they cross. In other words, there
is some small topological disk Δ* containing p and no other point of intersection
of l and l′. Arcs l and l′ intersect ∂Δ* at four points, s_1*, s_2*, s_3*, and s_4*, with s_i*
lying between p and s_i on l or l′. We say that p is a proper intersection point of l
and l′ if s_1, s_2, s_3, and s_4 occur in the same order around Δ (clockwise or
counterclockwise) as s_1*, s_2*, s_3*, and s_4* do around Δ*. (See Figure 4.)


Figure 4. p is a proper intersection point; q is not.


The following lemma replaces the global condition that arcs intersect at pre- 
cisely one point, at which they cross, by the local condition that every point of 
intersection is proper. 


Lemma 2. Two arcs connecting distinct antipodal points in the disk that intersect
finitely often intersect at precisely one point, and cross there, if and only if every point 
of intersection of the two arcs is proper. 


Proof: Since the endpoints of each arc are antipodal, the first arc separates the 
endpoints of the second and thus the arcs must have at least one intersection 
point. If there is only one, that intersection must clearly be proper. On the other 
hand, if our two pseudolines l and l′ intersect at more than one point, we can list
the points of intersection in order along l. Let p and q be two successive points of
intersection. Then it follows immediately from the definition that if p is proper, q
is not. □


We now prove that every two arcs in L(Q) intersect exactly once. Let l, l′
∈ L(Q) be two arcs connecting distinct antipodal endpoints s_1, s_3 and s_2, s_4,
respectively. Suppose p is an intersection point of l and l′ lying in the interior of a
face f. By construction, l and l′ meet the boundary of f in four points, s_1*, s_2*, s_3*,
and s_4*, where s_i* lies between s_i and p on l or l′. The order of s_1, s_2, s_3, and s_4
around Δ agrees with the order of h_f(s_1), h_f(s_2), h_f(s_3), and h_f(s_4) around h_f(Δ).
Similarly, the order of s_1*, s_2*, s_3*, and s_4* around f agrees with the order of h_f(s_1*),
h_f(s_2*), h_f(s_3*), and h_f(s_4*) around h_f(f). By construction, the intersection of
h_f(l) and h_f(l′) in h_f(f) is proper; hence the intersection of l and l′ in f must
also be proper. A similar argument holds if p lies on the boundary of a face f.
Thus, by Lemma 2, every two arcs in L(Q) intersect exactly once.

For every point p ∈ P² and every s ∈ l*, there is a unique pseudoline in L(Q)
passing through p and s. This pseudoline varies continuously as a function of s,
sweeping over the whole projective plane as s runs through l*. Hence, for any
p, q ∈ P², there is some pseudoline in L(Q) containing p and q. If two distinct
pseudolines l, l′ ∈ L(Q) both contained p and q, then l and l′ would intersect
more than once. Thus there must be a unique pseudoline l(p, q) ∈ L(Q) containing
p and q.

Finally, the continuity conditions on l(p, q) and l ∩ l′ follow from the fact that
continuity is a local property, and that locally our pseudolines are nothing but 
homeomorphic images of lines. 

We have thus proved 


Theorem 1. Every arrangement of pseudolines in the projective plane can be extended 
to a topological projective plane. 


We conclude this section with the proof of Lemma 1 promised above. 


Proof of Lemma 1: Let 𝒜 be an arrangement of n pseudolines, at least n − 1 of
which bound a face f of the arrangement, and let l_n be the nth pseudoline (or any
one if all n bound f). l_n is stretchable, so there is some homeomorphism of the
projective plane to itself that maps l_n to the line at infinity, l_∞. Thus, without loss of
generality, we may assume that l_n = l_∞. Let {l_1, l_2, ..., l_{n−1}} be the remaining set of
pseudolines, 𝒜 \ {l_n}.

If we remove l_∞ from the projective plane, we are left with a Euclidean plane,
which we assume coordinatized. Each point l_i ∩ l_∞ can now be identified with
some slope s_i in this plane. Without loss of generality, we can assume none of the
slopes s_i is infinite. For each l_i, let l′_i be the line with slope s_i tangent to the unit
circle, with l′_i chosen so that it passes above the unit circle if and only if l_i passes
above f; see Figure 5. (l_i passes above f if there exists some suitably large y_0 such
that l_i separates f from (0, y) for all y > y_0.)
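
The tangent lines l′_i in this construction are easy to write down explicitly. As a small sketch, assuming each line is written as y = s·x + c (the function names and this parametrization are illustrative, not the paper's), the line of slope s tangent to the unit circle has intercept c = ±√(1 + s²), with the sign chosen according to whether the line is to pass above or below the circle.

```python
import math


def tangent_intercept(slope, above):
    """Intercept c of the line y = slope * x + c tangent to the unit circle.

    `above` selects the tangent passing above the circle (c > 0)
    or below it (c < 0).
    """
    c = math.sqrt(1.0 + slope * slope)
    return c if above else -c


def distance_from_origin(slope, intercept):
    """Distance from the origin to the line y = slope * x + intercept."""
    return abs(intercept) / math.sqrt(1.0 + slope * slope)
```

Since every such line is at distance exactly 1 from the origin, none of them cuts the open unit disk, so the disk lies inside a single face bounded by all the lines, mirroring the role of the face f.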


We claim that the pseudoline arrangement {l_1, l_2, ..., l_{n−1}} is isomorphic to the
straight line arrangement {l′_1, l′_2, ..., l′_{n−1}} in the Euclidean plane. Clearly this will
imply that the arrangements {l_1, l_2, ..., l_{n−1}, l_n} and {l′_1, l′_2, ..., l′_{n−1}, l_∞} are isomorphic
in the projective plane.

The proof is by induction on the number of lines in the arrangements. It is
trivially true for the arrangements {l_1} and {l′_1}. Assume the arrangement 𝒜_k =
{l_1, l_2, ..., l_k} is isomorphic to the arrangement 𝒜′_k = {l′_1, l′_2, ..., l′_k}. Consider
what happens when we add l_{k+1} and l′_{k+1} to 𝒜_k and 𝒜′_k, respectively. Without
loss of generality, assume l_{k+1} lies below face f. Orient l_{k+1} so that face f lies to
its left. Pseudoline l_{k+1} first intersects the pseudolines of 𝒜_k lying above f whose
slope is greater than s_{k+1}, in order of increasing slope. l_{k+1} then intersects the
pseudolines of 𝒜_k lying below f in order of increasing slope. Finally, l_{k+1}
intersects the pseudolines of 𝒜_k lying above f whose slope is less than s_{k+1}, in
order of increasing slope. (If some pseudoline above f is “parallel” to l_{k+1}, we can
consider it to be met by l_{k+1} after the rest.)

Figure 5a. Pseudolines around a face. Figure 5b. Lines around the unit circle.

By construction, l′_{k+1} lies below the unit circle; orient it so that the unit circle
lies to its left. Then l′_{k+1} also first intersects lines above the unit circle with slope
greater than s_{k+1}, then lines below the unit circle, and finally the lines above the
unit circle with slope less than s_{k+1}. The order in which l′_{k+1} meets them is also by
increasing slope. Since l_i passes above f if and only if l′_i passes above the unit
circle, and since the slopes of l_i and l′_i are equal, l_{k+1} and l′_{k+1} intersect
corresponding pseudolines and lines in the same order. It follows that l_{k+1} and
l′_{k+1} split corresponding faces in 𝒜_k and 𝒜′_k in exactly the same manner.


Therefore the arrangements 𝒜_{k+1} = {l_1, l_2, ..., l_k, l_{k+1}} and 𝒜′_{k+1} =
{l′_1, l′_2, ..., l′_k, l′_{k+1}} are isomorphic, and hence 𝒜 \ {l_n} is isomorphic to the
arrangement {l′_1, l′_2, ..., l′_{n−1}} of straight lines in the Euclidean plane. □


3. UNIVERSAL TOPOLOGICAL PLANES. Recall that an arrangement 𝒜 of
pseudolines is stretchable in Q if it is isomorphic to some arrangement of
pseudolines 𝒜′ ⊂ L(Q). In the previous section we proved that for every arrangement
𝒜 there is some topological projective plane Q in which 𝒜 is stretchable.
We will now show that there is a topological plane in which every arrangement is 
stretchable. 

We first need to introduce a technique for “patching” parts of one topological 
plane into another. Let Q and Q′ be two topological planes. Let l_1, l_2, and l_3 be
pseudolines in Q that are not concurrent. l_1, l_2, and l_3 then decompose Q into
four closed regions which we call triangles. Let τ be any one of these triangles. The
vertices of triangle τ are the points l_1 ∩ l_2, l_1 ∩ l_3, and l_2 ∩ l_3; see Figure 6.

Similarly, choose three pseudolines in Q′ that are not concurrent, label them l′_1,
l′_2, and l′_3, and let triangle τ′ be one of the closed regions bounded by these three
pseudolines. Let φ be a homeomorphism of τ′ onto τ that maps the vertices of τ′
to the corresponding vertices of 7; again, this will always exist by virtue of 
Schoenflies’ theorem [12]. 

Define a new topological plane Q″, with a new set of pseudolines L(Q″) chosen
as follows. For each l ∈ L(Q), if l ∩ int(τ) = ∅, then let l belong to L(Q″).
Otherwise, l ∩ τ is a connected arc with two endpoints, p and q. Let l′ be the
unique pseudoline in Q′ passing through φ⁻¹(p) and φ⁻¹(q). Replace l ∩ τ by
φ(l′ ∩ τ′) to form a new pseudoline l″, and let l″ belong to L(Q″).

Figure 6a. Triangle τ. Figure 6b. Triangle τ′.

Figure 6c. Topological plane Q″.

We must show that every two distinct pseudolines l″_1, l″_2 ∈ L(Q″) intersect
exactly once. But this is immediate from the fact that outside τ nothing has been
altered, while inside τ two pseudolines meet if and only if their intersections with
∂τ interlace, a property which is preserved by the homeomorphism φ.
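
The interlacing condition is purely a statement about cyclic order on the boundary, which is why a boundary-respecting homeomorphism preserves it. A minimal sketch, assuming the four endpoints are distinct and are given by their positions along the boundary curve (for instance as angles or indices); the name interlace is a hypothetical helper:

```python
def interlace(chord_a, chord_b):
    """Return True when the endpoints of two chords alternate around a closed curve.

    Each chord is a pair of positions along the boundary; the four positions
    are assumed to be distinct.
    """
    a1, a2 = sorted(chord_a)
    b1, b2 = chord_b

    def inside(x):
        # Does x lie on the boundary arc running from a1 to a2?
        return a1 < x < a2

    # The chords interlace exactly when chord_a separates the ends of chord_b.
    return inside(b1) != inside(b2)
```

For example, with boundary positions 0, 1, 2, 3 in cyclic order, the chords (0, 2) and (1, 3) interlace, while (0, 1) and (2, 3) do not.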

We can now prove 


Theorem 2. There exists a universal topological plane in which every arrangement of 
pseudolines is stretchable. 


Proof: Every arrangement of n pseudolines has at most (n choose 2) + 1 faces. Since the
question of whether two arrangements are isomorphic depends only on the 
combinatorial structure of their associated cell complexes, there are only a finite 
number of isomorphism classes of arrangements of n pseudolines. Thus the set of 
isomorphism classes of arrangements of pseudolines of arbitrary size is countable. 
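
The face bound quoted at the start of the proof can be recovered from Euler's formula in the case where no three pseudolines are concurrent. The sketch below assumes such a simple arrangement of n ≥ 2 pseudolines and uses the fact that the Euler characteristic of the projective plane is 1; the function name is illustrative.

```python
from math import comb


def faces_of_simple_arrangement(n):
    """Number of faces cut out of the projective plane by n >= 2 pseudolines,
    no three of which are concurrent."""
    vertices = comb(n, 2)         # every pair of pseudolines crosses exactly once
    edges = n * (n - 1)           # each pseudoline is a circle cut into n - 1 edges
    return edges - vertices + 1   # V - E + F = 1 in the projective plane
```

For n = 3 this gives 4, matching the four triangles determined by three non-concurrent pseudolines in Section 3. Allowing multiple points can only merge faces, which is consistent with the words “at most” in the proof.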

Let 𝒜_1, 𝒜_2, 𝒜_3, ... be a sequence of pseudoline arrangements such that every
arrangement is isomorphic to some 𝒜_i. Let Q_0 be any topological projective plane
(for example P²), and let τ_1, τ_2, τ_3, ... be a sequence of pairwise disjoint triangles
in Q_0. For each i > 0, construct the topological plane Q_i from the topological
plane Q_{i−1} as follows. Embed 𝒜_i in some topological plane Q′, using Theorem 1.
Let τ′ be some triangle in Q′ containing all the intersection points of the
pseudolines in 𝒜_i. (The existence of such a triangle τ′ follows by continuity, since
we can always find a pseudoline avoiding all the intersection points in 𝒜_i and
then take two other pseudolines sufficiently close to the first so that all the
intersection points remain within a single region.) Replace triangle τ_i in Q_{i−1} by
triangle τ′ using the patching technique described above, to form the topological
plane Q_i.

Let Q_∞ be the topological plane which is the limit of the topological planes Q_i.
The pseudolines L(Q_∞) are formed from the original pseudolines in Q_0 by a
(possibly infinite) sequence of local replacements. If two pseudolines l, l′ ∈ L(Q_∞)
intersected at two distinct points, p and q, then some corresponding pseudolines l_i
and l′_i in some topological plane Q_i would also intersect at p and q, which is
impossible. Thus Q_∞ is a topological plane. (The continuity conditions again follow
as above, since they hold locally.) 

Since all the intersections of pseudolines in 𝒜_i occurred inside τ′, 𝒜_i is
isomorphic to some arrangement 𝒜′_i ⊂ L(Q_i). Because the triangle τ_i is never
modified after the ith stage of the construction, it follows that 𝒜_i is isomorphic to
some arrangement in every Q_j, j ≥ i, including Q_∞. Thus every arrangement is
stretchable in Q_∞, i.e., Q_∞ is a universal topological plane. □


Even though any universal topological plane contains all pseudoline arrange- 
ments, up to isomorphism, not all universal topological planes are isomorphic. In 
fact, using the techniques above, different choices of triangles and patchings may 
be shown to lead to uncountably many non-isomorphic universal planes [3], 
answering another question posed in [6]. 


4. OTHER DIRECTIONS. If we start with an arrangement of straight lines in the 
projective plane and let h_f be the identity map for every face f, then our
construction will generate the standard projective plane P², with L(P²) consisting
of the usual straight lines. If h_f is not the identity, however, then even starting
with a straight line arrangement we can generate a topological plane that is not 
isomorphic to the usual one. 

In some sense, however, our construction does not generate topological planes 
whose pseudolines are too different from those in our original arrangement. An 
arrangement is called k-piecewise linear if each pseudoline in the arrangement is 
the union of at most k line segments. For any k, it turns out that there are 
pseudoline arrangements that are not isomorphic to any k-piecewise linear ar- 
rangement. (This can be shown, for example, by bounding the number of k-piece- 
wise linear arrangements of n pseudolines using the Milnor-Thom theorem [9, 18] 
on the Betti numbers of solution sets of polynomial inequalities of a semi-algebraic 
set, and using the lower bound proved in [2] on the number of isomorphism classes 
of arrangements of n pseudolines.) On the other hand, if we start with an
arrangement 𝒜 of n pseudolines and construct a topological plane Q by our
methods, then any arrangement 𝒜′ ⊂ L(Q) can be shown to be isomorphic to some
k-piecewise linear arrangement, where k depends only on n, the number of
pseudolines in the original arrangement 𝒜; this fact plays an essential role in the
proof of [3]. 

In [6], Grünbaum discusses an object intermediate between an arrangement of
pseudolines and a topological plane. A spread of pseudolines is a 1-parameter
family of pseudolines, any two meeting once, with the property that every point
s ∈ l_∞ has a unique pseudoline through it which varies continuously as a function
of s. In [6], Grünbaum asked if every arrangement can be extended to a spread, a
question that we answered affirmatively in [4]. The much stronger result proved in
Theorem 1 above is easily seen to imply that proposition. But our methods do not 
give any insight into the possibility of extending a spread to a topological plane, 
and this seems to be an intriguing question. 

Finally, arrangements of pseudolines can be generalized to arrangements of 
pseudoplanes in dimension 3 (or of pseudohyperplanes in arbitrary dimension for 
that matter). These pseudoplanes should be “nicely” embedded in P³, every two
should intersect in a pseudoline of each, and every three should intersect in a 
single point. Can any arrangement of pseudoplanes be embedded in a continuous 
3-parameter family of pseudoplanes, some sort of topological 3-space analogous to 
the topological space of planes in $P^3$? 

Just as we demanded of the points and pseudolines of a topological projective 
plane, the points, pseudolines and pseudoplanes of a topological projective 3-space 
should obey the standard incidence axioms of geometry, and their intersections 
should vary continuously. Desargues’ theorem, however, now turns out to be a 
direct consequence of these conditions, instead of being an independent axiom as 
it was in the plane [7]. Since any such geometry in which Desargues’ theorem holds 
is isomorphic to the standard one, any such topological projective 3-space turns out 
to be isomorphic to the usual $P^3$. On the other hand, there are certainly 
arrangements of pseudoplanes that are not isomorphic to arrangements of ordinary 
planes. This shows that there is no straightforward generalization of Theorem 1 
possible to arrangements of pseudoplanes. 
Misunderstanding 
Ah, you are a mathematician, 
they say with admiration 
Or Scorn. 
Then, they say, 
I could use you 
to balance my checkbook. 
I think about checkbooks. 
Once in a while 
I balance mine, 
just like sometimes 
I dust high shelves. 
From Intersections: Poems by JoAnne Growney, 
Kadet Press, Bloomsburg, PA, 1993, p. 50. 
An Application of Fourier Series to 
the Most Significant Digit Problem 


Jeff Boyle 


It is an old observation [N] that in tables of logarithms, the first few pages are 
more smudged and worn than the later pages. It follows that the distribution of the 
first significant digit of the numbers being looked up in the table is not uniform, 
but skewed toward the smaller digits. In fact, many sets of naturally arising 
numbers exhibit this asymmetry, with approximately 30% of the numbers 
beginning with the digit 1 and less than 5% beginning with the digit 9. Their first 
significant digit follows a logarithmic distribution, namely, the fraction of numbers 
whose first significant digit is n is approximately $\log(n + 1) - \log n$, $n = 1, 2, \ldots, 9$ (logarithms to base 10). 
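
As a quick numerical illustration of the logarithmic distribution just stated, the following sketch tabulates $\log(n+1) - \log n$ for the nine possible first digits. The function name `benford_probability` is chosen here for illustration and does not come from the article.

```python
import math


def benford_probability(digit: int) -> float:
    """Fraction of numbers whose first significant digit is `digit` under the log law."""
    if not 1 <= digit <= 9:
        raise ValueError("digit must be between 1 and 9")
    # log is taken to base 10, as in the text
    return math.log10(digit + 1) - math.log10(digit)


# The nine probabilities telescope to log10(10) - log10(1) = 1.
benford_table = {d: benford_probability(d) for d in range(1, 10)}
```

The entry for the digit 1 is about 0.301 and the entry for 9 is about 0.046, matching the "approximately 30%" and "less than 5%" figures quoted above.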

This phenomenon is known as Benford’s Law after Frank Benford, one of the 
first people to call attention to it. Indeed, he compiled a variety of data totalling 
20,299 observations ranging from populations of cities to mathematical tables of 
powers of integers. The first significant digits in many (but not all) of his data sets 
were roughly logarithmically distributed. 

Since Benford’s Law is an empirically observed law of nature, rather than a 
theorem of pure mathematics, there are several competing explanations for it. This 
phenomenon of leading digits has been extensively investigated and enjoys a 
colorful history. For an interesting survey of this problem and its literature, see the 
expository article in the Monthly, August-September, 1976 by Ralph A. Raimi [R1]. 

In all that follows we will write a positive number x in scientific notation, 
$x = m_x \cdot 10^{n_x}$, where $1 \le m_x < 10$ and $n_x \in \mathbb{Z}$. For convenience we will refer to 
$m_x$ as the mantissa of x. Of course, the most significant digit of x is $\lfloor m_x \rfloor$, so it is 
sufficient to understand the distribution of $m_x$. One explanation of Benford’s Law 
is that it results from “central limit-like” theorems for the mantissas of random 
variables under multiplicative operations. For example, the figures below show the 
density functions for the mantissa of the product of 1, 2, 3 and 4 independent and 
uniformly distributed random variables on [1,10]. The density functions move 
progressively closer to the log distribution’s density function $f(t) = 1/(t \ln 10)$. 

Frequently, naturally occurring data can be thought of as a result of products or 
quotients of random variables. For example, the population of a city tends to 
change roughly at a rate proportional to the population. If a city has an initial 
population $P_0$ and grows by $r_i$% in year i, then the population in n years is 
$P_n = P_0(1 + r_1)(1 + r_2) \cdots (1 + r_n)$, a product of a number of random variables. 
In this case, as we'll see, a set of populations of cities would follow Benford’s law. 

More specifically, we’ll show for mantissas: 


(i) the log distribution is the limiting distribution when random variables are 
repeatedly multiplied, divided, or raised to integer powers, 

(ii) once achieved, the log distribution persists under all further multiplications, 
divisions, and raising to integer powers. 
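
Claim (i) can be explored empirically. The sketch below is a Monte Carlo experiment, not the article's method: it multiplies independent uniform random variables on [1, 10] and records how often each first digit occurs. The function names, the sample size and the seed are illustrative assumptions.

```python
import math
import random


def first_digit(x: float) -> int:
    """Most significant digit of a positive number, read off its mantissa."""
    exponent = math.floor(math.log10(x))
    mantissa = x / 10 ** exponent
    # Guard against floating-point rounding at decade boundaries.
    if mantissa >= 10:
        mantissa /= 10
    if mantissa < 1:
        mantissa *= 10
    return min(9, int(mantissa))


def digit_frequencies(num_factors: int, samples: int, seed: int = 0) -> list:
    """Empirical first-digit frequencies (index 1..9) for products of uniform variables on [1, 10]."""
    rng = random.Random(seed)
    counts = [0] * 10
    for _ in range(samples):
        product = 1.0
        for _ in range(num_factors):
            product *= rng.uniform(1.0, 10.0)
        counts[first_digit(product)] += 1
    return [c / samples for c in counts]
```

With a single factor the digit 1 appears about one time in nine; with four factors its frequency is already close to log 2, in line with the densities described for Figure 1.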
Figure 1. The p.d.f. for the mantissa of the products of 1, 2, 3, and 4 independent and uniformly 
distributed random variables on [1, 10]. 


These facts have been explained to varying degrees by many other authors. Our 
purpose here is to show how these ideas can be neatly explained using Fourier 
Series. 

The first step is to make the connection between mantissas under multiplication 
and modular arithmetic on [0, 1] via logarithms. If random variables $x = m_x \cdot 10^{n_x}$ 
and $y = m_y \cdot 10^{n_y}$ are multiplied, the mantissa $m_{xy}$ of the product is 

$$m_{xy} = \begin{cases} m_x m_y & \text{if } 1 \le m_x m_y < 10, \\ m_x m_y / 10 & \text{if } 10 \le m_x m_y < 100. \end{cases}$$

This relationship is more easily studied logarithmically: 

$$\log(m_{xy}) = \log(m_x) + \log(m_y) \pmod 1. \tag{1}$$

Similarly, for division and raising to a power n, 

$$\log(m_{x/y}) = \log(m_x) - \log(m_y) \pmod 1, \tag{2}$$
$$\log(m_{x^n}) = n \cdot \log m_x \pmod 1. \tag{3}$$
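
Relations (1), (2) and (3) are easy to check numerically. In the sketch below, `log_mantissa` and `circle_distance` are helper names introduced here; the second measures distance on the circle [0, 1) mod 1 so that values just below 1 and just above 0 count as close.

```python
import math


def log_mantissa(x: float) -> float:
    """log10 of the mantissa of x > 0, i.e. the fractional part of log10(x)."""
    return math.log10(x) % 1.0


def circle_distance(a: float, b: float) -> float:
    """Distance between two points of [0, 1) when the interval is wrapped into a circle."""
    d = abs(a - b) % 1.0
    return min(d, 1.0 - d)


def product_rule_error(x: float, y: float) -> float:
    """Discrepancy in relation (1): log m_xy versus log m_x + log m_y (mod 1)."""
    return circle_distance(log_mantissa(x * y), (log_mantissa(x) + log_mantissa(y)) % 1.0)


def quotient_rule_error(x: float, y: float) -> float:
    """Discrepancy in relation (2): log m_{x/y} versus log m_x - log m_y (mod 1)."""
    return circle_distance(log_mantissa(x / y), (log_mantissa(x) - log_mantissa(y)) % 1.0)


def power_rule_error(x: float, n: int) -> float:
    """Discrepancy in relation (3): log m_{x^n} versus n * log m_x (mod 1)."""
    return circle_distance(log_mantissa(x ** n), (n * log_mantissa(x)) % 1.0)
```

For instance, `product_rule_error(37.5, 0.0042)` is zero up to floating-point rounding, even though the product crosses a decade boundary.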


An illuminating way to think about what is really going on is via the circular 
slide rule. The slide rule multiplies numbers (mantissas) by mechanically adding 
their logarithms ($2 \cdot 3 = 10^{.301} \cdot 10^{.477} = 10^{.778} = 6$). This can only work for all 
multiplications if the logarithms are uniformly spread around the slide rule wheel. 
One can easily show that $m_x$ has a log distribution $F(m) = P(m_x \le m) = \log m$, if 
and only if $\log m_x$ has a uniform distribution on [0, 1]. 

Figure 2. Numbers are not uniformly spaced around the C-scale but their logarithms on the L-scale are 
uniformly spaced. 

Now think of the circular slide rule as a roulette wheel. When we successively 
multiply random variables we are giving the “roulette wheel” a succession of 
random spins. No matter how unfairly we try to spin the wheel (assuming 
successive spins are independent), after a number of such spins are combined, the 
position of the wheel (as a random variable) is approximately uniformly dis- 
tributed. 

This approach to the log distribution has been widely studied. In a largely 
overlooked passage of his textbook, An Introduction to Probability Theory and 
its Applications [F], William Feller shows that under the most general conditions, the uniform 
distribution is the limiting distribution for sums of random variables taken mod 1. 
His method does not use Fourier Series and gives no rate of convergence. Other 
authors [T], [AS] have shown the log distribution is the limiting distribution for the 
mantissa of products for particular initial distributions, usually a uniform distribu- 
tion on [1, 10]. See also [H] and [BB]. As far as I know, the Fourier Series approach 
was first applied by Furry and Hurwitz [FH] and more thoroughly exploited by 
Schatte [S1]. See also [M]. The approach we will use is essentially the same as [S1] 
although we generalize his results to include divisions and exponentiations. Al- 
though not necessary, for ease of exposition, we will assume our random variables 
are continuous on [0,1] and have a density function. In light of the earlier 
discussion, the following theorem implies that very generally, the log distribution is 
the limiting distribution for the first significant digit when continuous random 
variables are repeatedly multiplied. 


Theorem 1. Let $x_1, \ldots, x_k$ be independent and identically distributed continuous 
random variables on [0,1] with common density function $f \in L^2[0,1]$, and let 
$z = \sum_{i=1}^{k} x_i \pmod 1$. Then as $k \to \infty$, z approaches a uniform distribution. 


Actually, the above theorem is true under quite general circumstances. All that 
is required of the independent and identically distributed random variables $x_i$ is 
that they are not discrete with their probability concentrated on a finite cyclic 
subgroup of [0, 1] mod 1. (See [F], [S1], or [M].) These authors also show that if the 
$x_i$’s do have their probability concentrated on a finite cyclic subgroup of [0, 1] 
mod 1, with prob($x_i = 0$) > 0, then $\sum_{i=1}^{k} x_i \pmod 1$ approaches a discrete uniform 
distribution. 

Since we are doing mod 1 arithmetic, it will be convenient to assume the density 
functions are defined on all of $\mathbb{R}$ and are periodic of period 1. Hence if f(t) is such 
a density, 

$$\int_a^{a+1} f(t)\,dt = 1 \quad \text{for all } a \in \mathbb{R}.$$

The Fourier coefficients of f(t) are defined by 

$$\hat f(n) = \int_0^1 f(t)\, e^{-2\pi i n t}\,dt, \quad n \in \mathbb{Z}. \tag{4}$$

Since f(t) is real valued, $\hat f(-n) = \overline{\hat f(n)}$. If $f \in L^2[0,1]$, then the partial sums of 
$\sum_{n=-\infty}^{\infty} \hat f(n)\, e^{2\pi i n t}$ converge to f in the $L^2$ norm. This is good enough for 
computing probabilities since $L^2$ convergence on [0,1] implies $L^1$ convergence. In 
fact, an application of Hölder’s Inequality implies $\|f\|_1 \le \|f\|_2$ on [0,1]. The 
mapping $\mathcal{F}\colon L^2[0,1] \to \ell^2$ given by $f(t) \mapsto \{\hat f(n)\}_{n=-\infty}^{\infty}$ is a Hilbert space 
isomorphism, and consequently we have: 

Parseval’s Formula 

$$\int_0^1 |f(t)|^2\,dt = \sum_{n=-\infty}^{\infty} |\hat f(n)|^2.$$


Lemma 1. If $f \in L^2[0,1]$ is a probability density, then 
(a) $\hat f(0) = 1$, (b) $|\hat f(n)| < 1$ for $n \ne 0$, (c) $\sup_{n \ne 0} |\hat f(n)| < 1$. 

Proof: (a) Since f is a density, $\hat f(0) = \int_0^1 f(t) \cdot e^{0}\,dt = 1$. 

(b) $$|\hat f(n)| = \left| \int_0^1 f(t)\, e^{-2\pi i n t}\,dt \right| \le \int_0^1 \left| f(t)\, e^{-2\pi i n t} \right| dt = 1.$$

Equality could only hold if all the probability were concentrated on a finite 
cyclic subgroup of [0,1] mod 1. This is not possible since we are only considering 
continuous random variables. 

(c) As a consequence of Parseval’s Formula, $|\hat f(n)| \to 0$ as $|n| \to \infty$. Combining 
this with (b) yields (c). ∎ 


Now let x and y be continuous random variables on [0,1] with densities 
$f, g \in L^2[0,1]$, respectively. If $z = x + y \pmod 1$ and has density h, then 

$$h(t) = \int_0^1 f(u)\, g(t - u)\,du = f * g, \quad \text{the convolution of } f \text{ and } g.$$

There is a nice equation relating the Fourier coefficients of f, g, and h. 

Lemma 2. $\hat h(n) = \hat f(n) \cdot \hat g(n)$. 

Proof: Using the fact that both g and $e^{-2\pi i n t}$ are periodic of period 1, we have 

$$\begin{aligned}
\hat h(n) &= \int_0^1 h(t)\, e^{-2\pi i n t}\,dt = \int_0^1 \int_0^1 f(u)\, g(t - u)\,du\; e^{-2\pi i n t}\,dt \\
&= \int_0^1 \int_0^1 f(u)\, g(t - u)\, e^{-2\pi i n (t - u)} \cdot e^{-2\pi i n u}\,du\,dt \\
&= \int_0^1 f(u)\, e^{-2\pi i n u} \left[ \int_0^1 g(t - u)\, e^{-2\pi i n (t - u)}\,dt \right] du = \hat f(n)\, \hat g(n).
\end{aligned}$$


Proof of Theorem 1: Recall $x_1, \ldots, x_k$ are independent and identically distributed 
continuous random variables on [0,1] with common density $f \in L^2[0,1]$, and 
$z = \sum_{i=1}^{k} x_i \pmod 1$. If z has density h, then by Lemma 2, $\hat h(n) = \hat f(n)^k$. As 
remarked earlier, to show z approaches a uniform distribution on [0, 1] it suffices 
to show $h \to 1$ in the $L^2$-norm. Thus 

$$\|h - 1\|_2^2 = \int_0^1 (h(t) - 1)^2\,dt = \sum_{n \ne 0} |\hat h(n)|^2 = \sum_{n \ne 0} |\hat f(n)|^{2k} \le \Big( \sup_{n \ne 0} |\hat f(n)| \Big)^{2k-2} \cdot \sum_{n \ne 0} |\hat f(n)|^2.$$

Since 

$$\sup_{n \ne 0} |\hat f(n)| < 1 \quad \text{and} \quad \sum_{n \ne 0} |\hat f(n)|^2 < \infty,$$

we see that $\|h - 1\|_2 \to 0$ as $k \to \infty$. ∎ 
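
The geometric decay in this proof can be watched on a concrete density. The sketch below is an illustration chosen here, not an example from the article: it takes $f(t) = 1 + \sin(2\pi t)$, whose only nonzero Fourier coefficients with $n \ne 0$ are $\hat f(\pm 1)$ of modulus 1/2, so the proof predicts $\|h_k - 1\|_2 = \sqrt{2}/2^k$ for the density $h_k$ of the sum of k copies. The convolution is approximated by a Riemann sum on a grid of N points, which is exact for trigonometric polynomials of low degree.

```python
import math

N = 64  # number of grid points on [0, 1)


def sample(density):
    """Values of a period-1 density at the grid points j/N."""
    return [density(j / N) for j in range(N)]


def convolve_mod1(f, g):
    """Riemann-sum version of h(t) = integral over [0,1] of f(u) g(t - u) du, arguments taken mod 1."""
    return [sum(f[i] * g[(j - i) % N] for i in range(N)) / N for j in range(N)]


def l2_distance_from_uniform(h):
    """Discrete L2 norm of h - 1 on the grid."""
    return math.sqrt(sum((v - 1.0) ** 2 for v in h) / N)


f = sample(lambda t: 1.0 + math.sin(2 * math.pi * t))

# distances[k - 1] approximates ||h_k - 1||_2 for the sum of k independent copies.
distances = []
h = f[:]
for k in range(1, 7):
    distances.append(l2_distance_from_uniform(h))
    h = convolve_mod1(h, f)
```

Each additional summand halves the distance to the uniform density for this particular f.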


Before generalizing Theorem 1 we would like to mention a couple of other 
striking consequences of Lemma 2. There exist many non-uniformly distributed 
pairs of random variables on [0,1] whose sum mod 1 is exactly uniformly dis- 
tributed (or equivalently, there exist many pairs of random variables whose first 
significant digits are not logarithmically distributed, but the first significant digit of 
the product does have the log distribution). If x and y have densities $f, g \in L^2[0,1]$ 
with $\hat f(n) \cdot \hat g(n) = 0$ for all $n \ne 0$, then $z = x + y \pmod 1$ is uniformly distributed. 
For example, if $f(t) = 1 + \sin(2\pi t)$ and $g(t) = 1 + \sin(4\pi t)$, then z is uniform. 
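
The example with $1 + \sin(2\pi t)$ and $1 + \sin(4\pi t)$ can be confirmed numerically. The following sketch approximates the mod 1 convolution by a Riemann sum on a uniform grid (an assumption of this illustration; the grid size 64 is arbitrary) and measures how far the result is from the constant density 1.

```python
import math

N = 64  # number of grid points on [0, 1)


def sample(density):
    """Values of a period-1 density at the grid points j/N."""
    return [density(j / N) for j in range(N)]


def convolve_mod1(f, g):
    """Riemann-sum version of the convolution of two period-1 densities."""
    return [sum(f[i] * g[(j - i) % N] for i in range(N)) / N for j in range(N)]


f = sample(lambda t: 1.0 + math.sin(2 * math.pi * t))
g = sample(lambda t: 1.0 + math.sin(4 * math.pi * t))
h = convolve_mod1(f, g)

# Largest deviations from the uniform density, before and after convolving.
deviation_f = max(abs(v - 1.0) for v in f)
deviation_g = max(abs(v - 1.0) for v in g)
deviation_h = max(abs(v - 1.0) for v in h)
```

Neither f nor g is close to uniform, yet `deviation_h` is zero up to rounding, because the two densities have no nonzero Fourier frequency in common.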

This leads to another observation, the invariance of the log distribution for first 
significant digits of products. Suppose x, y, z, f, g, and h are as in Lemma 2. 
Since $|\hat g(n)| < 1$ for all $n \ne 0$, $|\hat h(n)| \le |\hat f(n)|$. If equality does not hold for some 
n, then $\|h - 1\|_2 < \|f - 1\|_2$, and so z is strictly closer to being uniform than x. 
This implies that the convergence in Theorem 1 is “monotone”. On the other 
hand, if equality holds for all n, then $\hat f(n) = 0 = \hat h(n)$ for all $n \ne 0$, and $f(t) = 1 = h(t)$, 
i.e., x and z are uniform. Therefore if $z = \sum_{i=1}^{k} x_i \pmod 1$, and at least 
one of the $x_i$’s is uniformly distributed, then z is uniformly distributed. This can 
be proved directly without using Fourier coefficients. See [H] for example. There is 
an amusing discrete analog to this result. Suppose one tosses a pair of dice, one of 
which is fair, and the other is of unknown nature. What can be said about the sum 
mod 6? Regardless of the nature of the second die the sum mod 6 must have 
uniform probability on 0,1,...,5! 
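
The dice remark can be verified exactly with rational arithmetic. In this sketch the loaded die's probabilities are made up for illustration; any probabilities summing to 1 would do.

```python
from fractions import Fraction


def sum_mod6_distribution(die_a, die_b):
    """Exact distribution of (a + b) mod 6 for independent dice with faces 1..6.

    Each die is a list of six probabilities, one per face.
    """
    dist = [Fraction(0)] * 6
    for a, pa in enumerate(die_a, start=1):
        for b, pb in enumerate(die_b, start=1):
            dist[(a + b) % 6] += pa * pb
    return dist


fair = [Fraction(1, 6)] * 6
# A hypothetical, heavily loaded die.
loaded = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 8),
          Fraction(1, 16), Fraction(1, 32), Fraction(1, 32)]

fair_plus_loaded = sum_mod6_distribution(fair, loaded)
loaded_plus_loaded = sum_mod6_distribution(loaded, loaded)
```

Here `fair_plus_loaded` is exactly uniform on 0, 1, ..., 5, while `loaded_plus_loaded` is not, so the fairness of one die is what does the work.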

Next we would like to generalize Theorem 1 so that it applies to quotients and 
powers of random variables. Recall the connection between division and exponen- 
tiation with modular arithmetic on [0, 1] via logarithms (see (2) and (3)). We need 
an equation relating the Fourier coefficients of the densities of a random variable 
$x \in [0,1]$ and $z = mx \pmod 1$, $m \in \mathbb{Z}$. The proof of the following lemma is left to 
the reader. 

Lemma 3. Let x be a continuous random variable on [0,1] with density $f \in L^2[0,1]$, 
and let $z = mx \pmod 1$, $m \in \mathbb{Z}$. If h is the density for z, then 

(a) $\hat h(n) = \hat f(mn)$ for all $n \in \mathbb{Z}$, (b) if $m = -1$, $\hat h(n) = \hat f(-n) = \overline{\hat f(n)}$. 

Theorem 2. Let $x_1, \ldots, x_k$ be independent and identically distributed continuous 
random variables on [0,1] with common density $f \in L^2[0,1]$, and let 
$z = x_1 + \cdots + x_j - x_{j+1} - \cdots - x_k \pmod 1$. Then z approaches a uniform distribution on 
[0,1] as $k \to \infty$. 

Proof: Let h be the density for z. Using Lemmas 2 and 3 we have 

$$\|h(t) - 1\|_2^2 = \sum_{n \ne 0} |\hat h(n)|^2 = \sum_{n \ne 0} |\hat f(n)|^{2j}\, \big|\overline{\hat f(n)}\big|^{2(k-j)} = \sum_{n \ne 0} |\hat f(n)|^{2k} \to 0 \text{ as } k \to \infty.$$

(See the proof of Theorem 1.) ∎ 

Theorem 3. Let x be a continuous random variable on [0,1] with density $f \in L^2[0,1]$, 
and let $z = mx \pmod 1$, $m \in \mathbb{Z}$. Then z approaches a uniform distribution on [0, 1] as 
$|m| \to \infty$. 

Proof: Let h be the density of z. Then by Lemma 3 

$$\|h(t) - 1\|_2^2 = \sum_{n \ne 0} |\hat h(n)|^2 = \sum_{n \ne 0} |\hat f(mn)|^2 \le \sum_{|n| \ge |m|} |\hat f(n)|^2.$$

This last sum goes to zero as $|m| \to \infty$ since $\sum_n |\hat f(n)|^2$ converges. ∎ 


Theorems 2 and 3 imply that the log distribution for first significant digits is the 
limiting distribution when continuous random variables are repeatedly multiplied, 
divided, or raised to integer powers. Let $x_1, \ldots, x_k$ be continuous random 
variables and let $n_j \in \mathbb{Z}$, $j = 1, \ldots, k$. Then under quite general conditions on the $x_j$, 
if $z = x_1^{n_1} \cdot x_2^{n_2} \cdots x_k^{n_k}$ is written as $z = m_z \cdot 10^{n_z}$, where 
$1 \le m_z < 10$ and $n_z \in \mathbb{Z}$, then $m_z$ approaches the log distribution as $\sum |n_j| \to \infty$. 

One of the advantages of using Fourier Series is we can compute practical 
bounds on the rate of convergence to the log distribution. Several authors ([T], 
[AS], [A]) have investigated the convergence to the log distribution when random 
variables uniformly distributed on [0, 1] or [1, 10] are repeatedly multiplied. If X is 
such a random variable then one can easily show that $x = \log X \pmod 1$ has an 
exponential density function $f(t) = \ln 10 \cdot 10^t / 9$, $0 \le t < 1$. Computing Fourier 
coefficients, 

$$\hat f(n) = \int_0^1 \frac{\ln 10 \cdot 10^t}{9}\, e^{-2\pi i n t}\,dt = \frac{\ln 10}{\ln 10 - 2\pi i n}.$$

Let $x_1, \ldots, x_k$ be independent random variables distributed as x above, and let 
$z = \sum_{i=1}^{k} x_i \pmod 1$. If z has density function $h_k$, then by Lemma 2, 
$\hat h_k(n) = (\ln 10/(\ln 10 - 2\pi i n))^k$. Hence, 

$$\|h_k(t) - 1\|_2^2 = \sum_{n \ne 0} \left| \frac{\ln 10}{\ln 10 - 2\pi i n} \right|^{2k} < 2 \sum_{n=1}^{\infty} \left( \frac{\ln 10}{2\pi n} \right)^{2k} = 2 \left( \frac{\ln 10}{2\pi} \right)^{2k} \sum_{n=1}^{\infty} \frac{1}{n^{2k}} < 2 \left( \frac{\ln 10}{2\pi} \right)^{2k} \left( \frac{2k}{2k-1} \right).$$


Note that the $L^1$ distance between the densities of random variables x and y is the 
same as the $L^1$ distance between the densities of g(x) and g(y) for any one-to-one 
differentiable transformation g. In our situation, we are using the log 
transformation to convert random variables that are multiplied into ones that are added on 
[0, 1]. Since the $L^2$ norm on [0, 1] dominates the $L^1$ norm, the $L^2$ norm provides a 
useful method for bounding the rate of convergence to the log distribution under 
repeated multiplications, divisions, or exponentiations. Therefore, Table 1(a) 
provides an accurate measure of the rate of convergence to the log distribution when 
k independent and uniformly distributed random variables on [1,10] are 
multiplied. For example, if five independent and uniformly distributed random variables 
on [0,1] are multiplied, the probability that the first significant digit is a one is 
log 2 ± .0098, or between .2912 and .3108. 
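
The entries of Table 1(a) can be recomputed from the Fourier coefficients $\ln 10/(\ln 10 - 2\pi i n)$ given above. The sketch below rests on one assumption: that the tabulated bound is the square root of $2(\ln 10/2\pi)^{2k} \cdot 2k/(2k-1)$, obtained by comparing $\sum 1/n^{2k}$ with an integral. That reading reproduces the printed values. The function names and the truncation length are illustrative.

```python
import math

LN10 = math.log(10)
RATIO = LN10 / (2 * math.pi)  # ln 10 / (2*pi), roughly 0.3665


def l2_bound(k: int) -> float:
    """Assumed closed-form bound on ||h_k - 1||_2 for the product of k uniform variables."""
    return math.sqrt(2 * RATIO ** (2 * k) * (2 * k) / (2 * k - 1))


def l2_truncated(k: int, terms: int = 20000) -> float:
    """||h_k - 1||_2 from the series over n != 0 of |ln10 / (ln10 - 2*pi*i*n)|^(2k), truncated."""
    a = LN10 ** 2
    total = sum((a / (a + (2 * math.pi * n) ** 2)) ** k for n in range(1, terms + 1))
    return math.sqrt(2 * total)


table_1a = {k: l2_bound(k) for k in (1, 2, 3, 4, 5, 10)}
```

For k = 5 the bound is about .0098, which is the figure used in the first-digit example above; the truncated series shows that the true distance is somewhat smaller than the bound for every k.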

In [AS] it is shown that if X is uniformly distributed on [0,1], then the first 
significant digit of $X^m$, $m \in \mathbb{Z}^+$, approaches a log distribution as $m \to \infty$. Of 
course, this is equivalent to showing that if the random variable x has density 
$f(t) = \ln 10 \cdot 10^t / 9$, $0 \le t < 1$, then $z = mx \pmod 1$ approaches a uniform 
distribution. If $h_m(t)$ is the density for z, then by Lemma 3, $\hat h_m(n) = \hat f(mn) = 
\ln 10/(\ln 10 - 2\pi i m n)$. Consequently, 

$$\|h_m(t) - 1\|_2^2 = \sum_{n \ne 0} \left| \frac{\ln 10}{\ln 10 - 2\pi i m n} \right|^2 < \sum_{n \ne 0} \left| \frac{\ln 10}{2\pi i m n} \right|^2 = 2 \left( \frac{\ln 10}{2\pi m} \right)^2 \sum_{n=1}^{\infty} \frac{1}{n^2} = 2 \left( \frac{\ln 10}{2\pi m} \right)^2 \frac{\pi^2}{6} = \frac{1}{12} \left( \frac{\ln 10}{m} \right)^2.$$

Table 1(b) illustrates the convergence to the log distribution in this case. 

TABLE 1. Convergence to the log distribution when uniformly distributed 
random variables on [1, 10] are (a) multiplied, and (b) raised to integer powers. 

          (a)                               (b)
  k     bound on ||h_k - 1||_2       m      bound on ||h_m - 1||_2
  1     .73                          1      .66
  2     .219                         2      .33
  3     .076                         3      .22
  4     .027                         4      .17
  5     .0098                        5      .13
  10    .000063                      8      .083
  ∞     0                            10     .066
                                     100    .0066
                                     1000   .00066
                                     ∞      0

Benford’s logarithmic law of first significant digits is very similar to another 
empirical law of nature, namely the common occurrence of bell shaped 
distributions. These naturally occurring distributions are a consequence of underlying 
mathematical forces. Certainly, as we have seen, the multiplicative operations 
(multiplication, division, and raising to powers) are a major contributing force in 
the log law. In fact, Benford’s log law is a consequence of “central limit” like 
theorems for first significant digits under the multiplicative operations. 

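
The bound for integer powers can be tabulated the same way. This sketch assumes the bound on $\|h_m - 1\|_2$ is $\ln 10/(\sqrt{12}\,|m|)$, which is what the Fourier coefficients $\ln 10/(\ln 10 - 2\pi i m n)$ and $\sum 1/n^2 = \pi^2/6$ give; the function names and truncation length are illustrative.

```python
import math

LN10 = math.log(10)


def power_bound(m: int) -> float:
    """Assumed bound ln 10 / (sqrt(12) * |m|) on ||h_m - 1||_2 for z = m x (mod 1)."""
    return LN10 / (math.sqrt(12) * abs(m))


def power_truncated(m: int, terms: int = 20000) -> float:
    """||h_m - 1||_2 from the series over n != 0 of |ln10 / (ln10 - 2*pi*i*m*n)|^2, truncated."""
    a = LN10 ** 2
    total = sum(a / (a + (2 * math.pi * m * n) ** 2) for n in range(1, terms + 1))
    return math.sqrt(2 * total)


table_1b = {m: power_bound(m) for m in (1, 2, 3, 4, 5, 8, 10, 100, 1000)}
```

The bound decays only like 1/m, much more slowly than the geometric decay obtained for products of k independent variables.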
Triangle Inequality 
In a recent article, Wang and Zhang [2] prove that 
$$[1 - |(u,v)|^2]^{1/2} \le [1 - |(u,w)|^2]^{1/2} + [1 - |(v,w)|^2]^{1/2}$$
for unit vectors in a complex inner product space. As they observe, it is 
enough to prove this result when u, v, w are all in $\mathbb{C}^2$. The theorem extends 
to all nonzero vectors in $\mathbb{C}^2$ if we define 

$$d^2(u,v) = 1 - \frac{|(u,v)|^2}{|u|^2 |v|^2};$$

we then have $d(u,v) \le d(u,w) + d(w,v)$. A triangle inequality like this 
automatically attracts attention, and I want to analyze what lies behind it. 

It is easy to see that $d(\lambda u, \mu v) = d(u, v)$ for nonzero complex scalars $\lambda, \mu$. 
Thus if we map $u = (u_1, u_2)$ in $\mathbb{C}^2 \setminus \{(0,0)\}$ to $z_u = u_1/u_2$ in $\mathbb{C} \cup \{\infty\}$, the 
value of d depends only on the images. Specifically, we can compute that 

$$d(u, v) = d(z_u, z_v) = \frac{|z_u - z_v|}{[(1 + |z_u|^2)(1 + |z_v|^2)]^{1/2}}. \tag{*}$$

Now there is a standard bijection between the unit sphere and $\mathbb{C} \cup \{\infty\}$, given 
by projection from (0, 0, 1) to the plane through the origin; and a 
straightforward computation [1, p. 22] shows that the distance between points on the 
sphere corresponding to $z_u$ and $z_v$ is exactly 2 times (*). Thus the triangle 
inequality in [2] is a consequence of the triangle inequality for distances on 
the Riemann sphere. 
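
Both the triangle inequality and formula (*) lend themselves to a numerical spot check. The sketch below draws random vectors in $\mathbb{C}^2$ (the Gaussian sampling, trial count and seed are choices made here) and compares d computed from the inner product with d computed from the ratios $z_u = u_1/u_2$.

```python
import math
import random


def inner(u, v):
    """Hermitian inner product (u, v) on C^2."""
    return sum(a * b.conjugate() for a, b in zip(u, v))


def d(u, v):
    """d(u, v) with d^2 = 1 - |(u, v)|^2 / (|u|^2 |v|^2)."""
    ratio = abs(inner(u, v)) ** 2 / (inner(u, u).real * inner(v, v).real)
    return math.sqrt(max(0.0, 1.0 - ratio))


def d_projective(u, v):
    """Formula (*) in terms of z_u = u_1 / u_2; assumes u_2 and v_2 are nonzero."""
    zu, zv = u[0] / u[1], v[0] / v[1]
    return abs(zu - zv) / math.sqrt((1 + abs(zu) ** 2) * (1 + abs(zv) ** 2))


def random_vector(rng):
    return (complex(rng.gauss(0, 1), rng.gauss(0, 1)),
            complex(rng.gauss(0, 1), rng.gauss(0, 1)))


def check(trials=1000, seed=0):
    """Return the smallest triangle-inequality slack and the largest gap between the two formulas."""
    rng = random.Random(seed)
    smallest_slack = float("inf")
    largest_gap = 0.0
    for _ in range(trials):
        u, v, w = (random_vector(rng) for _ in range(3))
        smallest_slack = min(smallest_slack, d(u, w) + d(w, v) - d(u, v))
        largest_gap = max(largest_gap, abs(d(u, v) - d_projective(u, v)))
    return smallest_slack, largest_gap
```

A nonnegative slack in every trial is consistent with the inequality, and agreement of the two formulas reflects the fact that d depends only on the points $z_u$, $z_v$ of the Riemann sphere.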


[1] L. V. Ahlfors, Complex Analysis. McGraw-Hill, New York, 1953. 
[2] B.-Y. Wang and F. Zhang, A trace inequality for unitary matrices. 
Amer. Math. Monthly 101 (1994), 453-455. 
William C. Waterhouse 
Department of Mathematics 


The Pennsylvania State University 
University Park, PA 16802 
NOTES 


Edited by: John Duncan 


Cross Product Identities 
in Arbitrary Dimension 


Andrew Dittmer 


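I. INTRODUCTION. Recently, a certain amount of attention in the Monthly [1], 
[3] has been devoted to proofs of the vector identities on $\mathbb{R}^3$ 

(1) (the “Baccab” identity) $a \times (b \times c) = b(a \cdot c) - c(a \cdot b)$, 
(2) (usually obtained from (1)) $(a \times b) \cdot (c \times d) = (a \cdot c)(b \cdot d) - (a \cdot d)(b \cdot c)$. 

It has been shown [2], [4] that the ordinary cross product generalizes to a 
function $a_1 \times \cdots \times a_{n-1}$ of n - 1 vectors on $\mathbb{R}^n$, and proofs were indicated in [2], 
[3] that (1) and (2) generalize to identities which hold in arbitrary dimension. In 
particular, we have 

$$a_2 \times \cdots \times a_{n-1} \times (b_1 \times \cdots \times b_{n-1}) = (-1)^{n+1} \begin{vmatrix} b_1 & \cdots & b_{n-1} \\ a_2 \cdot b_1 & \cdots & a_2 \cdot b_{n-1} \\ \vdots & & \vdots \\ a_{n-1} \cdot b_1 & \cdots & a_{n-1} \cdot b_{n-1} \end{vmatrix} \tag{1*}$$

$$(a_1 \times \cdots \times a_{n-1}) \cdot (b_1 \times \cdots \times b_{n-1}) = \begin{vmatrix} a_1 \cdot b_1 & \cdots & a_1 \cdot b_{n-1} \\ \vdots & & \vdots \\ a_{n-1} \cdot b_1 & \cdots & a_{n-1} \cdot b_{n-1} \end{vmatrix} \tag{2*}$$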


where in (1*) the determinant is to be interpreted formally in the manner of 
elementary calculus texts. 

While several reasonably elementary proofs of the identities (1) and (2) are 
given in [1] and [3], only one ([3], p. 624, pp. 627-629) is coordinate-free and seems 
to generalize to higher dimensions. However, this argument contains a logical flaw 
which cannot trivially be patched in more than three dimensions. Since this line of 
reasoning is fairly common, we briefly review it here. Clearly, both sides of (1) are 
vectors orthogonal to a and lie in the plane spanned by b and c. The claim is then 
made that this implies that the r.h.s. of (1) is a scalar multiple of the l.h.s. The 
problem with this argument arises when a is perpendicular to the plane spanned by 
b and c. If this occurs, any vector in this plane satisfies the condition of being 
orthogonal to a, so we cannot trivially deduce that the r.h.s. is a scalar multiple of 
the l.h.s. Of course, in this case, both sides are clearly zero. In the analogous 
situation in higher dimensions, both sides of (1*) are zero. However, a proof of this 
seems to require an elaborate algebraic digression, which is alien to the 
geometric appeal of the original argument. 

Perhaps the most elegant of the proofs of (1*) and (2*) already in the literature 
is given in [3], p. 625. In this approach we note that with the inner product given by 
$(a_1 \wedge a_2 \wedge \cdots \wedge a_{n-1}) \cdot (b_1 \wedge b_2 \wedge \cdots \wedge b_{n-1}) = \det(a_i \cdot b_j)$, the Hodge star 
map $*\colon \Lambda^{n-1}(\mathbb{R}^n) \to \Lambda^1(\mathbb{R}^n) = \mathbb{R}^n$ becomes an isometry. It is then possible to define 
$a_1 \times a_2 \times \cdots \times a_{n-1} := *(a_1 \wedge a_2 \wedge \cdots \wedge a_{n-1})$ and then to verify that the 
standard properties of the extended cross product hold without much trouble. 

The purpose of this note is to quickly review the definition and basic properties 
of the extended cross product and then to give a coordinate-free proof of identities 
(1*) and (2*) which uses only elementary linear algebra. We note that this proof 
works perfectly well in n = 3, in which case we obtain simple and completely 
elementary proofs of identities (1) and (2) (which proofs do not require introduc- 
tion of the extended cross product). In the appendix, we give another proof 
employing the tensor product. 


II. DEFINITION. We follow Spivak [4] in defining the cross product on $\mathbb{R}^n$. Let $\varphi$: 
$\mathbb{R}^n \to \mathbb{R}$ be defined by $\varphi(a_1) = \det(a_1, a_2, \ldots, a_n)$ (the determinant of the matrix 
with $a_1, \ldots, a_n$ as row vectors), where $a_2, \ldots, a_n$ are fixed vectors in $\mathbb{R}^n$. Since the 
determinant is linear in each of its variables, it follows that $\varphi$ is linear as well. 
Therefore $\varphi$ is a linear functional on $\mathbb{R}^n$, so there exists a unique vector $u \in \mathbb{R}^n$ 
such that $\varphi(a_1) = u \cdot a_1$. We then define $a_2 \times \cdots \times a_n = u$. In other words, the 
cross product is characterized by 

$$a_1 \cdot u = \det(a_1, a_2, \ldots, a_n) \text{ for all } a_1 \in \mathbb{R}^n \iff u = a_2 \times \cdots \times a_n. \tag{3}$$

Since the triple scalar product identity $a \cdot (b \times c) = \det(a, b, c)$ is well-known to 
hold for the standard cross product on $\mathbb{R}^3$, it follows by (3) that the cross product 
as defined above coincides with the standard cross product when n = 3. 
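
Characterization (3) translates directly into a computation: taking $a_1$ to be the j-th standard basis vector shows that the j-th component of $a_2 \times \cdots \times a_n$ is $\det(e_j, a_2, \ldots, a_n)$. The sketch below implements this with a recursive Laplace expansion, which is exact for integer entries but only sensible for small n; the function names are chosen here.

```python
def det(matrix):
    """Determinant by Laplace expansion along the first row (exact for integer entries)."""
    n = len(matrix)
    if n == 1:
        return matrix[0][0]
    total = 0
    for j in range(n):
        minor = [row[:j] + row[j + 1:] for row in matrix[1:]]
        total += (-1) ** j * matrix[0][j] * det(minor)
    return total


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cross(*vectors):
    """Cross product a_2 x ... x a_n of n - 1 vectors in R^n.

    Component j is det(e_j, a_2, ..., a_n), with the vectors used as rows.
    """
    n = len(vectors) + 1
    if any(len(v) != n for v in vectors):
        raise ValueError("expected n - 1 vectors of length n")
    rows = [list(v) for v in vectors]
    result = []
    for j in range(n):
        basis_vector = [1 if i == j else 0 for i in range(n)]
        result.append(det([basis_vector] + rows))
    return result
```

For n = 3 this returns the familiar cross product, for example `cross([1, 2, 3], [4, 5, 6])` gives `[-3, 6, -3]`, and in any dimension `dot(a1, cross(a2, ..., an))` equals the determinant with rows `a1, a2, ..., an`.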


III. BASIC PROPERTIES. The basic properties satisfied by the cross product are: 

The cross product is skew-symmetric and linear in each variable. (4) 
$a_2 \times \cdots \times a_n$ is orthogonal to each of $a_2, \ldots, a_n$. (5) 
$a_2 \times \cdots \times a_n = 0 \iff a_2, \ldots, a_n$ are linearly dependent. (6) 
If $a_2 \times \cdots \times a_n \ne 0$ then the n vectors $a_2 \times \cdots \times a_n, a_2, \ldots, a_n$ form a 
positively oriented parallelotope. (7) 
$|a_2 \times \cdots \times a_n|$ is the (n - 1)-dimensional volume of the parallelotope formed 
by the vectors $a_2, \ldots, a_n$. (8) 

(4) follows immediately from the bilinearity of the dot product and the 
multilinearity and skew-symmetry of the determinant function. If in (3) we take $a_1 = a_i$ 
($i = 2, \ldots, n$), we obtain $a_i \cdot (a_2 \times \cdots \times a_n) = \det(a_i, a_2, \ldots, a_i, \ldots, a_n) = 0$, 
which proves (5). To see (6), note that by (3), $a_2 \times \cdots \times a_n = 0$ if and only if for 
all $a_1 \in \mathbb{R}^n$, $a_1 \cdot (a_2 \times \cdots \times a_n) = \det(a_1, a_2, \ldots, a_n) = 0$. Hence for all $a_1 \in \mathbb{R}^n$, 
$a_1, a_2, \ldots, a_n$ are linearly dependent, which happens when and only when $a_2, \ldots, a_n$ 
are linearly dependent. Now the n-dimensional oriented volume of the 
parallelotope A formed by the vectors $a_2 \times \cdots \times a_n, a_2, \ldots, a_n$ is 

$$\det(a_2 \times \cdots \times a_n, a_2, \ldots, a_n) = (a_2 \times \cdots \times a_n) \cdot (a_2 \times \cdots \times a_n) = |a_2 \times \cdots \times a_n|^2 > 0 \tag{9}$$

which shows (7). If $a_2 \times \cdots \times a_n = 0$, then (8) follows from (6). Otherwise, 
looking at A geometrically, we see that the vector $a_2 \times \cdots \times a_n$ is perpendicular 
to the hyperplane spanned by the other vectors. Therefore, the n-dimensional 
volume of A is $\Gamma h$ where h is the “height” $|a_2 \times \cdots \times a_n|$ and $\Gamma$ is the “base” 
parallelotope formed by the vectors $a_2, \ldots, a_n$. Comparison with (9) then gives (8). 


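IV. THE PROOF. Clearly, (1*) holds if and only if for all $a_1 \in \mathbb{R}^n$, 

$$a_1 \cdot [a_2 \times \cdots \times a_{n-1} \times (b_1 \times \cdots \times b_{n-1})] = (-1)^{n+1} \begin{vmatrix} a_1 \cdot b_1 & \cdots & a_1 \cdot b_{n-1} \\ a_2 \cdot b_1 & \cdots & a_2 \cdot b_{n-1} \\ \vdots & & \vdots \\ a_{n-1} \cdot b_1 & \cdots & a_{n-1} \cdot b_{n-1} \end{vmatrix}. \tag{10}$$

However, since an n-cycle is an even permutation if and only if n is odd, 

$$\begin{aligned}
a_1 \cdot [a_2 \times \cdots \times a_{n-1} \times (b_1 \times \cdots \times b_{n-1})] &= \det(a_1, \ldots, a_{n-1}, b_1 \times \cdots \times b_{n-1}) \\
&= (-1)^{n+1} \det(b_1 \times \cdots \times b_{n-1}, a_1, \ldots, a_{n-1}) \\
&= (-1)^{n+1} (b_1 \times \cdots \times b_{n-1}) \cdot (a_1 \times \cdots \times a_{n-1})
\end{aligned} \tag{11}$$

follows from (3) and the skew-symmetry of the determinant. Therefore equations 
(1*) and (2*) are equivalent. In both this proof and the proof in the appendix we 
will show that (2*) holds. 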

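If $b_1, \ldots, b_{n-1}$ are linearly dependent, the l.h.s. of (2*) is zero by (6) and the 
r.h.s. is zero because the column vectors are linearly dependent. Otherwise, (6) 
implies that $|b_1 \times \cdots \times b_{n-1}| \ne 0$. In this case, define n × n matrices 

$$M = \begin{pmatrix} b_1 \times \cdots \times b_{n-1} \\ a_1 \\ \vdots \\ a_{n-1} \end{pmatrix}, \qquad N = \begin{pmatrix} b_1 \times \cdots \times b_{n-1} & b_1 & \cdots & b_{n-1} \end{pmatrix}, \tag{12}$$

so M is a matrix of row vectors and N is a matrix of column vectors. By (3), we 
have 

$$\det(M) = (b_1 \times \cdots \times b_{n-1}) \cdot (a_1 \times \cdots \times a_{n-1}), \text{ and} \tag{13}$$
$$\det(N) = \det(N^T) = (b_1 \times \cdots \times b_{n-1}) \cdot (b_1 \times \cdots \times b_{n-1}) = |b_1 \times \cdots \times b_{n-1}|^2. \tag{14}$$

Because of (5), 

$$MN = \begin{pmatrix} |b_1 \times \cdots \times b_{n-1}|^2 & 0 & \cdots & 0 \\ a_1 \cdot (b_1 \times \cdots \times b_{n-1}) & a_1 \cdot b_1 & \cdots & a_1 \cdot b_{n-1} \\ \vdots & \vdots & & \vdots \\ a_{n-1} \cdot (b_1 \times \cdots \times b_{n-1}) & a_{n-1} \cdot b_1 & \cdots & a_{n-1} \cdot b_{n-1} \end{pmatrix}. \tag{15}$$

The determinant of the submatrix 

$$X = (a_i \cdot b_j) = \begin{pmatrix} a_1 \cdot b_1 & \cdots & a_1 \cdot b_{n-1} \\ \vdots & & \vdots \\ a_{n-1} \cdot b_1 & \cdots & a_{n-1} \cdot b_{n-1} \end{pmatrix} \tag{16}$$

is the r.h.s. of (2*). Since 

$$|b_1 \times \cdots \times b_{n-1}|^2 \det X = \det(MN) = \det(M)\det(N) = ((b_1 \times \cdots \times b_{n-1}) \cdot (a_1 \times \cdots \times a_{n-1}))\, |b_1 \times \cdots \times b_{n-1}|^2 \tag{17}$$

and $|b_1 \times \cdots \times b_{n-1}| \ne 0$, the proof of equation (2*) is complete. 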
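
Identity (2*) and the dotted form (10) of (1*) can be spot-checked with exact integer arithmetic. The sketch below computes the generalized cross product from characterization (3), component j being $\det(e_j, a_2, \ldots, a_n)$; the sample vectors in $\mathbb{R}^4$ are arbitrary choices made here, and a check on a few vectors is of course no substitute for the proof.

```python
def det(matrix):
    """Determinant by Laplace expansion along the first row (exact for integer entries)."""
    n = len(matrix)
    if n == 1:
        return matrix[0][0]
    return sum((-1) ** j * matrix[0][j] * det([row[:j] + row[j + 1:] for row in matrix[1:]])
               for j in range(n))


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cross(*vectors):
    """Cross product of n - 1 vectors in R^n: component j is det(e_j, v_1, ..., v_{n-1})."""
    n = len(vectors) + 1
    rows = [list(v) for v in vectors]
    return [det([[1 if i == j else 0 for i in range(n)]] + rows) for j in range(n)]


def gram_determinant(a_vectors, b_vectors):
    """Right-hand side of (2*): the determinant of the matrix of dot products a_i . b_j."""
    return det([[dot(a, b) for b in b_vectors] for a in a_vectors])


a_vectors = [[1, 2, 0, -1], [0, 1, 3, 2], [2, -1, 1, 0]]
b_vectors = [[1, 0, 1, 1], [-2, 1, 0, 3], [0, 4, -1, 2]]

lhs_2star = dot(cross(*a_vectors), cross(*b_vectors))
rhs_2star = gram_determinant(a_vectors, b_vectors)

# Left-hand side of (10) for n = 4: a_1 . [a_2 x a_3 x (b_1 x b_2 x b_3)].
lhs_10 = dot(a_vectors[0], cross(a_vectors[1], a_vectors[2], cross(*b_vectors)))
```

For n = 4 the sign $(-1)^{n+1}$ is negative, so `lhs_10` should equal `-rhs_2star`, while `lhs_2star` should equal `rhs_2star`.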


V. APPENDIX. The proof given here is a modified and generalized version of a 
proof given in [3] (see p. 624 and pp. 627-629). We first prove a lemma which 
shows that when a multilinear scalar-valued function is lifted to a function on the 
tensor product, the kernel of the resulting function can be determined solely by 
reference to the kernel of the original function. 

Lemma 1. Let V and W be finite-dimensional vector spaces over a field F. Let p: 
$V \times W \to F$ be a bilinear function, and let $p^*\colon V \otimes W \to F$ be the natural lift of p to 
a linear transformation of $V \otimes W$ guaranteed by the universal mapping property. 
Then $\ker(p^*)$ is spanned by the simple tensors $v \otimes w$ contained in it. 

Proof: For each $v \in V$, define a linear transformation $\varphi_v\colon W \to F$ by $\varphi_v(w) = 
p(v, w)$. Then the map $\varphi\colon v \mapsto \varphi_v$ defines a linear transformation from V to $W^*$, 
the dual space of W. By Gauss-Jordan elimination, we may choose bases $\{e_i\}$ of V 
and $\{f_i^*\}$ of $W^*$ so that $\varphi(e_i) = f_i^*$ ($i = 1, \ldots, k$), $\varphi(e_i) = 0$ ($i > k$), where k = 
rank($\varphi$). If we now pass to the dual basis $\{f_i\}$ of W, we have $p(e_i, f_j) = 1$ 
($i = j \le k$), $p(e_i, f_j) = 0$ otherwise. Now if $p^*$ is identically zero, then the lemma 
is trivial, so we may assume that range($p^*$) = F. If dim(V) = n, dim(W) = m, it 
follows that dim(ker($p^*$)) = nm - 1, so it suffices to find nm - 1 linearly 
independent simple tensors in ker($p^*$). However, if we set 

$$S = \{e_i \otimes f_j : p^*(e_i \otimes f_j) = p(e_i, f_j) = 0\}, \tag{18}$$
$$T = \{e_1 \otimes f_1 - e_1 \otimes f_i + e_i \otimes f_1 - e_i \otimes f_i = (e_1 + e_i) \otimes (f_1 - f_i) : 1 < i \le k\}, \tag{19}$$

then S has nm - k elements, T has k - 1 elements, and $S \cup T$ is clearly linearly 
independent and contained in ker($p^*$). 


The following corollary results from a straightforward induction on r. 


Corollary. Let $V_1, \ldots, V_r$ be finite-dimensional vector spaces over a field F. Let p: 
$V_1 \times \cdots \times V_r \to F$ be a multilinear function, and let $p^*\colon V_1 \otimes \cdots \otimes V_r \to F$ be the 
natural lift of p to a linear transformation of $V_1 \otimes \cdots \otimes V_r$. Then $\ker(p^*)$ is 
spanned by the simple tensors $v_1 \otimes \cdots \otimes v_r$ contained in it. 


The next lemma is crucial in this proof of (2*). An elementary proof of a 
weaker statement (which is, however, sufficient for proving the identity under 
consideration) can be found in [3]. 

Lemma 2. Let $p, q\colon V_1 \times \cdots \times V_r \to F$ be multilinear functions, and suppose that 
$\ker(p) \subset \ker(q)$. Then there exists a scalar $\lambda$ in F such that $\lambda p = q$. 

Proof: Lift p, q to linear transformations $p^*, q^*\colon V_1 \otimes \cdots \otimes V_r \to F$. Now 
$\ker(p) \subset \ker(q)$ implies that the set of simple tensors in $\ker(p^*)$ is contained in 
the set of simple tensors in $\ker(q^*)$. However, the preceding corollary shows that 
these tensors span all of $\ker(p^*)$ and $\ker(q^*)$ respectively, so $\ker(p^*) \subset \ker(q^*)$. 
If $q^*$ is identically zero, then the lemma is trivial, so assume otherwise. Then 
range($p^*$) = range($q^*$) = F, so 

$$\dim(\ker(p^*)) = \dim(\ker(q^*)) = \dim(V_1 \otimes \cdots \otimes V_r) - 1.$$

Setting $K = \ker(p^*) = \ker(q^*)$, there is a tensor $t \notin K$ so that $V_1 \otimes \cdots \otimes V_r = K 
+ \langle t \rangle$. Then $\lambda = q^*(t)/p^*(t)$ clearly satisfies the requirements. 

Let $V_i = \mathbb{R}^n$ ($i = 1, \ldots, 2n - 2$), and interpret each side of (2*) as a multilinear 
function in $a_1, \ldots, a_{n-1}, b_1, \ldots, b_{n-1}$. If we could show that in this case the 
hypotheses of Lemma 2 are satisfied, it would show that the r.h.s. of (2*) is a 
constant multiple of the l.h.s. Evaluating for $a_i = b_i = e_i$ ($\{e_i\}$ denoting the 
standard orthonormal basis of $\mathbb{R}^n$) would then show that this constant is 1, giving 
us a proof of (2*). 

Proving that the hypotheses of Lemma 2 hold for the sides of (2*) is clearly 
equivalent to showing that the r.h.s. of (2*) is zero whenever the l.h.s. is zero. In 
the trivial cases where $a_1, \ldots, a_{n-1}$ (or $b_1, \ldots, b_{n-1}$) are linearly dependent, the 
l.h.s. of (2*) is zero by (6) and the r.h.s. is zero because the row vectors (column 
vectors) are linearly dependent. Otherwise, the fact that 

$$0 = (a_1 \times \cdots \times a_{n-1}) \cdot (b_1 \times \cdots \times b_{n-1}) = \det(a_1 \times \cdots \times a_{n-1}, b_1, \ldots, b_{n-1}) \tag{20}$$

implies that the vector $a_1 \times \cdots \times a_{n-1}$ is a nontrivial linear combination of the 
vectors $b_1, \ldots, b_{n-1}$. Therefore, the columns of the r.h.s. of (2*) nontrivially 
generate the vector 

$$(a_1 \cdot (a_1 \times \cdots \times a_{n-1}), \ldots, a_{n-1} \cdot (a_1 \times \cdots \times a_{n-1})) = 0, \tag{21}$$

which shows that the r.h.s. of (2*) is zero whenever the l.h.s. is. 


ACKNOWLEDGMENTS. The proof of Lemma 1 appearing in this paper is due to William Pardon. 
A Non-Constant Continuous Function 
on the Plane Whose Integral 
on Every Line Is Zero 


D. H. Armitage 


If $f$ is a continuous function on $\mathbb{C}$, integrable on $\mathbb{C}$ with respect to plane Lebesgue
measure, and if

$$\int_l f\,ds = 0 \tag{1}$$

for every (doubly infinite) line $l$, where $s$ denotes length measure, then $f$ must be
identically zero. This has been known for many years (see [5] for references), but
only comparatively recently was it shown that this result fails in the absence of the
global integrability condition. In fact, Zalcman [5; pp. 243-244], using a theorem of
Arakelian [1; p. 1189] concerning tangential holomorphic approximation on unbounded
sets, constructed a non-constant entire function $f$ satisfying (1) on every
line $l$. The purpose of this note is to give a construction, similar to Zalcman’s, but
using only elementary complex analysis. As a bonus, all the derivatives of the
function we produce also have vanishing integrals on all lines.

The main idea is to construct a non-constant entire function $g$ whose derivatives
satisfy

$$\int_l |g^{(n+1)}|\,ds < +\infty \tag{2}$$

and

$$g^{(n)}(z) \to 0 \quad \text{as } z \to \infty \ (z \in l) \tag{3}$$

for each line $l$ and each non-negative integer $n$ (cf. [5; p. 243]). If we define $f = g'$,
then by (2) each $f^{(n)}$ is integrable on every line $l$, while (3) together with the
fundamental theorem of calculus shows that

$$\int_l f^{(n)}\,ds = 0 \qquad (n = 0,1,2,\dots)$$

for each line $l$.
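
The appeal to the fundamental theorem of calculus can be written out as follows. This is a sketch of the step, not taken from the source; the parametrization $z(s) = z_0 + s e^{i\theta}$ of the line $l$ by arc length is our own notation.

```latex
% Parametrize l by arc length: z(s) = z_0 + s e^{i\theta}, s real.
% Since f = g', we have f^{(n)} = g^{(n+1)} and
%   \frac{d}{ds}\, g^{(n)}(z(s)) = e^{i\theta}\, g^{(n+1)}(z(s)).
\int_l f^{(n)}\,ds
  = \lim_{T \to \infty} \int_{-T}^{T} g^{(n+1)}(z(s))\,ds
  = \lim_{T \to \infty} e^{-i\theta}\,
      \bigl[\, g^{(n)}(z(T)) - g^{(n)}(z(-T)) \,\bigr]
  = 0 .
% The limit exists by (2), and both boundary terms tend to 0 by (3).
```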
We construct g by a simple pole-pushing technique which goes back at least to 
Runge [4]. 


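Lemma. Suppose that $z_1, z_2 \in \mathbb{C}$ and $|z_1 - z_2| < 1$. If $\phi_1$ is holomorphic on
$\mathbb{C} \setminus \{z_1\}$ and $\varepsilon > 0$, then there exists a holomorphic function $\phi_2$ on $\mathbb{C} \setminus \{z_2\}$ such
that

$$|(\phi_2 - \phi_1)(z)| < \varepsilon (1 + |z|)^{-2} \qquad (|z - z_2| > 1). \tag{4}$$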
For the sake of completeness, we indicate a proof of this lemma. The function
$\phi_1$ has a Laurent expansion centred at $z_2$:

$$\phi_1(z) = \phi_0(z) + \sum_{j=1}^{\infty} a_j (z - z_2)^{-j} \qquad (|z - z_2| > |z_1 - z_2|),$$

where $\phi_0$ is entire. By choosing $m$ sufficiently large, we can arrange that the
function $\phi_2$, defined by

$$\phi_2(z) = \phi_0(z) + \sum_{j=1}^{m} a_j (z - z_2)^{-j} \qquad (z \ne z_2),$$

satisfies (4).
To complete the construction of $g$, choose a sequence $(\zeta_k)$ of points lying on the
parabolic arc $P = \{t + it^2 : t \ge 0\}$ such that

$$\zeta_0 = 0, \qquad |\zeta_k - \zeta_{k-1}| < 1 \ (k \ge 1), \qquad \zeta_k \to \infty.$$

Let $g_0(z) = z^{-2}$. Using the lemma repeatedly, we obtain a sequence $(g_k)$ of
functions such that $g_k$ is holomorphic on $\mathbb{C} \setminus \{\zeta_k\}$ and

$$|(g_k - g_{k-1})(z)| < 2^{-k} (1 + |z|)^{-2} \qquad (k \ge 1;\ |z - \zeta_k| > 1). \tag{5}$$

Then $(g_k)$ is locally uniformly convergent to a limit function $g$ which is entire.
Define $P_a = \{z : \inf_{w \in P} |z - w| > a\}$. If $z \in P_1$, then by (5)

$$|(g - g_0)(z)| \le \sum_{k=1}^{\infty} |(g_k - g_{k-1})(z)| < (1 + |z|)^{-2} < |g_0(z)|,$$

so that $g \not\equiv 0$ and

$$|g(z)| < (1 + |z|)^{-2} + |g_0(z)| < 2|z|^{-2} \qquad (z \in P_1).$$

Hence, by Cauchy’s estimates,

$$|g^{(n)}(z)| \le 2\,n!\,(|z| - 1)^{-2} \qquad (n = 0,1,2,\dots;\ z \in P_2).$$

Since $l \setminus P_2$ is bounded for any line $l$, it follows that (2) and (3) hold, as required.

The real part of the function $f$ constructed above (or of Zalcman’s function)
provides an example of a non-constant harmonic function on $\mathbb{R}^2$ whose integral on
every line is zero. Recently, Armitage and Goldstein [3] showed that there exists a
non-constant harmonic function $h$ on $\mathbb{R}^N$, where $N \ge 2$, such that

$$\int_{\Lambda} h\,d\lambda = 0$$

for every $(N-1)$-dimensional hyperplane $\Lambda$, where $\lambda$ denotes $(N-1)$-dimensional
measure. The above argument for holomorphic functions can be
mimicked with harmonic functions on $\mathbb{R}^N$; for harmonic pole-pushing see, for
example, [2]. Thus the construction in [3] can be made more elementary. In
particular, the result of [3] can be obtained without recourse to the quite difficult
harmonic approximation theorem [2; Theorem 1.1] that we used, which played a
role analogous to that of Arakelian’s theorem in [5].

Chu’s 1303 Identity Implies Bombieri’s
1990 Norm-Inequality (Via an Identity 
of Beauzamy and Dégot) 


Doron Zeilberger¹


Blessed are the meek: for they shall inherit the earth (Matthew V.5) 


Inequalities are deep, while equalities are shallow. Nevertheless, it sometimes
happens that a deep inequality, A, follows from a mere equality B, which, in turn,
follows from a more general, and trivial², identity C.

In this note we demonstrate this, following [3], with A := Bombieri’s norm
inequality [2]³, B := an identity of Reznick [5], and C := an identity of Beauzamy
and Dégot [3]. This exposition differs from the original only in the punch line: I
give a 1-line proof of C, using Chu’s identity.

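Let $P(x_1,\dots,x_n)$ and $Q(x_1,\dots,x_n)$ be two polynomials in $n$ variables:

$$P = \sum_{i_1,\dots,i_n \ge 0} a_{i_1,\dots,i_n}\, x_1^{i_1} \cdots x_n^{i_n}, \qquad Q = \sum_{i_1,\dots,i_n \ge 0} b_{i_1,\dots,i_n}\, x_1^{i_1} \cdots x_n^{i_n}.$$

The Bombieri inner product [2] is defined by

$$[P, Q] = \sum_{i_1,\dots,i_n \ge 0} (i_1! \cdots i_n!)\, a_{i_1,\dots,i_n}\, b_{i_1,\dots,i_n},$$

and the Bombieri norm by $\|P\| = \sqrt{[P,P]}$.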


Bombieri’s Inequality A. Let $P$ and $Q$ be any homogeneous polynomials in
$(x_1,\dots,x_n)$, then

$$\|PQ\| \ge \|P\|\,\|Q\|.$$
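
As a quick numerical illustration of the definitions above, the following sketch computes the Bombieri inner product and norm for small polynomials and checks Inequality A on one example. The dictionary representation (exponent tuple to coefficient) and the helper names `multiply`, `bombieri_inner` and `bombieri_norm` are our own choices for illustration, not part of the source.

```python
from collections import defaultdict
from math import factorial, prod, sqrt

# A polynomial is a dict mapping an exponent tuple (i_1, ..., i_n)
# to the coefficient of x_1^{i_1} ... x_n^{i_n}.

def multiply(P, Q):
    """Product of two polynomials in the same number of variables."""
    R = defaultdict(float)
    for ep, a in P.items():
        for eq, b in Q.items():
            R[tuple(i + j for i, j in zip(ep, eq))] += a * b
    return dict(R)

def bombieri_inner(P, Q):
    """[P, Q] = sum over exponents of (i_1! ... i_n!) * a_i * b_i."""
    return sum(
        prod(factorial(i) for i in e) * a * Q.get(e, 0.0)
        for e, a in P.items()
    )

def bombieri_norm(P):
    return sqrt(bombieri_inner(P, P))

# Example with two homogeneous polynomials of degree 1 in (x, y).
P = {(1, 0): 1.0, (0, 1): 1.0}    # x + y
Q = {(1, 0): 1.0, (0, 1): -1.0}   # x - y
PQ = multiply(P, Q)               # x^2 - y^2

lhs = bombieri_norm(PQ)
rhs = bombieri_norm(P) * bombieri_norm(Q)
print(lhs, rhs, lhs >= rhs - 1e-12)
```

In this example both sides equal 2 up to rounding, so the inequality holds with equality; replacing `Q` by `P` gives a strict inequality.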


¹This note was written while the author was on leave (Fall 1993) at the Institute for Advanced
Study, Princeton. I would like to thank Don Knuth for a helpful suggestion.

²Trivial to verify, not to conceive!

³It was needed by Beauzamy and Enflo in their research on deep questions on Banach spaces. It
also turned out to have far-reaching applications to computer algebra [1]!
In order to state B and C, we need to introduce the following notation. $D_i = \partial/\partial x_i$
$(i = 1,\dots,n)$, $P^{(i_1,\dots,i_n)} := D_1^{i_1} \cdots D_n^{i_n} P$, and for any polynomial
$A(x_1,\dots,x_n)$, $A(D_1,\dots,D_n)$ denotes the linear partial differential operator with
constant coefficients obtained by replacing $x_i$ by $D_i$.

A follows almost immediately from ([5], [3]):


Reznick’s Identity B. For any polynomials $P$, $Q$ in $n$ variables:

$$\|PQ\|^2 = \sum_{i_1,\dots,i_n \ge 0} \frac{\bigl\| P^{(i_1,\dots,i_n)}(D_1,\dots,D_n)\, Q(x_1,\dots,x_n) \bigr\|^2}{i_1! \cdots i_n!}.$$


Beauzamy and Dégot’s Identity C. For any polynomials $P$, $Q$, $R$, $S$ in $n$ variables:

$$[PQ, RS] = \sum_{i_1,\dots,i_n \ge 0} \frac{\bigl[ R^{(i_1,\dots,i_n)}(D_1,\dots,D_n)\, Q(x_1,\dots,x_n),\; P^{(i_1,\dots,i_n)}(D_1,\dots,D_n)\, S(x_1,\dots,x_n) \bigr]}{i_1! \cdots i_n!}.$$


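Proof of B $\Rightarrow$ A. Pick the terms for which $i_1 + \cdots + i_n$ equals the (total) degree
of $P$, let’s call it $p$, and note that for those $(i_1,\dots,i_n)$, $P^{(i_1,\dots,i_n)}(x_1,\dots,x_n) = (i_1! \cdots i_n!)\, a_{i_1,\dots,i_n}$, so

$$\|PQ\|^2 \ge \sum_{i_1 + \cdots + i_n = p} \frac{\bigl\| P^{(i_1,\dots,i_n)}(D_1,\dots,D_n)\, Q(x_1,\dots,x_n) \bigr\|^2}{i_1! \cdots i_n!}$$

$$= \sum_{i_1 + \cdots + i_n = p} \bigl\| a_{i_1,\dots,i_n}\, Q(x_1,\dots,x_n) \bigr\|^2 \cdot (i_1! \cdots i_n!)$$

$$= \Bigl( \sum_{i_1 + \cdots + i_n = p} (i_1! \cdots i_n!)\, a_{i_1,\dots,i_n}^2 \Bigr) \|Q(x_1,\dots,x_n)\|^2 = \|P\|^2 \|Q\|^2.$$

Proof of C $\Rightarrow$ B. Take $R = P$ and $S = Q$.


Proof of C. Both sides are linear in $P$, in $Q$, in $R$, and in $S$, so it suffices to take
them all to be typical monomials ($P = x_1^{p_1} \cdots x_n^{p_n}$, and similarly for $Q$, $R$, and $S$),
for which the assertion follows immediately by applying Chu’s [4] identity

$$\sum_{i \ge 0} \binom{r}{i} \binom{s}{p-i} = \binom{r+s}{p}$$

to $r = r_t$, $s = s_t$, $p = p_t$ $(t = 1,\dots,n)$ (using $i_t$ for $i$), and taking their product.


Q.E.D.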
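
The reduction to monomials can be checked numerically. The sketch below verifies Chu’s identity for small parameters and then evaluates both sides of Identity C for one-variable monomials $P = x^p$, $Q = x^q$, $R = x^r$, $S = x^s$. The function names and the one-variable specialization are our own working, offered as a sanity check rather than as the author’s argument.

```python
from math import comb, factorial, perm

def chu_holds(r, s, p):
    """Check sum_i C(r, i) * C(s, p - i) == C(r + s, p)."""
    lhs = sum(comb(r, i) * comb(s, p - i) for i in range(p + 1))
    return lhs == comb(r + s, p)

def identity_c_sides(p, q, r, s):
    """Both sides of Identity C for P=x^p, Q=x^q, R=x^r, S=x^s (n = 1)."""
    # Left side: [x^(p+q), x^(r+s)] is (p+q)! if the degrees agree, else 0.
    lhs = factorial(p + q) if p + q == r + s else 0
    rhs = 0
    for i in range(min(p, r) + 1):
        # R^(i)(D) Q = perm(r, i) * D^(r-i) x^q = c1 * x^e1
        c1, e1 = perm(r, i) * perm(q, r - i), q - (r - i)
        # P^(i)(D) S = perm(p, i) * D^(p-i) x^s = c2 * x^e2
        c2, e2 = perm(p, i) * perm(s, p - i), s - (p - i)
        if e1 == e2 and e1 >= 0:
            # [c1 x^e, c2 x^e] = e! * c1 * c2; the division by i! is exact
            # because perm(r, i) is a multiple of i!.
            rhs += factorial(e1) * c1 * c2 // factorial(i)
    return lhs, rhs

chu_ok = all(
    chu_holds(r, s, p)
    for r in range(8) for s in range(8) for p in range(r + s + 1)
)
c_ok = all(
    lhs == rhs
    for p in range(5) for q in range(5) for r in range(5) for s in range(5)
    for lhs, rhs in [identity_c_sides(p, q, r, s)]
)
print(chu_ok, c_ok)
```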

Who are these mathematicians and what did they share?


Answer on page 910. 

How to Stay Competitive


Catherine C. McGeoch 


In the Office Of The Future, the coffee pot will come to you. You’ll push a button 
on your wrist communicator, and a mobile robot will appear at your desk with 
fresh pots of coffee and tea, and donuts. Also, through the Wonder of Technology, 
you will only have to work a half-day to maintain your current productivity, so you 
can devote the rest of your time to entrepreneurial pursuits.

Suppose you decide to go into the mobile-coffee-robot business, providing 
coffee-serving robots to office buildings for a flat monthly fee. Your expenses are 
proportional to the distance the robots travel, so you want to minimize this 
quantity by making smart decisions about which robot to send to service any given 
request. (After servicing a request, the robot stays where it is until needed again.) 
Of course it would help if you could make some predictions about future requests, 
but that is not possible. Here are two strategies: 


Greedy. Always send the robot that is nearest to the request. 


Balance. Send the robot S such that the total distance traveled by S (so far this 
month) would be minimized over all robots. 


Which strategy is better? How can you even compare them if you don’t know 
anything about the request sequence? It turns out that by a certain reasonable 
standard described below, Greedy is a bad algorithm; and in some cases Balance is 
a good algorithm. This column analyses the two algorithms according to this 
standard. 


The k-server problem. Let $\mathcal{G}$ be the set of symmetric weighted graphs on $n$ nodes
having nonnegative edge weights (the nodes represent offices and the weights
represent distances between offices). There are $k$ servers that move from node to
node by traversing edges. The cost of traversing an edge is equal to the weight on
that edge. The minimum cost of moving a server from node $w$ to node $x$ (not
necessarily via a direct edge) is $d_{wx}$.

A sequence $\sigma = (\sigma_1, \sigma_2, \dots, \sigma_m)$ of $m$ requests for service at vertices in $G \in \mathcal{G}$
is presented. Suppose $\sigma_t$ is a request at a particular vertex $x$. A k-server algorithm
A must cover node $x$ by moving one of the $k$ servers there. (We assume
throughout that $2 \le k < n$ since otherwise all algorithms are identical.) If A
moves a server from node $w$ then the service cost is $d_{wx}$. We assume that if there
is already a server on node $x$, then the algorithm does nothing and the service cost
is 0. Let $C_A(\sigma_t)$ denote the service cost for algorithm A to cover request $\sigma_t$, and let
$C_A(\sigma) = \sum_{t=1}^{m} C_A(\sigma_t)$ be the total cost of A on sequence $\sigma$.

[Figure 1: a 3-node graph on the nodes x, y, z.]

In fact the cost of A on $\sigma$ also depends on the initial placement of the servers
on nodes (before the first request). This technical point will be addressed in the
proofs below, which hold for arbitrary initial configurations.

An algorithm is online if it must decide how to cover $\sigma_t$ without having
information about future requests. Both Greedy and Balance are online. An
algorithm that can examine all of $\sigma$ before deciding how to cover each request $\sigma_t$
is offline.

An online k-server algorithm B is c-competitive if, for any graph $G \in \mathcal{G}$, any
request sequence $\sigma$, and all other k-server algorithms A, it holds that $C_B(\sigma) \le c \cdot C_A(\sigma) + a$,
for a constant $a$ that may depend on $G$. Note that A is allowed to
be either online or offline.

If B is c-competitive, then your mobile-coffee-robots will travel no more than $c$
times as far as is absolutely necessary to service requests. An algorithm for which $c$
is a small constant is a good algorithm. How small can $c$ be? It is known that there
cannot be a c-competitive algorithm for the k-server problem when $c < k$ (see [3]).
The two theorems below concern the k-competitiveness of Greedy and Balance.


Theorem. Greedy is not k-competitive for the k-server problem. 


Proof: We need only exhibit a counterexample to k-competitiveness. Let $k = 2$,
and consider the 3-node graph in Figure 1 and the sequence $\sigma = (z, x, y, x, y, x, y, x, \dots)$
of length $m$. The first two requests will cause Greedy to
position the two servers on nodes $x$ and $z$ (with cost between 0 and 6 depending
on initial conditions). After that, Greedy will shuttle one server between nodes $x$
and $y$, incurring a service cost of 4 each time. Therefore the total cost for Greedy
is $C_G(\sigma) = 4(m - 2)$.

However an online algorithm A could place its servers on $x$ and $y$ after the
first three requests (incurring cost at most 12), and subsequently have a service cost
of 0. Therefore $C_A(\sigma) \le 12$. For any constant $a$, the sequence $\sigma$ can be made long
enough to obtain $C_G(\sigma) \not\le 2C_A(\sigma) + a$. It is straightforward to extend this argument
to larger $k$ by using this example graph as a subgraph in a larger problem.

□


This theorem can be extended to show that Greedy is not c-competitive for any 
constant c. Greedy can be “fooled” with a request sequence that makes it behave 
badly in comparison to an offline algorithm. The next theorem shows that in the
special case k = n — 1, Balance can’t be similarly fooled: any sequence that is bad 
for Balance is also fairly bad (within a factor of k) for all algorithms. To get some 
intuition, try running Balance on the above example. 

First we define the Balance strategy a little more carefully. Suppose $\sigma_t$ is a
request for vertex $s$. If $s$ is already covered, then Balance does nothing. If $s$ is not
covered, then for each node $l$ that is currently covered, let $D_{l,t-1}$ be the distance
traveled so far by the server on $l$. Balance covers $s$ by moving a server from some
node $w$ such that $D_{w,t-1} + d_{ws}$ is minimized over all nodes. After the move we
have $D_{s,t} = D_{w,t-1} + d_{ws}$. The Lemma below states that under the Balance rule,
the difference in distance traveled, between the most-recently-moved server and
all other servers, is bounded above and below by constants. See [3] for a proof of
this Lemma.
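
To compare the two rules concretely, here is a small simulation of Greedy and Balance on a 3-node graph. The edge weights are assumptions made for illustration: $d_{xy} = 4$ matches the shuttle cost quoted in the proof above, while $d_{xz} = d_{yz} = 6$ is a guess, since the weights in Figure 1 are not available here. The function name `run` and the starting positions are likewise hypothetical.

```python
def run(strategy, dist, servers, requests):
    """Serve the requests online and return the total distance travelled.

    strategy: "greedy" (nearest server) or "balance"
              (minimize distance travelled so far + distance to request).
    """
    pos = list(servers)              # current node of each server
    travelled = [0] * len(pos)       # distance travelled so far per server
    total = 0
    for s in requests:
        if s in pos:                 # already covered: cost 0
            continue
        if strategy == "greedy":
            key = lambda i: dist[pos[i]][s]
        else:
            key = lambda i: travelled[i] + dist[pos[i]][s]
        i = min(range(len(pos)), key=key)
        d = dist[pos[i]][s]
        travelled[i] += d
        total += d
        pos[i] = s
    return total

# Assumed weights (hypothetical): d_xy = 4, d_xz = d_yz = 6.
dist = {
    "x": {"x": 0, "y": 4, "z": 6},
    "y": {"x": 4, "y": 0, "z": 6},
    "z": {"x": 6, "y": 6, "z": 0},
}
requests = ["z", "x"] + ["y", "x"] * 10   # m = 22 requests
greedy_cost = run("greedy", dist, ["x", "z"], requests)
balance_cost = run("balance", dist, ["x", "z"], requests)
print(greedy_cost, balance_cost)
```

With these assumed weights and servers starting on x and z, Greedy pays 4 on each of the last $m - 2$ requests, while Balance moves the second server off z after one round trip and then pays nothing further.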


Lemma. At any time $t$, let $s$ be the most recently requested vertex, let $w$ be the vertex
left vacant by Balance, and let $l$ be any other covered vertex. Then

$$-d_{ls} \le D_{l,t} - D_{s,t} \le d_{lw} - d_{ws}.$$


Theorem. Balance is k-competitive for the k-server problem when $k = n - 1$.


Proof: We must show that for any $G$ there is a constant $a$ such that the following
bound holds:

$$\forall A, \forall \sigma, \quad C_B(\sigma) \le (n-1)\,C_A(\sigma) + a,$$

where A is an arbitrary algorithm for the k-server problem and B denotes the
Balance algorithm.

Note that when $k = n - 1$, exactly one vertex is vacant at any time. We assume
that the sequence $\sigma$ is hard for Balance: that is, $\sigma$ always forces Balance to move
by always requesting the vacant vertex. If Balance is k-competitive on hard
sequences then it is k-competitive on all sequences.

Now consider the situation just after $\sigma_t$ is covered. Let $x_t$ be the vertex left
vacant by A and let $y_t$ be the vertex left vacant by Balance. It suffices to exhibit a
residue function $R(x_t, y_t, \sigma_t, t)$, with $R(x_0, y_0, \sigma_0, 0)$ defined to be 0, for which two
properties hold. Property 1 is that for $0 \le t < m$,

$$R(x_{t+1}, y_{t+1}, \sigma_{t+1}, t+1) \le R(x_t, y_t, \sigma_t, t) + (n-1)\,C_A(\sigma_{t+1}) - C_B(\sigma_{t+1}).$$

Then

$$\sum_{t=1}^{m} \bigl[ R(x_t, y_t, \sigma_t, t) - R(x_{t-1}, y_{t-1}, \sigma_{t-1}, t-1) \bigr] \le \sum_{t=1}^{m} (n-1)\,C_A(\sigma_t) - \sum_{t=1}^{m} C_B(\sigma_t)$$

and therefore

$$R(x_m, y_m, \sigma_m, m) - R(x_0, y_0, \sigma_0, 0) \le (n-1)\,C_A(\sigma) - C_B(\sigma).$$

Property 2 is that there is a constant $-a$ (dependent on $G$) such that

$$-a \le R(x_m, y_m, \sigma_m, m).$$


With these two properties established, a little algebraic manipulation completes
the proof. The residue function represents a comparison of the service costs of
Balance and A, for arbitrary A. Properties 1 and 2 bound the residue function to
ensure that the total costs of Balance and A cannot get too far apart.
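
The algebraic manipulation can be spelled out as follows. This is our own filling-in of the step; it uses only the telescoped form of Property 1, Property 2, and the convention $R(x_0, y_0, \sigma_0, 0) = 0$ stated in the text.

```latex
% Rearranging the telescoped inequality and using R(x_0, y_0, \sigma_0, 0) = 0:
C_B(\sigma)
  \le (n-1)\,C_A(\sigma) - R(x_m, y_m, \sigma_m, m) + R(x_0, y_0, \sigma_0, 0)
  =   (n-1)\,C_A(\sigma) - R(x_m, y_m, \sigma_m, m)
  \le (n-1)\,C_A(\sigma) + a ,
% where the last step is Property 2:  -a \le R(x_m, y_m, \sigma_m, m).
```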

We now define $R$. To simplify notation, for fixed $t$ let $s = \sigma_t$, let $v = x_t$ be the
vertex left vacant by A after covering $s$, and let $w = y_t$ be the vertex left vacant by
Balance. After Balance moves, $D_{l,t}$ is the total distance traveled by the server
currently on node $l$. Let $D^{\Sigma}_t = \sum_l D_{l,t}$ be the total over all servers on covered
nodes. Then let

$$R(v, w, s, t) = \begin{cases} (n-1)(D_{s,t} - d_{ws}) - D^{\Sigma}_t & \text{if } v = w, \\ (n-1)\,D_{v,t} - D^{\Sigma}_t & \text{if } v \ne w, s. \end{cases}$$

Note that $R(s, w, s, t)$ is undefined since vertex $s$ cannot be left vacant by A.

Now consider the next request $\sigma_{t+1}$. Since $\sigma$ is assumed to be hard for Balance,
we know that $\sigma_{t+1} = w$. Balance moves a server from some vertex $k$ such that
$D_{k,t} + d_{kw}$ is minimized. The service cost for Balance is $C_B(\sigma_{t+1}) = d_{kw}$. After
covering $w$, A leaves some vertex $j$ vacant. The service cost is $C_A(\sigma_{t+1})$, equal to
either $d_{jw}$ or 0, depending on whether $w$ was previously covered by A.

To establish Property 1, we must show that for all $j, v$ where $R(j, k, w, t+1)$
and $R(v, w, s, t)$ are defined,

$$R(j, k, w, t+1) \le R(v, w, s, t) + (n-1)\,C_A(\sigma_{t+1}) - C_B(\sigma_{t+1}).$$
We proceed by case analysis using the following observations. 


1. After Balance covers $w$, $D_{w,t+1} = D_{k,t} + d_{kw}$. For all other covered nodes $l$,
$D_{l,t+1} = D_{l,t}$. Therefore $D^{\Sigma}_{t+1} = D^{\Sigma}_t + d_{kw}$.

2. If $v = w$, then $R(v, w, s, t) = (n-1)(D_{s,t} - d_{ws}) - D^{\Sigma}_t$ and $C_A(\sigma_{t+1}) = d_{jw}$.

3. If $v \ne w$, then $R(v, w, s, t) = (n-1)\,D_{v,t} - D^{\Sigma}_t$. Since $w$ is already covered
by A, we know that $C_A(\sigma_{t+1}) = 0$. Since A doesn’t move to cover $w$, we
know that $j = v$.

4. If $j = k$, then $R(j, k, w, t+1) = (n-1)(D_{w,t+1} - d_{kw}) - D^{\Sigma}_{t+1}$. If $j \ne k$,
then $R(j, k, w, t+1) = (n-1)\,D_{j,t+1} - D^{\Sigma}_{t+1}$. In either case, using observation 1
we obtain $R(j, k, w, t+1) = (n-1)\,D_{j,t} - D^{\Sigma}_t - d_{kw}$.


Case A. Suppose $v = w$. Then using observations 2 and 4, we must show that

$$(n-1)\,D_{j,t} - D^{\Sigma}_t - d_{kw} \le (n-1)(D_{s,t} - d_{ws}) - D^{\Sigma}_t + (n-1)\,d_{jw} - d_{kw}$$

or, cancelling terms, that

$$D_{j,t} - D_{s,t} \le d_{jw} - d_{ws}.$$

Application of the Lemma finishes the proof of Property 1 for this case.


Case B. Suppose $v \ne w$. Then using observations 3 and 4, we must show that

$$(n-1)\,D_{j,t} - D^{\Sigma}_t - d_{kw} \le (n-1)\,D_{v,t} - D^{\Sigma}_t - d_{kw}.$$

Since $j = v$ here the result is immediate.

Property 2 remains. Inspection of the definition of $R(v, w, s, t)$ shows that the
residue function has a constant lower bound for any $t$ (including $t = m$) if, for any
vertex $i$, $(n-1)\,D_{i,t} - D^{\Sigma}_t$ has a constant lower bound. This fact also follows from
the Lemma. (Note that $n$ is a constant here.) □


More about k servers. Variations on the k-server problem have many computa- 
tional applications besides coffee-serving robots. For example, most computer 
systems maintain a large secondary memory (on disks) and a much smaller main 
memory (on RAM chips). Programs and data can only be processed if they are
residing in main memory. Only k pages (or chunks) of information can be in main 
memory at a given time. This can be viewed as a k-server problem: the nodes 
represent all m pages in the system, and the servers mark the k pages that are 
currently in main memory. An edge weight corresponds to the cost of moving (or 
“swapping”’) a page between secondary and main memory; we usually assume that 
all weights are identical. A page request is determined by the currently running 
program, and the operating system must try to minimize the total number of swaps 
without knowing which pages will be needed in the future. 

It can be shown that if $k \ne n - 1$ then Balance is not k-competitive. A
2-competitive 2-server algorithm does exist; otherwise, for $2 < k < n - 1$ it is an
open problem to find a k-competitive algorithm for the k-server problem.

This problem was first described by Manasse, McGeoch, and Sleator [3] who 
proved all of the results mentioned in this column and conjectured that a 
k-competitive k-server algorithm must exist. The conjecture has prompted inten- 
sive research and led to several nice results about the nature of online computa- 
tion. Recent attention has focused on an algorithm described by Chrobak and 
Larmore [1]; Koutsoupias and Papadimitriou [2] have shown that that algorithm is 
(2k — 1)-competitive for the general k-server problem. 


ACKNOWLEDGMENT. Thanks go to Lyle McGeoch for several stimulating discussions regarding the 
k-server problem. 

Each generation has its few great mathematicians,
and mathematics would not even notice
the absence of the others. They are useful as 
teachers, and their research harms no one, but 
it is of no importance at all. A mathematician is 
great or he is nothing. 


—Alfred Adler 
“Mathematics and Creativity,”
The New Yorker Magazine, 
February 19, 1972. 

Edited by Abe Shenitzer
Mathematics, York University, North York, Ontario M3J 1P3, Canada 


On the Calculus of Variations and Its 
Major Influences on the Mathematics 
of the First Half of Our Century. 
Part II.* 


Erwin Kreyszig 


Note: The first part of this paper (sections 1-5) appeared in 1994, in the August- 
September issue of the Monthly (pp. 674-678). What follows is a short summary of 
the first part and the concluding part of the paper. 


Summary of Sections 1-5. The calculus of variations deals with the problem of 
determination of extrema of functionals given by integrals. It may be said to have 
begun with Johann Bernoulli’s brachystochrone problem of 1696. The birthyear of 
its theory is 1744, the year in which Euler published his Methodus inveniendi ... 
that included his necessary condition for a minimum and a splendid collection of 
problems. 

Lagrange went beyond Euler by inventing the “method of variations” and the
concept of the first variation $\delta J$ of a functional $J$; $\delta J$ is the analogue of the first
derivative of a function.

Both Euler and Lagrange contributed to the problem of minimal surfaces, an 
important geometric application of the calculus of variations. 

The discovery of sufficient conditions for an extremum of a functional was due
to Jacobi and Weierstrass. Legendre introduced the concept of the second variation
$\delta^2 J$ of a functional $J$, which is an analogue of the second derivative of a
function. Weierstrass rigorized the calculus of variations and introduced the
fundamental concepts of a field of extremals and of the E-function, “a turning
point in the history of the calculus of variations.”


6. IMPACT ON EARLY FUNCTIONAL ANALYSIS. The proverbial Weierstrassian
rigor (Felix Klein’s term of 1885) had a profound influence on the theories of 
functionals and function spaces of our century. In fact, near the end of the last 
century, the central role of functionals in the calculus of variations may very well 
have directed attention to functionals in general. This may have been a kind of 
subliminal influence that affected primarily the younger generation represented by 
Volterra and later by Fréchet and F. Riesz. 


*Abbreviated version of a paper with the same title. 
Five notes of 1887 by Volterra on special classes of functionals, investigated as
concepts per se, marked the birth of functional analysis. Influenced by Dini and 
Betti, the latter a close friend of Riemann’s, Volterra wanted to generalize complex 
analysis. His whole theory was based on the calculus of variations. 

Hadamard was enthusiastic about Volterra’s novel theory and contributed a
basic result that attained final form in a celebrated paper by F. Riesz. (Riesz’s
theorem states that every continuous linear functional $U[f]$, $f \in C[a, b]$, can be
expressed as a Stieltjes integral

$$U[f] = \int_a^b f(x)\,dw(x),$$

where $w(x)$ is determined by $U$ and is of bounded variation on $[a, b]$.) Hadamard
pointed to the variational-analytic roots of functional analysis: the calculus gave
rise to the calculus of variations, which in turn produced the functional calculus,
later called functional analysis.

Hadamard regarded the calculus of variations as 


“a first chapter of functional calculus, whose development will without doubt be one of the first 
tasks in the analysis of the future.” 


(This quotation is from Hadamard’s book on the calculus of variations published in 
1910.) 


7. DIRICHLET’S PRINCIPLE. Besides its direct impact on developing functional 
analysis from 1887 to about 1903 the calculus of variations also had an indirect 
and, in the long run, an even greater impact on (classical and) functional analysis, 
an impact which dates back to about 1870 and reached functional analysis shortly 
after the turn of the century. This involved partial differential equations, for which 
the development proceeded from the search for general solutions to solution 
formulas for boundary and initial value problems (Green, Poisson, Kirchhoff, etc.) 
and on to existence (and uniqueness) proofs, notably for Laplace’s equation for a 
function u of three variables 


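$$\Delta u = u_{xx} + u_{yy} + u_{zz} = 0 \tag{7.1}$$

(or in two variables $x, y$), which is basic in gravitation, electrostatics, stationary
heat conduction, and fluid flow. For the corresponding Dirichlet problem in a
general domain,

$$\Delta u = 0 \text{ in } \Omega; \qquad u|_{\partial\Omega} = f, \qquad u \in C^2(\Omega) \cap C^0(\bar{\Omega}), \qquad \Omega \subset \mathbb{R}^2 \text{ or } \mathbb{R}^3, \tag{7.2}$$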


a proof of existence of a solution u is not easy and attracted the efforts of many of 
the greatest mathematicians for quite some time. A (faulty) method of proof was 
provided by a principle taken from the calculus of variations. B. Riemann (1826-66) 
had first seen it in lectures of G. Lejeune Dirichlet (1805-59) in Berlin and named 
it after him: 


Dirichlet’s principle. There exists a function $u$ that minimizes the functional (the
so-called Dirichlet integral)

$$D[u] = \int_{\Omega} |\operatorname{grad} u|^2\,dV, \qquad \Omega \subset \mathbb{R}^2 \text{ or } \mathbb{R}^3, \tag{7.3}$$

among all functions $u \in C^1(\Omega) \cap C^0(\bar{\Omega})$ which take on given values $f$ on the
boundary $\partial\Omega$ of $\Omega$, and that function $u$ satisfies (7.2).

Note that Laplace’s equation is the Euler equation for (7.3).

We see that the Dirichlet integral is bounded below (by zero) and the claim of 
existence of a minimum is based on a conceptual error, namely the failure to 
distinguish greatest lower bound from minimum. 

Dirichlet’s principle was used earlier (in 1839) by C. F. Gauss (1777-1855) in his 
potential-theoretic investigations. Gauss claimed that if V is the potential of a 
mass distribution of density m on the surface S of a spatial region and U is a given 
function on S, then among all possible distributions there obviously exists one for 
which 


$$\int_S (V - 2U)\,m\,dS$$


takes its minimum. If this is granted, then one can show, as Gauss did, that: (i) for 
this minimizing distribution, V — U = const at all points of S that carry mass; (ii) if 
U = 0, there must be mass everywhere on S; (iii) one can obtain a distribution 
whose potential V on S equals U. Thus one could conclude the existence of the 
required harmonic function [a twice continuously differentiable solution of (7.1)] as 
well as its representation as the potential of a single layer of mass. 

After Gauss, Dirichlet’s principle was used in 1847 by W. Thomson (Lord 
Kelvin, 1824-1907) in order to “prove” the existence of a solution u of the 
differential equation 


$$(\alpha^2 u_x)_x + (\alpha^2 u_y)_y + (\alpha^2 u_z)_z = 4\pi\zeta \tag{7.4}$$

that vanishes at infinity. Here $\alpha$ and $\zeta$ are given functions and $\zeta$ is zero outside a
given bounded region. Because of this work, the principle is usually called
Thomson’s principle in England.

After attending Dirichlet’s lectures in Berlin, Riemann used the principle in his 
famous thesis of 1851 as a key tool for obtaining fundamental results on complex 
analytic functions from real potential theory. Since the principle does not always 
hold, some of Riemann’s proofs were not complete. All the results, however, 
turned out to be correct and were proved later by other methods. The same holds 
for Riemann’s later use of Dirichlet’s principle in his monumental paper on the 
theory of Abelian functions. 

It was a strange situation. Dirichlet’s principle had helped to produce exciting 
basic results but doubts about its validity began to appear, first in private remarks 
of Weierstrass—which did not impress Riemann, who placed no decisive value on 
the derivation of his existence theorems by Dirichlet’s principle—and then, after 
both Dirichlet and Riemann had died, in Weierstrass’s public address to the Berlin 
Academy: 


“From Dirichlet’s assumptions it can only be claimed that for (7.3) there exists a certain lower 
bound to which (7.3) can come arbitrarily close, without being forced to actually reach it. 
Dirichlet’s argument appears invalid.” 


8. ANOTHER IMPACT ON FUNCTIONAL ANALYSIS. The breakdown of 
Dirichlet’s principle had an enormous positive effect on analysis because it led to 
the creation of three ingenious new methods for obtaining existence proofs for the 
Dirichlet problem—by H. A. Schwarz (1843-1925), H. Poincaré (1854-1912), and 
C. Neumann (1832—1925)—as well as to the development of the direct methods of 
the calculus of variations initiated by D. Hilbert (1862-1943). 
C. Neumann’s method of the arithmetic mean (1870) was of great importance to
analysis and to functional analysis because it sparked work on spectral theory,
integral equations, and through it on Hilbert spaces. Neumann assumed the
solution to be the potential of a double layer, a layer of dipoles normal to the
boundary surface (or curve) $\partial\Omega$, of unknown density $\sigma(Q)$, $Q \in \partial\Omega$. Writing this
potential in the form

$$u(P) = \frac{1}{2\pi} \int_{\partial\Omega} \sigma(Q)\,\frac{\partial}{\partial\nu}\Bigl(\frac{1}{r}\Bigr)\,dS(Q), \qquad r = d(P, Q),\ Q \in \partial\Omega, \tag{8.1}$$

($\nu$ the outer normal of $\partial\Omega$) one obtains for the unknown density $\sigma$ the integral
equation

$$\sigma(Q) + \frac{1}{2\pi} \int_{\partial\Omega} \frac{\partial}{\partial\nu}\Bigl(\frac{1}{r^*}\Bigr)\,\sigma(Q^*)\,dS(Q^*) = \varphi(Q), \tag{8.2}$$

with $\varphi(Q)$ the given values of $u$ on $\partial\Omega$ and $r^* = d(Q, Q^*)$.


Because of the term $\sigma(Q)$, the operator form of this “integral equation of the
second kind” is

$$(I + K)\sigma = \varphi. \tag{8.3}$$

Its solution should obviously be

$$\sigma = (I + K)^{-1}\varphi = \varphi - K\varphi + K^2\varphi - K^3\varphi + \cdots. \tag{8.4}$$


Using this idea, Neumann was able to prove existence of a solution of the Dirichlet
problem by integral equation methods. He solved (8.2) by successive approximation,
defining $\sigma_0 = \varphi$ and

$$\sigma_n = -K\sigma_{n-1} = -\frac{1}{2\pi} \int_{\partial\Omega} \sigma_{n-1}\,\frac{\partial}{\partial\nu}\Bigl(\frac{1}{r}\Bigr)\,dS = (-K)^n\varphi.$$

This gave him the Neumann series (8.4) as the solution of the problem in a convex
domain in space or in the plane.
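
The successive-approximation scheme can be tried out on a finite-dimensional stand-in for the integral operator. The sketch below replaces $K$ by a small matrix with norm less than 1 (an assumption that guarantees convergence of the series; it is not Neumann’s kernel) and sums the terms $\sigma_n = -K\sigma_{n-1}$. The names `mat_vec` and `neumann_solve` are our own.

```python
def mat_vec(K, v):
    """Matrix-vector product for a matrix given as a list of rows."""
    return [sum(k * x for k, x in zip(row, v)) for row in K]

def neumann_solve(K, phi, tol=1e-12, max_terms=1000):
    """Approximate the solution of (I + K) sigma = phi by the series
    sigma = phi - K phi + K^2 phi - ..., built from sigma_n = -K sigma_{n-1}."""
    term = list(phi)     # sigma_0 = phi
    sigma = list(phi)    # running partial sum of the series
    for _ in range(max_terms):
        term = [-t for t in mat_vec(K, term)]          # next term (-K)^n phi
        sigma = [s + t for s, t in zip(sigma, term)]
        if max(abs(t) for t in term) < tol:
            break
    return sigma

# Toy data (hypothetical): a 2x2 "operator" with small entries.
K = [[0.2, 0.1],
     [0.0, 0.3]]
phi = [1.0, 2.0]

sigma = neumann_solve(K, phi)
# Residual of (I + K) sigma - phi; it should be close to zero.
residual = [s + ks - p for s, ks, p in zip(sigma, mat_vec(K, sigma), phi)]
print(sigma, residual)
```

If the entries of `K` are made large enough that its powers no longer shrink, the partial sums fail to settle, which mirrors why a convergence argument (here, for convex domains) is needed.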

We conclude this part of the section with a short list of events resulting from 
Neumann’s work. In 1888, Weierstrass’s former student du Bois-Reymond coined 
the term integral equations and expressed the desirability of a general theory of 
these equations, with which one could solve various problems such as that solved 
by Neumann. It did not take long for such theories to appear: by Le Roux (1895), Volterra
(1896), and, most famously, Fredholm (1900, 1903). Hilbert
“caught fire at once” (as H. Weyl put it) and developed his spectral theory of 
integral equations with symmetric kernel published in six Mitteilungen between 
1904 and 1910. Most important of these was the fourth Mitteilung, the earliest 
truly functional-analytic treatment of integral equations, in which he introduced 
continuous and compact (Hilbert said “completely continuous’) forms (cast into 
operator language by F. Riesz in 1913). 

After the breakdown of Dirichlet’s principle there was no more general princi- 
ple for handling various problems of applied mathematics. It seems that in that 
situation Hilbert first put his hope in the calculus of variations, which had 
produced general principles in the past. In Problem 23 of his famous talk of 1900 
on unsolved problems Hilbert had drawn attention to Weierstrass’s work and to 
A. Kneser’s book, the first presentation of the modern calculus of variations. Not 
intimidated by Weierstrass, he was able to re-establish the Dirichlet principle 
within proper limits as a valid method of proof. He did this in two papers of 1900 
and 1901 (reprinted 1905). In the first of these papers he proposed the following
more general formulation of Dirichlet’s principle. 


“Every regular problem of the calculus of variations [Sec. 3] has a solution as soon as suitable 
restrictive assumptions with respect to the nature of the given boundary conditions are satisfied 
and, if necessary, the concept of a solution is suitably generalized.” 


This approach gave rise to the direct methods (methods without the use of the 
Euler-Lagrange equations), which became of basic importance in the existence 
theory of the calculus of variations. (A forerunner of these methods was Euler’s 
almost forgotten “direct difference method”.) Another solution of the Dirichlet 
problem by direct methods was given later (in 1907) by Lebesgue. 

Apart from these splendid initial steps, Hilbert made no further attempts to 
uniformize analysis by methods of the calculus of variations. Instead, he turned to 
integral equations, perhaps as a more promising tool for the same purpose. But his 
idea of weakening the notion of solutions became a guiding principle in the 
calculus of variations of our century. 


9. PLATEAU’S PROBLEM. By Plateau’s problem one means the determination of
a simply connected portion of a minimal surface $S$ in $\mathbb{R}^3$ bounded by a given curve
in space. This problem is named after the Belgian physicist J. Plateau, who 
realized minimal surfaces experimentally by dipping wires (the boundary curves) 
into soap solutions, minimum area corresponding to minimum surface energy. 

Plateau’s problem has attracted great attention from around 1870 to the 
present. It is a problem genuinely belonging to the calculus of variations. The 
solution methods developed by a pleiad of researchers (Schwarz, Lebesgue, Korn, 
Bernstein, Haar, Garnier, Radó, Douglas, Courant, McShane) were interrelated 
with various branches of the mathematics of our century, which they fertilized 
immensely. 

From a more general viewpoint we can regard the evolution of the theory of 
minimal surfaces and of Plateau’s problem as particular cases of the development 
of the theory of partial differential equations. The first stage concerned special 
solutions of the minimal surface equation (special minimal surfaces), while the 
second stage dealt with general solutions (Weierstrass’s general solution formulas). 
The third stage, the solution of boundary value problems (Plateau’s problem), 
began in 1867 with Schwarz’s work, slightly later than work on partial differential 
equations in general. For this work, Kelvin, Gauss, Riemann, and others had 
already switched from general solutions to the geometrically and physically more 
useful boundary and initial value problems. 


10. GLOBAL CALCULUS OF VARIATIONS (MORSE THEORY). We have seen 
that early functional analysis owed much to the calculus of variations, and that 
general topology (set-theoretic topology) developed along with it, in a process of 
mutual give-and-take that extended over the first three decades of our century. 
This led to the creation of modern nonlinear analysis in connection with partial 
differential equations, and even more in connection with the calculus of variations, 
resulting in the so-called calculus of variations in the large, or Morse theory, for 
short. 

In this calculus one is concerned with relations between properties of a “space” 
X (usually a topological space, often a Riemannian manifold) and a real-valued 
continuous function f defined on X. 

The beginnings of this fascinating theory are due to Poincaré, who began his 
work on periodic solutions of the differential equations of celestial mechanics in 
his thesis of 1879 and knew of “Morse inequalities” for a surface as early as 1885, 
seven years before Morse was born. 

The next stage of the development can perhaps best be seen from G. D. 
Birkhoff’s book on Dynamical Systems (1927). In it Birkhoff emphasized the 
growing importance of topology in the calculus of variations. He also mentioned 
his Ph.D. student M. Morse, who worked out his calculus of variations in the large in his 
book published by the AMS in 1934. “In the large” meant that Morse considered 
the whole manifold on which the variational problem was given and not just a 
small neighborhood of an extremal (“calculus of variations in the small”). In the 
Preface he commented: 


“Any problem which is nonlinear in character, which involves more than one coordinate system 
or more than one variable, or whose structure is initially defined in the large, is likely to require 
consideration of topology...” 


Around 1930 topology was developing rapidly and eliciting general interest. This 
was evidenced, for example, by the wealth of new results contained in Alexandroff- 
Hopf’s Topologie I of 1935 and by the papers presented at the Moscow Topology 
Congress of 1935. Thus it was just the right time for a marriage of the classical 
calculus of variations and topology, and Morse made ingenious use of the latter. 

Define a critical point of a smooth function f on a smooth manifold M to be a 
point p at which 

\[ \operatorname{grad} f = 0. \]

Morse classified the critical points in terms of the eigenvalues of the Hessian 
matrix 

\[ H(f) = \left[ \frac{\partial^2 f}{\partial x_i \, \partial x_j} \right]_p \tag{10.1} \]

and obtained topological lower bounds (in terms of Betti numbers) for the number 
of critical points. These are the famous Morse inequalities. This work applied to 
functions. From functions, Morse proceeded to functionals, essentially n-dimen- 
sional analogs of our integral (2.1) and generalizations. Now for functions the 
topology at that time was sufficient. For functionals, that is, functions on a space of 
curves, Morse had to develop a topology in function spaces. Instead of critical 
points he had critical curves. He described this intuitively in his talk of 1932 at the 
Zurich International Congress. From it, one gains the impression that in Morse’s 
theory the calculus of variations and topology had developed multiple relations. 
Incidentally, a similar theory was created simultaneously and independently by 
L. A. Lyusternik and L. Schnirelman. This shows that at certain times certain 
things are in the air, in the sense that problems which extend known settings in a 
natural way become accessible as soon as basic theories (topology in the present 
case) have been sufficiently developed. 
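
As a small illustration of the classification by Hessian eigenvalues, the following sketch counts critical points by index for one concrete function. The example function f(x, y) = cos x + cos y on the 2-torus is an assumption chosen here for illustration and does not come from the text; the index of a critical point is taken to be the number of negative Hessian eigenvalues, and the counts are compared with the Betti numbers 1, 2, 1 of the torus.

```python
import math

# Hypothetical example (not from the article): f(x, y) = cos x + cos y
# on the torus [0, 2*pi)^2. Its gradient (-sin x, -sin y) vanishes
# exactly where x and y are each 0 or pi.
CRITICAL_POINTS = [(x, y) for x in (0.0, math.pi) for y in (0.0, math.pi)]


def hessian_eigenvalues(x, y):
    """The Hessian of f is diagonal: diag(-cos x, -cos y)."""
    return (-math.cos(x), -math.cos(y))


def morse_index(x, y):
    """Number of negative Hessian eigenvalues at a critical point."""
    return sum(1 for ev in hessian_eigenvalues(x, y) if ev < 0)


def index_counts():
    """counts[k] = number of critical points of index k (k = 0, 1, 2)."""
    counts = [0, 0, 0]
    for x, y in CRITICAL_POINTS:
        counts[morse_index(x, y)] += 1
    return counts


TORUS_BETTI = [1, 2, 1]  # Betti numbers b_0, b_1, b_2 of the 2-torus

if __name__ == "__main__":
    for k, (c, b) in enumerate(zip(index_counts(), TORUS_BETTI)):
        print(f"index {k}: {c} critical point(s), Betti number {b}")
```

For this particular function the lower bounds are attained: one minimum, two saddles, and one maximum.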

Let me conclude with the following remark. We started with the calculus, 
sketched briefly how calculus evolved into the calculus of variations, and outlined 
the most important ways in which the calculus of variations accompanied, influ- 
enced, or even initiated progress in various parts of analysis, geometry, functional 
analysis and, finally, topology. The whole process was heterogeneous, but I hope 
that we have seen traces of some intrinsic logic of the development here and there. 
The presentation can perhaps help to bring about a better understanding of 
certain features of present-day mathematics. The calculus of variations seems to 
have had a profound effect on the general development of mathematics. I hope 
that readers will see this as an invitation to further research pertaining to details as 
well as to larger issues that call for a more profound study than that presented in 
these pages. 


I do hate sums. There is no greater 
mistake than to call arithmetic an exact 
science. There are permutations and aber- 
rations discernible to minds entirely noble 
like mine; subtle variations which ordinary 
accountants fail to discover; hidden laws 
of number which it requires a mind like 
mine to perceive. For instance, if you add 
a sum from the bottom up, and then from 
the top down, the result is always differ- 
ent. 


—Mrs. La Touche 
Mathematical Gazette, vol. 12. 


I have hardly ever known a mathematician who was capable of reasoning. 


Answer to Who are these mathematicians? 
(p. 896) 


Top, left to right: Pierre de Fermat (1601-1665), Christian Huygens 
(1629-1695), Jacob Bernoulli (1654-1705); Bottom, left to right: Joseph 
Louis Lagrange (1736-1813), Pierre Simon Marquis de Laplace (1749-1827). 

They shared a common mathematical interest in the Sharing Problem (see 
the article in this issue by Sobel and Frankowski on page 833). 


inside front cover. Please include solutions and relevant references. Three copies of 
all items needed to evaluate the problem should be sent. 

should be included if an acknowledgment is desired. 


The published solution is likely to be based on a solution that is complete and correct. 
Additional information, such as references to other appearances of the problem or its 
solution, is also welcome. 


An asterisk (*) after the number of a problem, or part of a problem, indicates that no 
solution is currently available. 


PROBLEMS 


10410. Proposed by Frank Schmidt, Arlington, VA. 

Let G be a finite group. Define a(G) to be the average order of an element of G. If 
G ≠ 1, can a(G) be an integer? 


10411. Proposed by Gord Sinnamon, University of Western Ontario, London, Ontario, Canada. 


Let R be the region inside the unit circle and above the line x + y = 1. Calculate 

\[ \iint_R \frac{1}{(\log x)^2 + (\log y)^2} \, \frac{dx \, dy}{xy}. \]


10412. Proposed by Donald A. Darling, Newport Beach, CA. 


Find necessary and sufficient conditions on a non-increasing sequence $a_1, a_2, \ldots$ of 
positive real numbers so that, if $b_1, b_2, \ldots$ is a non-increasing sequence with $b_k > a_k$ for 
infinitely many k, then $\sum b_k = \infty$. 


10413. Proposed by Mirel Mocanu, University of Craiova, Craiova, Romania. 


Four disjoint (except for boundary points) equilateral triangles of sides a, b, c and d are 
enclosed in a regular hexagon of unit side. 

(a) Prove that $3a + b + c + d \le 4\sqrt{3}$. 

(b) When is $3a + b + c + d = 4\sqrt{3}$? 

(c)* Prove or disprove that $a + b + c + d \le 2\sqrt{3}$. 


10414. Proposed by R. J. Simpson, Curtin University of Technology, Perth, Australia, and W. 
F. Smyth, McMaster University, Hamilton, Ontario, Canada. 


For a positive real number x, let 


C(x) = leak 


Fe =| ay | +L 


(a) Express C(x) in a form that requires only one evaluation of a square root. 
(b) Express F(x) in terms of C(x). 


and, for x > 1, let 


10415. Proposed by Edward Kitchen, Santa Monica, CA. 


Let A be a triangle whose centroid is at the origin. Choose k ∈ R, k > 1, and dilate 
one of the Napoleon triangles of A by a factor of —k and the other by a factor of k/(1 — k). 
Prove that A is (simultaneously) perspective with both dilated triangles. 


10416. Proposed by Kwang-Wu Chen (student), National Chung Cheng University, Chia-Yi, 
Taiwan, Republic of China. 


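The Bernoulli numbers $B_n$ ($n = 0, 1, 2, \ldots$) are defined by 

\[ \frac{t}{e^t - 1} = \sum_{n=0}^{\infty} B_n \frac{t^n}{n!}, \]

which converges for $|t| < 2\pi$. Also, for each nonnegative integer n, the Bernoulli 
polynomial $B_n(x)$ is defined by 

\[ B_n(x) = \sum_{k=0}^{n} \binom{n}{k} B_{n-k} x^k. \]

For integer $m \ge 1$ and arbitrary constants $\alpha$ and $\beta$, prove 

\[ \sum_{k=0}^{m} \binom{m}{k} B_k(\alpha) B_{m-k}(\beta) = -(m - 1) B_m(\alpha + \beta) + m(\alpha + \beta - 1) B_{m-1}(\alpha + \beta). \]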
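
Before attempting a proof of 10416, the identity can be spot-checked in exact rational arithmetic. The sketch below is only a numerical sanity check, not a proof. It assumes the convention $B_1 = -1/2$ that follows from the generating function $t/(e^t - 1)$, and the helper names are hypothetical.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(n_max):
    """Return [B_0, ..., B_{n_max}] with the convention B_1 = -1/2."""
    B = [Fraction(1)]
    for n in range(1, n_max + 1):
        # Standard recurrence: sum_{k=0}^{n} C(n+1, k) B_k = 0 for n >= 1.
        s = sum(comb(n + 1, k) * B[k] for k in range(n))
        B.append(-s / (n + 1))
    return B


def bernoulli_poly(n, x, B):
    """B_n(x) = sum_{k=0}^{n} C(n, k) B_{n-k} x^k."""
    return sum(comb(n, k) * B[n - k] * x**k for k in range(n + 1))


def identity_gap(m, alpha, beta, B):
    """Left side minus right side of the identity in Problem 10416."""
    lhs = sum(
        comb(m, k) * bernoulli_poly(k, alpha, B) * bernoulli_poly(m - k, beta, B)
        for k in range(m + 1)
    )
    s = alpha + beta
    rhs = -(m - 1) * bernoulli_poly(m, s, B) + m * (s - 1) * bernoulli_poly(m - 1, s, B)
    return lhs - rhs


if __name__ == "__main__":
    B = bernoulli_numbers(8)
    for m in range(1, 8):
        print(m, identity_gap(m, Fraction(1, 3), Fraction(5, 7), B))
```

Using `Fraction` avoids rounding error, so a zero gap at the sample points is an exact statement about those points.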


NOTES 


(10415) The outer Napoleon triangle of a triangle A is formed by building equilateral 
triangles on the outside of each edge of A and taking the centroids of these triangles as 
vertices. The so-called theorem of Napoleon states that this triangle is equilateral. The inner 
Napoleon triangle is obtained in the same way from equilateral triangles on the edges of A 
facing into A. More information about Napoleon triangles can be found in John E. Wetzel, 
“Converses of Napoleon’s theorem”, this MONTHLY, 99 (1992), 339-351. Two triangles are 
in perspective if the vertices can be labelled so that the lines joining corresponding vertices 
are concurrent. 


SOLUTIONS 


Uniform Convergence and Continuity of Complex Power Series 


6080 [1976, 205]. Proposed by R. N. Hevener Jr., University of South Carolina. 


A theorem of Abel states that if $\sum_{n=0}^{\infty} a_n z^n$ converges on the closed interval A, then 
(i) convergence is uniform on A, whence (ii) it determines a continuous function on A. Is 
either part of this theorem true if A denotes a closed disk instead of an interval? If we 
impose the additional hypothesis, trivially satisfied in Abel’s theorem, that the function be 
continuous on the boundary of A, is either part true? 


Solution by M. J. Pelling, Royal Air Force of Oman, Seeb, Sultanate of Oman. The answers 
are negative, except that if f is continuous on the boundary ∂A it also will be on A. 


Theorem 1. If $f(z) = \sum_{n=0}^{\infty} a_n z^n$ converges on $A : |z| \le 1$, then f need not be continuous 
on ∂A and, a fortiori, may not be continuous or uniformly convergent on A. 


Proof. Let 

\[ g(\theta) = \begin{cases} \exp\left( -\theta^{-1} \sin^2(\theta^{-1}) \right), & 0 < \theta \le \pi, \\ g(-\theta), & -\pi \le \theta < 0, \\ 0, & \theta = 0, \end{cases} \]

and let $a_n$ be the Fourier cosine coefficients of g. Then, by Dini’s test, $\sum_{n=0}^{\infty} a_n \cos n\theta$ 
converges to $g(\theta)$ for all $\theta \in [-\pi, \pi]$. Writing $\psi(\theta, t) = g(\theta + t) - g(\theta - t)$, the conjugate 
series partial sum 

\[ -\sum_{m=1}^{n} a_m \sin m\theta = (2\pi)^{-1} \int_0^{\pi} \psi(\theta, t) \cot(t/2) (1 - \cos nt) \, dt + o(1) \]

(by [1, 45-47]). This converges as $n \to \infty$ for all $\theta$ since $\psi(\theta, t)/t$ is $L(0, \pi)$ for all $\theta$. In 
fact, by the Riemann-Lebesgue lemma, 

\[ h(\theta) = -\sum_{n=1}^{\infty} a_n \sin n\theta = (2\pi)^{-1} \int_0^{\pi} [g(\theta + t) - g(\theta - t)] \cot(t/2) \, dt. \]

Hence $f(z) = \sum_{n=0}^{\infty} a_n z^n$ converges on A, with $f(e^{i\theta}) = g(\theta) - ih(\theta)$ on ∂A. But f is 
not continuous on ∂A at z = 1, since $g(\theta)$ is discontinuous at $\theta = 0$. 


Theorem 2. If $f(z) = \sum_{n=0}^{\infty} a_n z^n$ converges on $A : |z| \le 1$, then f is continuous on A if 
and only if it is continuous on ∂A. However, f may be continuous but not uniformly convergent 
on ∂A, so may be continuous yet not uniformly convergent on A. 


Proof. Trivially continuity on A implies continuity on ∂A. Conversely, if f is continuous 
on ∂A and $f(e^{i\theta}) = \sum_{n=0}^{\infty} a_n e^{in\theta}$, then the latter series is the Fourier series of $f(e^{i\theta})$: see 
theorem 100 on p. 91 of [1]. If $P(r, \theta)$ denotes the Poisson kernel 

\[ \frac{1 - r^2}{2\pi (1 - 2r \cos\theta + r^2)} = (2\pi)^{-1} \left( 1 + 2 \sum_{n=1}^{\infty} r^n \cos n\theta \right), \qquad 0 \le r < 1, \]

then direct calculation shows 

\[ f(z) = \sum_{n=0}^{\infty} a_n z^n = \sum_{n=0}^{\infty} a_n r^n e^{in\theta} = \int_0^{2\pi} P(r, \theta - t) f(e^{it}) \, dt \]

for r = |z| < 1. Interchange of the order of summation and integration is valid here 
by absolute convergence of the series expansion of $P(r, \theta)$ and boundedness of $f(e^{it})$. 
It follows by the Proposition on p. 930 of [2] that if $f_r(\theta) = f(re^{i\theta})$ then the family 
$\{ f_r : 0 \le r < 1 \}$ is equicontinuous at each $\theta$. But by Abel’s theorem $f(re^{i\theta})$ is continuous 
in r for $0 \le r \le 1$ and each $\theta$ and it follows that $f(z) = f(re^{i\theta})$ is 2-dimensionally 
continuous at each boundary point $z = e^{i\theta}$. Hence f(z) is continuous on A. 

Finally, theorem 1.17 on p. 301 of [3] constructs a power series $f(z) = \sum_{n=0}^{\infty} c_n z^n$ 
regular for |z| < 1, continuous for $|z| \le 1$, and convergent on |z| = 1 but nonuniformly on 
every arc of |z| = 1. 

In this context it may be of interest to note that theorem 1.14 on p. 300 of [3] constructs 
a power series $\sum_{n=0}^{\infty} c_n z^n$ regular in |z| < 1 which is the restriction to |z| < 1 of a 
continuous function f(z) on $|z| \le 1$, but for which $\sum c_n$ diverges. In this example, if 
$s(n, z) = \sum_{m=0}^{n} c_m z^m$, then there is a subsequence $s(n_k, z)$ of these partial sums of the 
power series such that $s(n_k, z) \to f(z)$ as $k \to \infty$ uniformly in $|z| \le 1$. 


REFERENCES 


1. G. H. Hardy and W. W. Rogosinski, Fourier Series, Cambridge Tract 38, Cambridge University Press, 1968. 
2. J. Kral and W. F. Pfeffer, “Poisson integrals of Riemann integrable functions”, this MONTHLY 98 (1991), 929-931. 
3. A. Zygmund, Trigonometric Series, vol. 1, second ed., Cambridge University Press, 1959. 


There is no record of any other solution being received. 


Comparing Sums of Numbers with Equal Products 


6667 [1991,766]. Proposed by George Baloglou and Phil Tracy, State University of New York, 
College at Oswego. 


If $a_1, a_2, \ldots, a_n, b_1, b_2, \ldots, b_n$ are positive numbers such that 
$a_1 a_2 \cdots a_n = b_1 b_2 \cdots b_n$ and 

\[ \sum_{1 \le i < j \le n} |a_i - a_j| \le \sum_{1 \le i < j \le n} |b_i - b_j|, \]

(i)* prove that 

\[ \sum_{i=1}^{n} a_i \le (n - 1) \sum_{i=1}^{n} b_i, \]

and 
(ii) show that the factor n − 1 cannot be replaced by a smaller one. 


Solution of (i) by Eugene A. Herman, Grinnell College, Grinnell, IA. The following proof 
uses only the geometric-arithmetic mean inequality. By reordering subscripts, we may 
assume that 

\[ 0 < a_1 \le a_2 \le \cdots \le a_n, \qquad 0 < b_1 \le b_2 \le \cdots \le b_n. \tag{1} \]

Furthermore, the problem is unchanged if every $a_i$ and $b_i$ is multiplied by a positive constant 
k. If we choose $k = \left( \sum_{i=1}^{n} b_i \right)^{-1}$, the problem becomes the following: From assumptions 
(1), (2), (3), and (4), where 

\[ \prod_{i=1}^{n} a_i = \prod_{i=1}^{n} b_i, \tag{2} \]

\[ \sum_{i=1}^{n} (2i - n - 1) a_i \le \sum_{i=1}^{n} (2i - n - 1) b_i, \tag{3} \]

\[ \sum_{i=1}^{n} b_i = 1, \tag{4} \]

prove 

\[ \sum_{i=1}^{n} a_i \le n - 1. \tag{5} \]

Clearly, n > 1 was intended, since the result is false for n = 1. Also, the proof is routine 
when n = 2; so henceforth we assume $n \ge 3$ and prove (5) with strict inequality. Using the 
facts that $a_{n+1-i} - a_i \ge 0$ when $2i \le n + 1$ and $(2i - n - 1) b_i \le 0$ when $2i \le n + 1$, we 
deduce from inequality (3) that 

\[ (n - 1)(a_n - a_1) \le \sum_{i \ge (n+2)/2}^{n} (2i - n - 1) b_i. \tag{3'} \]

The conclusion will follow if (3) is replaced by the weaker inequality (3'). From inequality 
(3') and equality (4), we have 

\[ a_n - a_1 \le 1 - \sum_{i=1}^{n} b_i + \sum_{i \ge (n+2)/2} \frac{2i - n - 1}{n - 1} b_i = 1 - \left( \sum_{i \le (n+1)/2} b_i + \sum_{i \ge (n+2)/2} \frac{2(n - i)}{n - 1} b_i \right). \tag{6} \]

From (1) and equality (2), we have 

\[ a_1^{n-1} a_n \le \prod_{i=1}^{n} b_i \le \prod_{i=1}^{n-1} b_i \quad (\text{since } b_n \le 1) \]

\[ = \prod_{i \le (n+1)/2} b_i \prod_{i \ge (n+2)/2}^{n-1} \frac{2(n - i)}{n - 1} b_i \prod_{i \ge (n+2)/2}^{n-1} \frac{n - 1}{2(n - i)} \]

\[ \le \left( \prod_{i \le (n+1)/2} b_i \right) \left( \prod_{i \ge (n+2)/2}^{n-1} \frac{2(n - i)}{n - 1} b_i \right) (n - 1)^k, \]

where k = (n − 2)/2 when n is even and (n − 3)/2 when n is odd. Therefore, by the 
geometric-arithmetic mean inequality, 

\[ \frac{a_1^{n-1} a_n}{(n - 1)^k} \le \left[ \frac{1}{n - 1} \left( \sum_{i \le (n+1)/2} b_i + \sum_{i \ge (n+2)/2} \frac{2(n - i)}{n - 1} b_i \right) \right]^{n-1}. \]

Hence, by inequality (6), 

\[ \frac{a_1^{n-1} a_n}{(n - 1)^k} \le \left( \frac{1 - a_n + a_1}{n - 1} \right)^{n-1}, \]

or 

\[ a_1 \left[ \left( \frac{a_n}{(n - 1)^k} \right)^{1/(n-1)} (n - 1) - 1 \right] \le 1 - a_n. \tag{7} \]

If $a_n \le (n - 1)/n$, then $\sum_{i=1}^{n} a_i \le n a_n \le n - 1$. Thus we may assume that $a_n > (n - 1)/n$, 
which implies 

\[ \left( \frac{a_n}{(n - 1)^k} \right)^{1/(n-1)} (n - 1) > \left( \frac{(n - 1)^{n-k}}{n} \right)^{1/(n-1)} > 1. \]

So we see that in (7) the expression in square brackets is positive and hence, by inequality 
(7), that $1 - a_n > 0$. Since $\sum_{i=1}^{n} a_i \le a_1 + (n - 1) a_n$, the conclusion (5) holds with strict 
inequality if 

\[ \frac{1 - a_n}{\left( a_n/(n - 1)^k \right)^{1/(n-1)} (n - 1) - 1} < (n - 1)(1 - a_n) \]

(again using (7)), which we may rewrite successively as follows: 

\[ \frac{1}{n - 1} < \left( \frac{a_n}{(n - 1)^k} \right)^{1/(n-1)} (n - 1) - 1, \]

\[ \left( \frac{n}{(n - 1)^2} \right)^{n-1} < \frac{a_n}{(n - 1)^k}, \]

\[ \frac{n^{n-1}}{(n - 1)^{2n-2-k}} < a_n. \tag{8} \]

But the assumption $a_n > (n - 1)/n$ implies 

\[ \frac{(n - 1)^{2n-2-k} a_n}{n^{n-1}} > \frac{(n - 1)^{2n-1-k}}{n^n}. \]

Thus to show that (8) holds, it suffices to prove that 

\[ n^n \le (n - 1)^{2n-1-k}. \]

This is easily confirmed when n = 3. When $n \ge 4$, $n \le (n - 1)^{3/2}$ and so 

\[ n^n \le (n - 1)^{3n/2} \le (n - 1)^{2n-1-k}. \]

This completes the proof of part (i). 


Solution of (ii) by the editors. Take $a_1 = N^{-(n-1)}$ and $a_2 = a_3 = \cdots = a_n = N$ for N 
large and positive; also $b_1 = b_2 = \cdots = b_{n-1} = (N + 1)^{-1/(n-1)}$ and $b_n = N + 1$. Then 

\[ \sum_{1 \le i < j \le n} |a_i - a_j| < (n - 1) N < \sum_{1 \le i < j \le n} |b_i - b_j|, \]

so that the hypotheses of part (i) are satisfied; but also 

\[ \sum_{i=1}^{n} a_i > (n - 1) N \]

and, for N sufficiently large, 

\[ \sum_{i=1}^{n} b_i < N + 2, \]

so that 

\[ \sum_{i=1}^{n} a_i > \frac{(n - 1) N}{N + 2} \sum_{i=1}^{n} b_i. \]

Letting N grow large shows that the constant n − 1 in part (i) cannot be improved. 
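
The editors' extremal family can be explored numerically. The following sketch is a floating-point illustration only; the function names are hypothetical. It builds the two sequences for given n and N, checks that the products agree and that the hypothesis on pairwise differences holds, and returns the ratio of the two sums, which should approach n − 1 from below as N grows.

```python
import math
from itertools import combinations


def pairwise_gap(xs):
    """Sum of |x_i - x_j| over all pairs i < j."""
    return sum(abs(x - y) for x, y in combinations(xs, 2))


def extremal_family(n, N):
    """The sequences a and b from the solution of part (ii)."""
    a = [float(N) ** -(n - 1)] + [float(N)] * (n - 1)
    b = [(N + 1.0) ** (-1.0 / (n - 1))] * (n - 1) + [N + 1.0]
    return a, b


def products_match(a, b):
    """Check the equal-products hypothesis up to rounding error."""
    return math.isclose(math.prod(a), math.prod(b), rel_tol=1e-9)


def sum_ratio(n, N):
    """Return (sum a_i) / (sum b_i) after checking both hypotheses."""
    a, b = extremal_family(n, N)
    assert products_match(a, b)
    assert pairwise_gap(a) <= pairwise_gap(b)
    return sum(a) / sum(b)


if __name__ == "__main__":
    for N in (10, 10**3, 10**6):
        print(N, sum_ratio(4, N))
```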


Solved also by the proposers. 


Reducing a matrix to zero by S E X changes 


10216 [1992, 362]. Proposed by G. Bennett, Indiana University, Bloomington, IN. 


Let $A = (a_{i,j})$ be an m by n matrix with integer entries. A set of locations, H, in A is 
called an “echelon” if, whenever $(k, l) \in H$, $i \le k$ and $j \le l$, one has $(i, j) \in H$. Consider 
the family of operations 

$S_{i,j}$: subtract 1 from $a_{i,j}$; add 1 to $a_{i+1,j}$ 
$E_{i,j}$: subtract 1 from $a_{i,j}$; add 1 to $a_{i,j+1}$ 
$X_{i,j}$: subtract 1 from $a_{i,j}$ 

(for all values of i and j for which the operations can be defined). Show that there is a 
sequence of these operations reducing A to the zero matrix if and only if 

\[ \sum \{ a_{i,j} : (i, j) \in H \} \ge 0 \]

for every echelon H. 


Solution by Richard Stong, Rice University, Houston, TX. Let $f(H) = \sum_{(i,j) \in H} a_{i,j}$, 
and let (*) be the condition that $f(H) \ge 0$ for every echelon H. Since no operation 
increases f(H), (*) is necessary. To see that (*) is sufficient, note that each operation 
lowers $\sum_H f(H)$, where the sum is over all echelons $H \subseteq A$. Also, if (*) holds, then the 
sum is a nonnegative integer and vanishes only if A is the zero matrix. Hence it suffices to 
show that some operation can be performed without destroying (*) whenever A is nonzero 
and satisfies (*). 

Suppose (*) holds. Call H a zero echelon if f(H) = 0. If $H_1$ and $H_2$ are zero echelons, 

\[ 0 = f(H_1) + f(H_2) = f(H_1 \cap H_2) + f(H_1 \cup H_2). \]

Both terms on the right are nonnegative, hence both are zero. In other words, the set of zero 
echelons is closed under intersection and union. 

Let $a_{i,j}$ be a positive entry of A. If (i, j) is not contained in any zero echelon, then 
we can apply $X_{i,j}$. Otherwise, let H be the smallest zero echelon containing (i, j), i.e., 
the intersection of all zero echelons containing (i, j). One of the locations (i + 1, j) and 
(i, j + 1) must be in H, since otherwise $H' = H - \{(i, j)\}$ is an echelon with $f(H') = 
f(H) - a_{i,j} < 0$. By symmetry, we may assume $(i + 1, j) \in H$. Now $S_{i,j}$ can be performed 
to preserve (*) unless there is a zero echelon $H_0$ that contains (i, j) but not (i + 1, j). In 
that case $H \cap H_0$ would be a zero echelon containing (i, j), contradicting the minimality of 
H. 
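
For small matrices the criterion of Problem 10216 can be tested by brute force. The sketch below is an illustration with hypothetical helper names, not an efficient algorithm. It relies on the observation that an echelon is determined by non-increasing row lengths: row r contributes its first $l_r$ entries, with $l_1 \ge l_2 \ge \cdots \ge l_m$. It enumerates all such shapes and checks that every echelon sum is nonnegative.

```python
from itertools import product


def echelons(m, n):
    """Yield each echelon of an m-by-n grid as row lengths l_1 >= ... >= l_m."""
    for lengths in product(range(n + 1), repeat=m):
        if all(lengths[r] >= lengths[r + 1] for r in range(m - 1)):
            yield lengths


def echelon_sum(A, lengths):
    """Sum of the entries of A lying in the echelon with the given row lengths."""
    return sum(sum(row[:length]) for row, length in zip(A, lengths))


def satisfies_criterion(A):
    """True when every echelon sum of A is nonnegative (condition (*))."""
    m, n = len(A), len(A[0])
    return all(echelon_sum(A, e) >= 0 for e in echelons(m, n))


if __name__ == "__main__":
    # One E-move at position (1, 1) reduces this matrix to zero.
    print(satisfies_criterion([[1, -1], [0, 0]]))
    # The top-left entry alone is an echelon with negative sum.
    print(satisfies_criterion([[-1, 1], [0, 0]]))
```

The number of echelons grows quickly with m and n, so this enumeration is only practical for small examples.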


Solved also by P. J. Anderson (Canada), D. M. Bloom, D. Callan, R. J. Chapman (U. K.), T. Hesterberg, S. Kanetkar, 
K. S. Kedlaya (student), O. P. Lossers (The Netherlands), M. Mocsy (Hungary), A. Nijenhuis, University of Wyoming 
Problem Circle, and the proposer. 


The Determinant of an LCM Matrix 


10232 [1992, 571]. Proposed by Serge Zakharov, Tumen State University, Tumen, Russia. 


Let $M_n$ be the n by n matrix whose (i, j)-entry is lcm(i, j). Evaluate $\det(M_n)$. 


Solution by Richard Holzsager, The American University, Washington, DC. One form for 
the answer is 

\[ \det(M_n) = n! \prod_p (1 - p)^{\lfloor n/p \rfloor}, \]

where the product is taken over all primes p. 

A function f defined on the positive integers is multiplicative if f(mn) = f(m)f(n) 
whenever m and n are relatively prime. It is a standard result (found in most texts on 
Elementary Number Theory) that if f is multiplicative, then so is the associated function g 
defined by $g(m) = \sum_{d \mid m} f(d)$. Let f be the multiplicative function defined by 
$f(p^r) = (1 - p)/p^r$ for each prime power $p^r$, and let g be its associated multiplicative function. 
Since $g(p^r) = 1 + (1 - p)/p + (1 - p)/p^2 + \cdots + (1 - p)/p^r = 1/p^r$, we conclude that 
g(m) = 1/m for each positive integer m. 

Now let N be the diagonal matrix diag(1, 2, ..., n), and let A be the n by n matrix 
whose (i, j)-entry is 1/gcd(i, j). Then $M_n = NAN$. Next let B be the n by n matrix 
whose (i, j)-entry is f(i) if i | j and 0 otherwise, and let C be the n by n matrix whose 
(i, j)-entry is 1 if j | i and 0 otherwise. Then the (i, j)-entry of the product CB is 

\[ \sum_{d=1}^{n} c_{id} b_{dj} = \sum_{d \mid i} b_{dj} = \sum_{d \mid \gcd(i, j)} f(d) = g(\gcd(i, j)) = 1/\gcd(i, j). \]

Thus CB = A. 

We conclude that $\det(M_n) = \det(N)^2 \det(B) \det(C)$. Since the matrices N, B, and C 
are respectively diagonal, upper triangular, and lower triangular, we have det(N) = n!, 
$\det(B) = \prod_{i=1}^{n} f(i)$, and det(C) = 1. Thus $\det(M_n) = (n!)^2 \prod_{i=1}^{n} f(i)$. 

Finally, define the function h by h(m) = m f(m), so $\det(M_n) = n! \prod_{i=1}^{n} h(i)$. Now h 
is multiplicative, with $h(p^r) = 1 - p$ for each prime power $p^r$. Thus $h(m) = \prod (1 - p)$, 
where the product is taken over all primes dividing m. This yields the formula claimed, 
because for each prime p there are $\lfloor n/p \rfloor$ multiples of p in the interval [1, n]. 
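
The closed form can be compared with a directly computed determinant for small n. The sketch below is an independent check with hypothetical helper names, not part of the published solution. It evaluates the determinant of the LCM matrix exactly by Gaussian elimination over the rationals and compares it with $n! \prod_p (1 - p)^{\lfloor n/p \rfloor}$.

```python
from fractions import Fraction
from math import factorial, gcd


def det_exact(M):
    """Determinant by Gaussian elimination over the rationals."""
    A = [[Fraction(x) for x in row] for row in M]
    n, det = len(A), Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if A[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            A[c], A[pivot] = A[pivot], A[c]
            det = -det  # a row swap flips the sign
        det *= A[c][c]
        for r in range(c + 1, n):
            factor = A[r][c] / A[c][c]
            for k in range(c, n):
                A[r][k] -= factor * A[c][k]
    return det


def lcm_matrix(n):
    """The n-by-n matrix whose (i, j)-entry is lcm(i, j)."""
    return [[i * j // gcd(i, j) for j in range(1, n + 1)] for i in range(1, n + 1)]


def closed_form(n):
    """n! times the product over primes p <= n of (1 - p)^floor(n/p)."""
    value = factorial(n)
    for p in range(2, n + 1):
        if all(p % q for q in range(2, int(p ** 0.5) + 1)):  # p is prime
            value *= (1 - p) ** (n // p)
    return value


if __name__ == "__main__":
    for n in range(1, 9):
        print(n, det_exact(lcm_matrix(n)), closed_form(n))
```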


Editorial comment. This determinant appears frequently in the literature, including the 
textbook, I. Niven and H. S. Zuckerman, An Introduction to the Theory of Numbers (3rd ed.), 
Wiley, 1972. In this edition, Theorem 3 on p. 266 gives the result in the form 
$\det(M_n) = \prod_{k=1}^{n} \varphi(k) \prod_{p \mid k} (-p)$. Several readers traced the determinant to H. J. S. Smith, “On the 
value of a certain arithmetic determinant”, Proc. London Math. Soc. 7 (1875-76), 208-212 
(or Collected Works II, 161-165). It also appears in works that extend the theory of such 
matrices, such as S. J. Beslin, “Reciprocal GCD matrices and LCM matrices”, Fibonacci 
Quarterly 29 (1991), 271-274 and K. Bourque & S. Ligh, “On GCD and LCM matrices”, 
Linear Algebra and Appl. 174(1992), 65-74. 

Solved also by D. Alvis, D. W. Bailey, K. Bourque, D. Callan, S. R. Cavior, R. J. Chapman (U. K.), J. Duncan & 
M. Maliakas, P. Haukkanen (Finland), R. H. Jeurissen (The Netherlands), S. Kanetkar, N. Kang (student, Korea), S. P. 
Kishore & S. V. Singh (India), N. Komanda, J. H. Lindsey II, O. P. Lossers (The Netherlands), R. Martin (student), 


R. A. Mena, M. Mocsy (Hungary), A. Nijenhuis, J. Oaks, P. Schaefer, H. J. Seiffert (Germany), A. Yanushka, J. Zurek 
(France), USA Mathematical Olympiad Program, University of Wyoming Problem Circle, and the proposer. 


A Stability Criterion 


10271 [1992, 958]. Proposed by Victor I. Kostin, Institute of Mathematics, Novosibirsk, Russia. 


Let A be a skew-hermitian N by N matrix with N distinct eigenvalues. Let b be a column 
vector with nonzero projections on each eigenvector of A. Prove that all eigenvalues of the 
(N + 1) by (N + 1) matrix 

\[ \begin{pmatrix} A & b \\ -b^* & -1 \end{pmatrix} \]

have negative real parts. 


Solution by Dario Fasino, Università degli studi di Udine, Udine, Italy. A more general 
result holds. 


Proposition. Let M be a complex matrix such that: 
1. its hermitian part $H_1 = (M + M^*)/2$ is negative semidefinite; 
2. if a vector v belongs to the kernel of $H_1$, then v is not an eigenvector of $H_2 = (M - M^*)/2$, 
the skew-hermitian part of M. 
Then, every eigenvalue of M has negative real part. 


Proof. Let $\lambda$ be any eigenvalue of M and v one of its eigenvectors: $Mv = (H_1 + H_2)v = \lambda v$. 
Since $H_1 = H_1^*$ and $H_2 = -H_2^*$, there exist real numbers $c_1$ and $c_2$ such that $v^* H_1 v = c_1 v^* v$ 
and $v^* H_2 v = i c_2 v^* v$. Then $\lambda = v^* M v/(v^* v) = c_1 + i c_2$ and $\Re(\lambda) = c_1 \le 0$ by 
assumption 1. On the other hand, if $c_1 = 0$, then $H_1 v = 0$ and $Mv = H_2 v = \lambda v$, which is 
in contradiction with assumption 2. Thus $\Re(\lambda) < 0$. 


We can apply this proposition to the given matrix $M = \begin{pmatrix} A & b \\ -b^* & -1 \end{pmatrix}$; indeed, in this case, 
$H_1 = \begin{pmatrix} 0 & 0 \\ 0 & -1 \end{pmatrix}$, which is negative semidefinite. Moreover, let $v = (v_1, \ldots, v_N, 0)^T$ be 
any nonzero vector in the kernel of $H_1$ and let $u = (v_1, \ldots, v_N)^T$. Since $H_2 = \begin{pmatrix} A & b \\ -b^* & 0 \end{pmatrix}$, 
$H_2 v = \lambda v$ holds iff the relations $Au = \lambda u$ and $-b^* u = 0$ are both true, but $b^* u$ is not zero 
whenever u is an eigenvector of A. Hence v is not an eigenvector of $H_2$. The hypothesis 
that the eigenvalues of A are distinct is not needed since, whenever A has an eigenspace of 
dimension greater than one, the required condition on the projections of b cannot hold. 


Editorial comment. An obvious misprint in the original statement has been corrected in 
the above statement of the problem. A generalization, as in the selected solution, was also 
given by Michael K. Kinyon, C. R. Rosentrater, and the University of Wyoming Problem 
Circle. 

Solved also by R. J. Chapman (U. K.), D. Jespersen, M. K. Kinyon, M. Marcus, C. R. Rosentrater, F. Schmidt, GCHQ 


Problem Solving Group (U. K.), University of Wyoming Problem Circle, and the proposer. One incomplete solution 
was received. 


Convex Functions on Convex Sets 


10283 [1993, 184]. Proposed by Feng Luo and Richard Stong, University of California, Los 
Angeles, CA. 


Let D be a convex polygonal region in the plane and let f be a bounded convex (and 
hence continuous) function on the interior of D. 

(a) Show that f extends to a continuous function on all of D. 

(b) Show that the analogous result does not hold if D is the unit disk. 


Solution by John Rainwater, University of Washington, Seattle, WA. This problem was 
answered in David Gale, Victor Klee and R. T. Rockafellar, “Convex functions on convex 
polytopes”, Proc. Amer. Math. Soc. 19 (1968), 867-873, where it was shown (among more 
general results) that a bounded convex function on the relative interior of an n-dimensional 
polytope D has a unique extension to a continuous convex function on D. Moreover, this 
property characterizes polytopes among bounded convex sets. This answers both parts of 
the problem. A concrete example for the unit disk $D_1 = \{ (x, y) : x^2 + y^2 < 1 \}$ is the 
function 

\[ f(x, y) = \frac{y^2}{1 - |x|} \qquad (|x| < 1). \]

This is convex since it is the maximum of $y^2/(1 - x)$ and $y^2/(1 + x)$, each of which 
has a positive semidefinite Hessian, hence is convex. However, $f(x, y) < 2$ on $D_1$ with 
$f(x, y) \to 2$ as $(x, y) \to (1, 0)$ along the circle, while $f(x, 0) = 0$ for |x| < 1. 
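
The two limiting behaviours of this example near (1, 0) are easy to see numerically. In the sketch below, the helper `near_circle` and its `shrink` parameter are hypothetical names introduced for illustration. The function is evaluated at points of the open disk just inside the unit circle and at points on the x-axis, both approaching (1, 0).

```python
import math


def f(x, y):
    """f(x, y) = y^2 / (1 - |x|), defined for |x| < 1."""
    return y * y / (1.0 - abs(x))


def near_circle(x, shrink=0.999):
    """A point of the open unit disk just inside the circle, above abscissa x."""
    return x, shrink * math.sqrt(1.0 - x * x)


if __name__ == "__main__":
    for x in (0.9, 0.99, 0.999, 0.999999):
        # First value stays near 2 * shrink^2; second value is exactly 0.
        print(x, f(*near_circle(x)), f(x, 0.0))
```

Along the near-circle path the value equals shrink² · (1 + x), which stays close to 2, while along the axis it is identically 0; no single boundary value at (1, 0) is compatible with both.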


Editorial comment. Dale Varberg and Wayne Roberts gave a related example, and 
remarked that composition with a rotation would lead to functions $f_\theta(x, y)$ with 
discontinuities at $\pm(\cos\theta, \sin\theta)$. Functions of the form 

\[ \sum_n 2^{-n} f_{\theta_n}(x, y) \]

could then be formed with a dense set of points of discontinuity on the boundary of $D_1$. 


Solved also by M. V. Bjelica (Yugoslavia), Y. Diao, H. von Eitzen (Germany), H. Hanche-Olsen, J. Kane, I. Kastanas, 
A. D. Melas (Greece), D. Varberg & W. Roberts, GCHQ Problem Solving Group (U. K.), and the proposers. 


Collaborating editors: David F Appleyard, Paul T. Bateman, Bruce C. Berndt, Duane 
M. Broline, Barry W. Brunson, Frank S. Cater, Gulbank D. Chakerian, Underwood Dudley, 
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. Griggs, 
Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, Daniel J. Kleitman, 
Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer, Stephen L. Portnoy, J. O. Shallit, John 
Henry Steelman, Kenneth B. Stolarsky, David E. Tepper, Douglas B. Tyler, Daniel Ullman, and 
William E. Watkins. 




For all their wealth of content, for all the 
sum of history and social institution in- 
vested in them, music, mathematics, and 
chess are resplendently useless (applied 
mathematics is a higher plumbing, a kind 
of music for the police band). They are 
metaphysically trivial, irresponsible. They 
refuse to relate outward, to take reality for 
arbiter. This is the source of their witch- 
ery. 
—G. Steiner 


REVIEWS 


Edited by Darrell Haile 
Indiana University, Bloomington IN 47405 


Fleeting Footsteps: Tracing the Conception of Arithmetic and Algebra in 
Ancient China. By Lam Lay Yong and Ang Tian Se. Singapore (World 
Scientific Publishing Co., Inc.) 1992. XVI + 199 pp. U.S. $24, £ 15. 


Reviewed by Frank Swetz 


The fascinating title of this book owes its origin to a remark/query posed by the 
nineteenth century mathematical historian Florian Cajori concerning the “Hindu- 
Arabic” numerals. Cajori was puzzled as to why scholars could not agree on the 
course of these numerals, “their fleeting footsteps as they migrated from land to 
land.” (Cajori, 1896). Lam Lay Yong, Professor of Mathematics at the National 
University of Singapore and Ang Tian Se, formerly Professor of Chinese Studies at 
the University of Malaya and now teaching at Edith Cowan University, Australia, 
attempt to trace the elusive origin of these “footsteps” back to ancient China. The 
authors offer and embellish the premise that the concept of the “Hindu-Arabic” 
numerals can be found in the rod numerals used by the early Chinese. In building 
their case, Lam and Ang discuss the structure (base 10, positional, etc.) and form 
of the rod numerals as well as describe the computational rod algorithms used to 
perform the four basic operations and extract numerical roots of numbers. Physical 
configurations of small rods laid out on a counting board served as a computing 
device while the written record of the rod configurations served as numerals. This 
system was highly efficient and eventually allowed for an operational designation 
of negative numbers, black rods for negative numbers and red rods for positive, 
and an extension to work with decimal fractions. Early Chinese computers also 
developed computational algorithms to perform what today would be recognized 
as algebraic procedures: solving a proportion by use of the “rule of three”, using 
“false position” to obtain a solution to a simple linear equation and solving a 
system of equations by using a tabular or matrix solution scheme. This discussion 
of rod arithmetic and algebraic procedures is primarily based on the contents of 
the fifth century mathematics text Sun Zi suanjing [The Mathematical Classic of 
Sun Zi]. A complete translation of this work is provided. Thus in the Fleeting 
Footsteps the authors have set themselves an ambitious and multifaceted agenda, 
namely to: provide a translation and presentation of the contents of Sun Zi 
suanjing; use the contents of this ancient classic to furnish an explanation of rod 
numeral arithmetic and algebra and then, on the basis of having established the 
mathematical superiority of rod numerals and computation, contend the case for a 
Chinese origin of the “Hindu-Arabic” numeral system. 

This book presents the first complete English-language translation of Sun Zi 
suanjing, one of the Ten Mathematical Manuals of ancient China (pp. 151-182). 
The dating of Master Sun’s classic is based on internal evidence and has been 
approximated to the years between 280 AD and 473 AD. Sun Zi’s specific identity 
also remains clouded and he is believed to have been a minor state official or a 
Buddhist monk. His brief mathematical text consists of a preface and three 
chapters in which he provides instructions for the use of computing rods, tables of 
weights and measures relevant to the time, and a collection of sixty-four mathemat- 
ical problems and their solution schemes. The problems reflect the needs of daily 
life in fifth-century China. The text’s intended audience was scholars and officials 
and not the “common people” as suggested by the authors (p. 127). Although, in
general, this early Chinese work remains unknown in the West, one of its problems 
has become famous and is widely recognized as the origin of the “Chinese 
Remainder Theorem” (Swetz, 1979). Now, this whole collection of intriguing 
problems, which in many ways is reminiscent of Alcuin of York’s Propositiones ad
acuendos juvenes (ca. 732), is available to a wide audience. This translation has
supplied mathematical historians and others who are non-readers of Chinese with 
a true treasure! For example, problem thirty-four of chapter two tests the numeri-
cal endurance of a computer by requiring “Now there is sighted 9 embankments
outside; each embankment has 9 trees; each tree has 9 branches; each branch has 
9 nests; each nest has 9 birds; each bird has 9 young birds; each young bird has 9 
feathers; each feather has 9 colors. Find the quantity of each.” It is found there 
are 43,046,721 colors (pp. 181-2). But there are also other problems more
representative of a particular Chinese mystical nature: “Now there is a pregnant
woman whose age is 29. If the gestation period is 9 months, determine the sex of
the unborn child”. Through a number manipulation based on Yin-Yang principles,
the child is found to be a male (p. 182). In their analysis of rod procedures and
operations, Lam and Ang rely extensively on the use of these problems. An 
appendix is provided associating each particular problem with its textual discussion 
within the book, a very valuable reference feature (pp. 183-84). Chapter eight of 
Fleeting Footsteps is devoted to a discussion of the economic, sociological and 
political aspects of the problem collection. These insights into Chinese life con- 
sider factors such as the use of barter, taxation rates and evidence of existing social 
unrest. This examination is an excellent feature of the book and I regret it was not 
longer and more detailed in its considerations. In brief, the presentation and 
discussion of the contents of Sun Zi suanjing is well done. 
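
Two of the problems mentioned above are easy to check by machine. The Python sketch below tabulates the successive powers of nine in the embankment problem and searches for the smallest number with prescribed remainders. The remainders used in the note that follows (2, 3 and 2 on division by 3, 5 and 7) are the form in which Sun Zi's remainder problem is usually quoted. They are not given in this review and should be read as an assumption, as should the function names.

```python
from math import prod


def nested_nines(levels=8):
    """Return [9**1, 9**2, ..., 9**levels], one count per level of the problem."""
    return [9 ** k for k in range(1, levels + 1)]


def smallest_with_remainders(congruences):
    """Smallest positive n with n % m == r for every (m, r), or None.

    A brute-force search, not the rod procedure of the text: it tries
    every candidate up to the product of the moduli, which is enough
    because any solution repeats with a period dividing that product.
    """
    limit = prod(m for m, _ in congruences)
    for n in range(1, limit + 1):
        if all(n % m == r for m, r in congruences):
            return n
    return None
```

Here `nested_nines()[-1]` is 43046721, the number of colors reported in the text, and `smallest_with_remainders([(3, 2), (5, 3), (7, 2)])` returns 23.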

The existence and use of computing rods in China can be traced back to the 
Warring States period of history (475-221 BC). These rods were made of wood, 
bone or even ivory, depending on the rank and privileges of their user. They were 
approximately 0.2 cm thick and 8 cm long. In the hands of a skilled calculator, they 
were a powerful computing tool, in many respects equivalent to the hand calcula- 
tor of today. The use of such rods dominated the Chinese computing scene up 
until the 17th century when they were replaced by the more egalitarian abacus. 
Discussions on rod algorithms and computing procedures occupy chapters three 
through five and present lucid explanations of the workings of rod arithmetic and 
simple algebra. This material provides the most extensive and detailed account of 
ancient Chinese rod mathematics available in the English language and opens 
another window of understanding for the linguistically limited reader. 

Thus, after recognizing some “footsteps” in the conceptualization of the
“Hindu-Arabic” numeral system, the senior author, Lam Lay Yong, proposes, on
the basis of the evidence examined, that the path of these “footsteps” originated in
China: “the Hindu-Arabic numeral system had its origins in the Chinese rod
numeral system” (p. 134). Of course this claim is not new. Alexander Wylie, one
of the first modern Western observers of the Chinese mathematical scene, noted
the possibility of a Chinese origin for the Hindu-Arabic numeral system (Wylie,
1897). The conjecture was echoed by early 20th century scholars (Fellows, 1921).
More recently, this claim has been made most succinctly in the works of Wang
Ling and Joseph Needham. In a 1958 paper delivered in Adelaide, Wang pre- 
sented a detailed case for a Sino origin of the “Hindu-Arabic” numerals and 
pointed to the strong possibility of westward transmission to India. Wang’s theory 
was further amplified in his collaborative work with Joseph Needham. Science and 
Civilization in China, vol. 3, devotes several pages (pp. 146-150) to this very issue 
and the phenomenon of “stimulus diffusion”. Needham’s work clearly indicates the
need for further research clarification as to the status of early Hindu mathematics 
and the possibility of cultural transmissions. It is exactly this research that must be 
undertaken to strengthen the claim for a Chinese genesis of our numeral system 
and, unfortunately, it is exactly this research that is lacking in Fleeting Footsteps. 
What was the status of ancient Indian mathematics during the Warring States 
period of Chinese history? How were the numerals used in ancient India? Could 
the Chinese have obtained their mathematical knowledge from India?—after all, 
Buddhism was an intellectual import from China’s western neighbor. These are 
some of the issues and questions that must be addressed in positing a claim of a 
Chinese origin for the “Hindu-Arabic” numeral system and they remain missing
footsteps in the path this book has taken. 

Despite the inability to develop and strengthen its major premise, Fleeting 
Footsteps is a valuable resource for understanding early Chinese mathematics. Its 
sections devoted to the Sun Zi suanjing and its problems and to rod calculations 
provide historical information and insights previously unknown to a large audience. 
Although the “Fleeting Footsteps” themselves remain elusive, the book that bears
this name is highly recommended for general reading and library acquisition.

General, P. The Gelfand Mathematical Semi-
nars, 1990-1992. Eds: L. Corwin, I. Gelfand, 
J. Lepowsky. Birkhauser, 1993, x + 235 pp, 
$59.50. [ISBN 0-8176-3689-7] 15 papers on 
miscellaneous topics. 


Reference, P, C. Russian-English, English- 
Russian Dictionary on Probability, Statistics, 
and Combinatorics. K.A. Borovkov. SIAM, 
1994, viii + 154 pp, $47.50 (P), with disk. 
[ISBN 0-89871-316-1] 

Reference, P. McGraw-Hill Dictionary of Sci- 
entific and Technical Terms, Fifth Edition. Sybil 
P. Parker. McGraw-Hill, 1994, xvii + 2242 pp, 
$110.50. [ISBN 0-07-042333-4] 


Mathematics Appreciation, T(13: 1, 2). Pat- 
terns in Mathematics: Problem Solving from 
Counting to Chaos. Jack R. McCown, Michael 
A. Sequeira. PWS, 1994, xx + 874 pp. [ISBN 
0-534-18786-2] “Math for liberal arts:” prob- 
lem solving, applications, history, exploration 
problems, motivational quotes. Covers sets, 
counting, probability, statistics, Markov chains, 
fractals, and logic. As in Sherlock Holmes nov- 
els, sections start with an “opening case.” HD 


Mathematics Appreciation, S**, L*. Build 
Your Own Polyhedra. Peter Hilton, Jean Peder- 
sen. Addison-Wesley, 1994, 175 pp, (P). [ISBN 
0-201-49096-X] A mix of origami and math- 
ematics. Thorough, detailed instructions for 
building many polyhedra. Illustrations and rel- 
evant geometry at high school level. Great for 
math ed courses, summer institutes. (1988 edi- 
tion, TR, January 1989.) DP 


Mathematics Appreciation, S**. Classic 
Math: History Topics for the Classroom. Art
Johnson. Dale Seymour Pub, 1994, vii +
172 pp, (P). [ISBN 0-86651-690-5] Wonder-
ful supplement for various courses. Five chap-
ters: Quote of the Week, Event of the Week, 
Historical Problems, History of Mathematical 
Symbols, History of Mathematical Terms. Each 
consists of several short sections aiming to stim- 
ulate discussion, convey human side of mathe- 
matics. Intended for grades 7-12. DP 


Recreational Mathematics, S(13). Speed 
Mathematics Simplified. Edward Stoddard. 
Dover, 1994, x + 271 pp, $6.95 (P). [ISBN 0- 
486-27887-5] Unaltered reprint of 1965 edi- 
tion. Explains tricks, many based on abacus 
principle, for rapid mental arithmetic. HD 


Elementary, P, L*. Algebra. I.M. Gelfand, 
A. Shen. Birkhauser, 1993, 153 pp, $24.50. 
[ISBN 0-8176-3737-0] Elementary algebra 
text for high school students in Gelfand’s Math- 
ematical School by Correspondence. Spare ex- 
position; challenging but intriguing problems— 
many with solutions. A pedagogical gem. AO 


Education, P. The Legacy of Hans Freuden- 
thal. Ed: Leen Streefland. Kluwer Academic, 
1993, 164 pp, $60. [ISBN 0-7923-2653-9] 
Reprinted from Educational Studies in Math- 
ematics 25 (1-2) 1993.


Education, P. Didactics of Mathematics as 
a Scientific Discipline. Eds: Rolf Biehler, et 
al. Math. Educ. Lib., V. 13. Kluwer Aca- 
demic, 1994, ix + 467 pp, $150. [ISBN 0-7923- 
2613-X] Papers on preparing mathematics for 
students, teacher education and research on 
teaching, interaction in the classroom, tech- 
nology and mathematics education, psychol-
ogy of mathematical thinking, differential di-
dactics, history and epistemology of mathemat-
ics and mathematics education, cultural framing 
of teaching and learning mathematics. 


Education, S(15-17). Algebra Experiments I: 
Exploring Linear Functions. Mary Jean Win- 
ter, Ronald J. Carlson. Addison-Wesley, 1993, 
105 pp, (P). [ISBN 0-201-81524-9] 17 exper- 
iments, each with teaching notes and lab pages. 
Stresses dependent vs. independent variables. 
Useful for pre- and in-service teachers. MW 


Education, S(13-16). Algebra Experiments
II: Exploring Nonlinear Functions. Ronald J. 
Carlson, Mary Jean Winter. Addison-Wesley, 
1993, 108 pp, (P). [ISBN 0-201-81525-7] 14 
experiments explore rational, exponential, and 
logarithmic functions. Includes programs for 
graphing calculators, Apple Basic. MW 


Education, S(15-17). Cooperative Informal 
Geometry. Wade H. Sherard III. Dale Seymour 
Pub, 1995, 92 pp, (P). [ISBN 0-86651-799-5] 
20 investigations, at middle school level, stress 
cooperative work groups, hands-on activities. 
With student worksheets, teacher notes. Useful 
as models for secondary teachers. MW 


Logic, T(13-14: 1), L. Logic, Sets, and Re-
cursion. Robert L. Causey. Ser. in Comp. Sci.
Jones & Bartlett, 1994, x + 405 pp, $49.95. 
[ISBN 0-86720-463-X] Careful, lucid intro- 
duction to propositional calculus, sets, rela- 
tions, functions, predicate calculus. Stresses 
proofs and understanding; aims to fill gap in 
background of CS (and math) students. RM 


Logic, P. The Structure of Relation Algebras 
Generated by Relativizations. Steven R. Givant. 
Contemp. Math., V. 156. AMS, 1994, xv + 
134 pp, $34 (P). [ISBN 0-8218-5177-2] 


Foundations, T(15-16: 1), L. Notes on Set 
Theory. Yiannis N. Moschovakis. Under- 
grad. Texts in Math. Springer-Verlag, 1994, 
xiv + 272 pp, $39. [ISBN 0-387-94180-0] A 
modern introduction. Covers interesting set- 
theoretic questions as well as role of set the- 
ory as foundation for mathematics: faithful rep- 
resentation of mathematical objects by struc- 
tured sets, domains and the fixed point theorem, 
Aczel’s antifounded universe. RM 


Graph Theory, T(13-14: 1), S, L. Introduc-
tion to Graph Theory. Richard J. Trudeau.
Dover, 1993, x + 209 pp, $7.95 (P). [ISBN 
0-486-67870-9] Nice, simple, enjoyable in- 
troduction to graph theory; an introduction to 
abstract mathematics for the “mathematically 
traumatized” (author’s comment). Reprint of 
1976 version published as Dots and Lines (TR’s, 
March 1977 and February 1979). RM

Combinatorics, L. Concrete Mathematics,
Second Edition. Ronald L. Graham, Donald E. 
Knuth, Oren Patashnik. Addison-Wesley, 1994, 
xiii + 657 pp, $51.75. [ISBN 0-201-55802-5] 
Major change is new section on Zeilberger’s 
extensions to Gosper’s algorithm for evaluating 
indefinite sums. (First Edition, TR, May 1990; 
Extended Review, October 1991.) AO 


Combinatorics, P. Investigations in Alge- 
braic Theory of Combinatorial Objects. Eds: 
I.A. Faradzev, et al. Math. & Its Applic., 
V. 84. Kluwer Academic, 1994, xi + 510 pp, 
$269. [ISBN 0-7923-1927-3] Survey and 
research papers on cellular rings, distance- 
transitive graphs, amalgams and diagram ge- 
ometries. 


Discrete Mathematics, T(13-14: 1), L. A Log-
ical Approach to Discrete Math. David Gries,
Fred B. Schneider. Texts & Mono. in Comp. 
Sci. Springer-Verlag, 1993, xvi + 497 pp, 
$44.95. [ISBN 0-387-94115-0]; Instructor’s 
Manual, iv + 311 pp, (P). Unconventional, rel- 
atively sophisticated. Includes 10 chapters on 
logic and its applications to computer science. 
Also covers set theory, induction, relations and 
functions, number theory, combinatorics, recur- 
rence relations, graphs. KES 


Linear Algebra, T(14-15). Linear Alge-
bra: Gateway to Mathematics. Robert Messer.
HarperCollins, 1993, x + 403 pp, $44. [ISBN 0- 
06-501728-5] Fairly conventional in outline 
and content; book’s novelty lies mainly in var- 
ious user-friendly features: informal remarks 
blocked off from the main text, two student 
projects for each chapter, numerous exercises, 
review exercises, chapter summaries. JS 


Linear Algebra, T(13-14: 1). Elemen-
tary Linear Algebra, Fifth Edition. Stanley
I. Grossman. Saunders College, 1994, xx 
+ 758 pp, $58.75. [ISBN 0-03-097354-6] 
Major changes include “calculator boxes”— 
instructions for matrix features on TI-85 and 
Casio fx-7700 GB; MATLAB tutorials and 
problems; a section on LU factorization. 130 
pages longer than Fourth Edition (TR, October 
1991), due mainly to MATLAB material. LC 


Group Theory, P. Geometric Group The- 
ory, Volume 2: Asymptotic Invariants of Infinite 
Groups, M. Gromov. Eds: Graham A. Niblo, 
Martin A. Roller. London Math. Soc. Lect. 
Note Ser., V. 182. Cambridge Univ Pr, 1993, 
vii + 295 pp, $34.95 (P). [ISBN 0-521-44680- 
5] An extended essay by Mikhael Gromov, 
partly based on a lecture at a 1991 symposium 
at Sussex University. 


Group Theory, T(17: 1). Fast Fourier Trans- 
forms. Michael Clausen, Ulrich Baum. Bib-
liographisches Institut & FA Brockhaus, 1993,
181 pp, DM 68. [ISBN 3-411-16361-5] De- 
sign and analysis of FFTs on finite groups; self- 
contained introduction to representation theory. 
Applications to graph theory, signal processing, 
statistics, data compression. TH


Algebra, S(18), P. Fundamental Structures of 
Algebra and Discrete Mathematics. Stephan 
Foldes. Wiley, 1994, xv + 344 pp, $64.95. 
[ISBN 0-471-57180-6] Treats various alge- 
braic structures, graphs, matroids, topological 
spaces, universal algebra, categories. Useful
mainly as a concise summary of major results— 
exercises, examples, and motivation are too lim- 
ited for beginners. JS 


Algebra, P. Three Papers on Algebras and 
Their Representations. V.N. Gerasimov, N.G. 
Nesterenko, A.I. Valitskas. Transl. Ser. 2, 
V. 156. AMS, 1993, x + 195 pp, $98. [ISBN 
0-8218-7503-5] 

Algebra, P. Universal Algebra, Algebraic 
Logic, and Databases. B. Plotkin. Math. and 
Its Applic., V. 272. Kluwer Academic, 1994, xv 
+ 438 pp, $189. [ISBN 0-7923-2665-2] Ap- 
plies universal algebra and category theory to 
computer science, in this case to algebraic as- 
pects of database theory. Employs many-sorted 
algebras and topoi as models; applies algebraic 
logic to the structure of databases (including 
those with fuzzy information). RM 


Algebra, P. Unconventional Lie Algebras. Ed: 
Dmitry Fuchs. Adv. in Soviet Math., V. 17. 
AMS, 1993, x + 216 pp, $105. [ISBN 0- 
8218-4121-1] 8 papers on representations and 
cohomology of Lie algebras that are infinite- 
dimensional, defined over fields of finite charac- 
teristic, Lie superalgebras, or quantum groups. 


Algebra, P. Representations of Algebras. Eds: 
Vlastimil Dlab, Helmut Lenzing. Canadian 
Math. Soc. Conf. Proc., V. 14. AMS, 1993,
xxv + 478 pp, $81 (P). [ISBN 0-8218-6019-4]
Proceedings of a 1992 conference at Carleton 
University. 

Calculus, S(13). Calculus: An Active Approach 
with Projects. Stephen Hilbert, et al. Wiley, 
1994, xi + 257 pp, $20.95 (P). [ISBN 0-471- 
00316-6] Activities and projects to promote 
students’ own development of calculus con- 
cepts. “Activities” are for in-class or homework 
use; 2-3 week team projects emphasize written 
presentation of results. Accompanies any cal- 
culus text; contains tear-out assignment pages. 
Instructor’s Guide available. JNC 

Calculus, T(13: 1-3). Calculus with Analytic 
Geometry, Alternate Fifth Edition. Roland E. 
Larson, Robert P. Hostetler, Bruce H. Edwards. 
DC Heath, 1994, xxii + 1243 pp [ISBN 0-669-
35336-1]. Calculus with Analytic Geometry,
Fifth Edition. 1994, xxxviii + 1256 pp. [ISBN 
0-669-35335-3] “Every portion... examined 
and revised in the spirit of [calculus] reform.” 
More use of technology, more emphasis on 
communication skills, new applications. Al- 
ternate Fifth Edition has “late trigonometry,” 
different treatment of limits, integral applica- 
tions, exponential and log functions, vectors. 
(Third Edition, TR, May 1986.) AO 


Calculus, T(13: 1-3), L. Calculus of a Sin- 
gle Variable. Thomas P. Dick, Charles M. Pat- 
ton. PWS, 1994, xxiv + 831 pp. [ISBN 0-534- 
93936-8] One of the “new” texts. Contents 
not too unstandard; informal approach. Re- 
orderings include early exponentials, paramet- 
ric curves as application of derivatives, differ- 
ential equations immediately after integration. 
Only antidifferentiation techniques are substi- 
tution and parts (partial fractions, trigonometric 
techniques, etc., in appendices). Contains such 
pleasant (and popular) morsels as splines and 
Bezier curves, Cantor set, iterative processes, 
a fixed-point theorem. Is “technology-aware,” 
but assumes no specific technology. KS 


Calculus, T(13: 1-3). Calculus, Second Edi- 
tion. Richard A. Hunt. HarperCollins, 1994, 
xviii + 1165 pp, $53.50. [ISBN 0-06-043046- 
X] Over 1700 new exercises: applications; 
conceptual and theoretical problems; graphing 
calculator exercises; writing exercises. AO 


Real Analysis, T(15). Elementary Classical 
Analysis, Second Edition. Jerrold E. Marsden, 
Michael J. Hoffman. WH Freeman, 1993, xiv 
+ 738 pp, $47.95. [ISBN 0-7167-2105-8] At- 
tractive option for a first course. Retains many 
attractive features of First Edition (TR, April 
1975)—intuitive presentation of key theorems, 
followed at chapter end by formal proofs; close 
ties to applications. Length may be exces- 
sive for some—the derivative first appears on 
page 327. AWR 


Differential Equations, S. Differential Equa- 
tions With Derive. David C. Arney. MathWare 
(604 E. Mumford Dr., Urbana, IL 61801), 1993, 
x + 284 pp, $23 (P). [ISBN 0-9623629-3-X] 
Examples and exercises that use Derive to solve 
or analyze ODE’s. SK 


Partial Differential Equations, P. Computa- 
tional Methods for Boundary and Interior Lay- 
ers in Several Dimensions. Ed: J.J.H. Miller. 
Boole Pr, 1991, vii + 225 pp, (P). [ISBN 0- 
906783-93-3; 0-906783-92-5] 10 papers sur- 
vey developments in theory and application of 
numerical solutions of multi-dimensional prob- 
lems with boundary and interior layers. 


Partial Differential Equations, P. Domain
Decomposition Methods in Science and Engi-
neering. Eds: Alfio Quarteroni, et al. Con-
temp. Math., V. 157. AMS, 1994, xxii + 484 pp, 
$75 (P). [ISBN 0-8218-5158-6] Proceedings 
of a 1992 conference in Como, Italy. 


Dynamical Systems, P. Topics in Ergodic The- 
ory. Ya. G. Sinai. Math. Ser., V. 44. Princeton 
Univ Pr, 1994, vii + 218 pp, $39.50. [ISBN 
0-691-03277-7] 


Numerical Analysis, T(15-16: 1, 2). Numeri-
cal Analysis. Vithal A. Patel. Saunders College,
1994, xvi + 652 pp, $44. [ISBN 0-03-098330-4] 
Methods and algorithms for scientific comput- 
ing: root finding, interpolation, differentiation, 
integration, systems of linear and nonlinear 
equations, linear algebra, ODEs, PDEs. Pseu- 
docode algorithms; paper-and-pencil, computer 
exercises for each section. AO 


Numerical Analysis, P. Parallelism in the Nu- 
merical Integration of Initial Value Problems. 
B.P. Sommeijer. CWI Tracts, V. 99. Cen- 
trum voor Wiskunde en Informatica, 1993, v 
+ 195 pp, Dfl. 50 (P). [ISBN 90-6196-431- 
8] An introduction and 6 (republished) tech- 
nical papers on parallel numerical methods for 
ODE’s (both nonstiff and stiff). 


Numerical Analysis, P. Applications of Ad- 
vanced Computational Methods for Boundary 
and Interior Layers. Ed: J.J.H. Miller. Boole Pr, 
1993, vii + 215 pp, (P). [ISBN 1-85748-002-3; 
1-85748-001-5] 9 papers illustrate new com- 
putational techniques for numerical solution of 
singularly perturbed problems. 


Operator Theory, P. Harmonic Analysis and 
Boundary Value Problems in the Complex Do- 
main. Mkhitar M. Djrbashian. Oper. Theory: 
Adv. & Applic., V. 65. Birkhauser, 1993, xiii + 
256 pp, $100. [ISBN 0-8176-2855-X] 


Functional Analysis, P. Multimedians in Met- 
ric and Normed Spaces. E.R. Verheul. CWI 
Tracts, V. 91. Centrum voor Wiskunde en In- 
formatica, 1993, x + 136 pp, Dfl. 40 (P). [ISBN 
90-6196-420-2] 

Functional Analysis, P. Semigroups of Linear 
and Nonlinear Operations and Applications. 
Eds: Gisèle Ruiz Goldstein, Jerome A. Gold-
stein. Kluwer Academic, 1993, 283 pp, $119.
[ISBN 0-7923-2560-5] Proceedings of a 1992
Curaçao conference/workshop.

Analysis, P. A Lie Algebraic Study of Some In- 
tegrable Systems Associated with Root Systems. 
J.K. Scholma. CWI Tracts, V. 96. Centrum 
voor Wiskunde en Informatica, 1993, 92 pp, 
Dfl. 30 (P). [ISBN 90-6196-426-1]

Analysis, P. Analysis of and on Uniformly Rec- 
tifiable Sets. Guy David, Stephen Semmes.
Math. Surveys & Mono., V. 38. AMS, 1993,
xii + 356 pp, $111. [ISBN 0-8218-1537-7] 


Analysis, P. Residue Currents and Bezout 
Identities. Carlos A. Berenstein, et al. Progress 
in Math., V. 114. Birkhauser, 1993, xi + 158 pp, 
$49.50. [ISBN 0-8176-2945-9] 


Algebraic Geometry, P. Recent Advances in 
Real Algebraic Geometry and Quadratic Forms. 
Eds: William B. Jacob, Tsit-Yuen Lam, Robert
O. Robson. Contemp. Math., V. 155. AMS, 
1994, viii + 404 pp, $57 (P). [ISBN 0-8218- 
5154-3] Papers from a 1990-91 program at 
the University of California, Berkeley, and from 
a 1991 special session at the AMS winter meet- 
ing in San Francisco. 


Differential Geometry, P. Lie-Cartan-Ehres- 
mann Theory. Robert Hermann. Interdisc.
Math., V. 28. Math Sci Pr, 1993, 283 pp, $95. 
[ISBN 0-915692-44-9] Three approaches to 
differential geometry laid out in the author’s 
unique style. Treats applications of geometric 
structures such as frame bundles, Lie structures, 
and variational calculus. Foreword contains au- 
thor’s opinions on current funding and state of 
mathematics in the U.S. DS 


Differential Geometry, P. Functions on Man- 
ifolds: Algebraic and Topological Aspects. V.V. 
Sharko. Transl. of Math. Mono., V. 131. AMS, 
1993, x + 193 pp, $98. [ISBN 0-8218-4578-0] 


Geometry, T*(15: 1), L. Topics in Geometry. 
Robert Bix. Academic Pr, 1994, x + 538 pp, 
$59.95. [ISBN 0-12-102740-6] For prospec- 
tive secondary teachers and other mathematics 
majors. Covers topics in, and emphasizes con- 
nections among, advanced Euclidean geometry, 
transformation geometry (including wallpaper 
groups), projective geometry, conic sections, 
hyperbolic and absolute geometry. Many ex- 
ercises of varying difficulty. JNC 


Geometry, S, P, L. A Sourcebook of Problems 
for Geometry Based Upon Industrial Design 
and Architectural Ornament. Mabel Sykes. 
Dale Seymour Pub, 364 pp, (P). [ISBN 0- 
86651-795-2] Brief historical comments and 
numerous exercises based on design analyses of 
many lovely geometric patterns. With updated 
bibliography; first published in 1912. JNC 


Algebraic Topology, P. Cobordisms and Spec- 
tral Sequences. V.V. Vershinin. Transl. of 
Math. Mono., V. 130. AMS, 1993, v + 97 pp, 
$62. [ISBN 0-8218-4582-9] 


Algebraic Topology, P. Topology and Rep- 
resentation Theory. Eds: Eric M. Friedlander, 
Mark E. Mahowald. Contemp. Math., V. 158. 
AMS, 1994, ix + 318 pp, $48 (P). [ISBN 0- 
8218-5165-9] Proceedings of a 1992 confer- 
ence at Northwestern University.

Algebraic Topology, T*(17: 1), L. Homology
Theory: An Introduction to Algebraic Topology, 
Second Edition. James W. Vick. Grad. Texts 
in Math., V. 145. Springer-Verlag, 1994, xiv 
+ 242 pp, $49. [ISBN 0-387-94126-6] New: 
an extensive chapter on covering spaces, pre- 
senting the lifting problem and using it to intro- 
duce the fundamental group. Stresses intuitive 
treatment of basic ideas. (First Edition, TR, 
August-September 1973.) DP 


Operations Research, T(15-17: 1-3), C. 
Operations Research: Applications and Algo- 
rithms, Third Edition. Wayne L. Winston. 
Duxbury Pr, 1994, xx + 1372 pp, with disk. 
[ISBN 0-534-20971-8] Comprehensive intro- 
ductory text; covers linear programming and 
other deterministic and probabilistic models. 
New edition has 200+ new problems, 6 case 
studies, rewritten chapter on nonlinear pro- 
gramming, and more emphasis on software (es- 
pecially structured modeling packages). Disk 
contains student editions of LINDO, GINO, and 
LINGO as well as data files. (Second Edition, 
TR, April 1991.) AO 


Control Theory, P. Lecture Notes in Con- 
trol and Information Sciences-191: Simultane- 
ous Stabilization of Linear Systems. Vincent 
Blondel. Springer-Verlag, 1994, xxi + 184 pp, 
$45 (P). [ISBN 0-387-19862-8] 

Control Theory, P. Lecture Notes in Control 
and Information Sciences-193: Variable Struc- 
ture and Lyapunov Control. Ed: Alan S.I. Zi- 
nober. Springer-Verlag, 1994, xxii + 401 pp, 
$78 (P). [ISBN 0-387-19869-5] 18 papers on 
theory and applications. 

Control Theory, P. Lecture Notes in Control 
and Information Sciences-192: The Modeling 
of Uncertainty in Control Systems. Eds: Roy S. 
Smith, Mohammed Dahleh. Springer-Verlag, 
1994, xv + 391 pp, $79 (P). [ISBN 0-387- 
19870-9] 31 essays and technical papers from 
a 1992 workshop at the University of California, 
Santa Barbara. 


Control Theory, P. Lecture Notes in Con- 
trol and Information Sciences-188: Aeroas- 
sisted Orbital Transfer: Guidance and Control 
Strategies. D. Subbaram Naidu. Springer- 
Verlag, 1994, xiii + 178 pp, $44 (P). [ISBN 
0-387-19819-9] 

Systems Theory, T(17: 1). The Elements of 
System Design. Amer A. Hassan, et al. Aca- 
demic Pr, 1994, x + 280 pp, $54.95. [ISBN 
0-12-343060-7] Practical approach to design- 
ing large-scale digital systems. Two premises: 
(1) engineering education slights tradeoffs pos- 
sible between individual models and off-the- 
shelf components; (2) systems design should
integrate 4 main disciplines: signal processing,
communications, control, computation. RM 


Stochastic Processes, P. Sojourn Times in 
Feedback and Processor Sharing Queues. J.L. 
van den Berg. CWI Tracts, V. 97. Centrum 
voor Wiskunde en Informatica, 1993, 123 pp, 
Dfl. 40 (P). [ISBN 90-6196-427-X]


Elementary Statistics, T(13: 1). Statistics: 
An Introduction, Fourth Edition. Robert D. 
Mason, Douglas A. Lind, William G. Mar- 
chal. Saunders College, 1994, xvii + 790 pp, 
$53.25. [ISBN 0-03-096917-4] Traditional 
topics. Over 40% of text devoted to one- 
dimensional descriptive statistics and probabil- 
ity, 20% to basic statistical inference, the rest to 
one-way analysis of variance, simple and multi- 
ple regression and correlation, chi-square tests, 
nonparametric methods. RSK 


Statistics, P. Stochastic Integrals and 
Goodness-of-fit-Tests. A.J. Koning. CWI 
Tracts, V. 98. Centrum voor Wiskunde en Infor- 
matica, 1993, iv + 163 pp, Dfl. 50 (P). [ISBN 
90-6196-428-8] 

Statistics, P. Proceedings of the First US/Japan 
Conference on the Frontiers of Statistical Mod- 
eling: An Informal Approach, Volumes 1-3. Ed: 
H. Bozdogan. Kluwer Academic, 1994, $390 
set [ISBN 0-7923-2600-8]. Volume 1: Theory 
and Methodology of Time Series Analysis, xv 
+ 277 pp; Volume 2: Multivariate Statistical 
Modeling, xiii + 413 pp; Volume 3: Engineer- 
ing and Scientific Applications, xiii + 346 pp. 
42 papers from a 1992 conference in Knoxville, 
Tennessee. 


Statistics, P. The Chronological Annotated 
Bibliography of Order Statistics, Volume VIII: 
Indices, with a Supplement on 1970-1992. H. 
Leon Harter, N. Balakrishnan. Ser. in Math. & 
Management Sci., V. 24. American Sciences 
Pr, 1993, vii + 267 pp, $110 (P). [ISBN 0- 
935950-26-5] 

Programming, C. ObjectWindows for C++. 
Robert J. Traister. Academic Pr, 1993, xii + 
205 pp, $39.95 (P), with disk. [ISBN 0-12- 
697415-2] User-friendly introduction to cre- 
ating Windows applications that allow text- 
editing, mouse operations, graphics. MPR 


Computer Systems, P. SCO UNIX in a Nut- 
shell: A Desktop Quick Reference for SCO 
UNIX & Open Desktop. Ellis Cutler. O’Reilly
& Assoc, 1994, xix + 568 pp, $9.95 (P). [ISBN 
1-56592-037-6] 

Theory of Computation, P. The Graph Iso- 
morphism Problem: Its Structural Complexity. 
Johannes Köbler, Uwe Schöning, Jacobo Torán.
Prog. in Theoret. Comp. Sci. Birkhauser, 1993, 
160 pp, $34.50. [ISBN 0-8176-3680-3] 



Computer Science, P. Proceedings of the Fifth 
Annual ACM-SIAM Symposium on Discrete Al- 
gorithms. ACM & SIAM, 1994, 735 pp, (P). 
[ISBN 0-89871-329-3] 

Computer Science, P. Proceedings: 8th An- 
nual X Technical Conference. Ed: Adrian Nye. 
The X Resource: A Practical Journal of the X 
Window System (Issue 9). O’Reilly & Assoc, 
1994, 253 pp, $22.50 (P). [ISBN 1-56592-066- 
X] 

Computer Science, T*(12), C, L. Com- 
puter Science: An Overview, Fourth Edition. 
J. Glenn Brookshear. Ser. in Comp. Sci. 
Benjamin/Cummings, 1994, xii + 506 pp, 
$40.50 (P). [ISBN 0-8053-4627-9] Surveys 
the “big picture:” machine architecture, soft- 
ware, data organization, future directions. New 
topics include object-oriented programming 
and databases, networks, the OSI reference 
model. Lab manuals in Pascal and C. (1985 
text, TR, November 1987.) MPR 


Computer Science, T(14-15), S, C, L. The
Mathematica Programmer. Roman E. Maeder. 
Academic Pr, 1994, xv + 199 pp, $44.95 (P), 
with disk. [ISBN 0-12-464990-4] An intro- 
ductory computer science text with an empha- 
sis on object-oriented programming concepts. 
Also an introduction to Mathematica as a pro- 
gramming language. Therein lies a problem— 
the intended audience is not clear. Nonetheless, 
an interesting book. MPR 


Computer Science, P. The CWEB System of 
Structured Documentation, Version 3.0. Don- 
ald E. Knuth, Silvio Levy. Addison-Wesley, 
1994, 226 pp, (P). [ISBN 0-201-57569-8] 
User’s guide and reference manual for CWEB, 
a version of Knuth’s WEB system for “literate 
programming” adapted to C and C++. AO 


Computer Science, S(16-17), P*, L**. The 
Stanford GraphBase: A Platform for Combina- 
torial Computing. Donald E. Knuth. Addison- 
Wesley, 1993, viii + 575 pp. [ISBN 0-201- 
54275-7] A collection of “literate programs”
that present important algorithms and data 
structures (e.g., Dijkstra’s algorithm for short- 
est paths, Fibonacci heaps). Includes data for 
benchmarking combinatorial algorithms. AO 


Applications (Biological Science), P. Disease 
Dynamics. Alexander Asachenkov, et al. Sys-
tems & Control: Found. & Applic. Birkhäuser,
1994, xv + 316 pp, $79.50. [ISBN 0-8176-
3692-7] Summarizes 16 years of research in
theoretical immunology. A unified approach to
developing and using mathematical models
of immune response. AO


Applications (Economics), P. Computational 
Techniques for Econometrics and Economic 
Analysis. Ed: D.A. Belsley. Adv. in Com-
putat. Econ., V. 3. Kluwer Academic, 1994,
ix + 238 pp, $92. [ISBN 0-7923-2356-4] 13 
recent studies on uses of numerical techniques 
in economics and econometrics. 

Applications (Fluid Dynamics), T(17-18: 1), 
P. Multiphase Flow and Fluidization: Contin- 
uum and Kinetic Theory Descriptions. Dimitri 
Gidaspow. Academic Pr, 1994, xx + 467 pp, 
$69.50. [ISBN 0-12-282470-9] State-of-the- 
art treatment of multiphase flows in such engi- 
neering situations as boiling, condensation, sep- 
aration, and mixing. Aims to help reader under- 
stand and use computer tools newly available to 
handle previously intractable problems. DS 


Applications (Fluid Dynamics), P. Annual 
Review of Fluid Mechanics, Volume 26, 1994. 
Eds: John L. Lumley, Milton Van Dyke, Helen 
L. Reed. Annual Reviews, 1994, x + 704 pp, 
$47. [ISBN 0-8243-0726-7] 


Applications (Physics), P. Deformations 
of Mathematical Structures II. Ed: Julian 
Lawrynowicz. Kluwer Academic, 1994, x + 
464 pp, $199. [ISBN 0-7923-2576-1] Papers 
on Hurwitz-type structures and applications to 
surface physics. 

Applications (Physics), P. Wave Propaga- 
tion: Scattering Theory. Ed: M. Sh. Birman. 
Transl. Ser. 2, V. 157. AMS, 1993, x + 256 pp, 
$105. [ISBN 0-8218-7507-8] Papers treat 
wave propagation, scattering theory, and linear 
differential and pseudodifferential operators. 


Applications (Physics), P. Quantum Scatter- 
ing Theory for Several Particle Systems. L.D. 
Faddeev, S.P. Merkuriev. Math. Physics & 
Appl. Math., V. 11. Kluwer Academic, 1993, 
xiii + 404 pp, $180. [ISBN 0-7923-2414-5] 


Applications (Physics), P. Scattering in Quan- 
tum Field Theories: The Axiomatic and Con- 
structive Approaches. Daniel Iagolnitzer. Ser. 
in Physics. Princeton Univ Pr, 1993, xxi + 
290 pp, $49.50. [ISBN 0-691-08589-7] 


Applications (Physics), P. Operator Algebras, 
Mathematical Physics, and Low Dimensional 
Topology. Eds: Richard Herman, Betül Tanbay.
Res. Notes in Math., V. 5. AK Peters, 1993, 
324 pp, $39.95. [ISBN 1-56881-027-X] Con- 
tributed papers from a 1991 NATO Advanced 
Research Workshop in Istanbul, Turkey. 


Reviewers 


JNC: Judith N. Cederberg, St. Olaf; LC: Laura Chihara, 
St. Olaf; HD: Hung Dinh, Macalester; TH: Tom Halverson, 
Macalester; SK: Steve Kennedy, St. Olaf; RSK: Richard 
S. Kleber, St. Olaf; RM: Richard Molnar, Macalester; AO: 
Arnold Ostebee, St. Olaf; DP: David Peifer, St. Olaf; MPR: 
Matthew P. Richey, St. Olaf; AWR: A. Wayne Roberts, 
Macalester; KS: Karen Saxe, Macalester; JS: John Schue, 
Macalester; DS: Dan Schwalbe, Macalester; KES: Kay E. 
Smith, St. Olaf; MW: Martha Wallace, St. Olaf. 
NEW IN THE SPECTRUM SERIES


The Words of Mathematics 


An Etymological Dictionary of Mathematical Terms Used in English 


Steven Schwartzman 


The Words of Mathematics explains the origins 
of over 1500 mathematical terms used in English. 
While other dictionaries of mathematics define 
technical terms, this book concentrates on where 
those terms came from and what their literal 
meanings are. The words included here range 
from simple to advanced. Elementary school 
teachers may be surprised to learn that inch and
ounce are really the same word, that eleven means
literally “one left over,” that a thousand is a
“swollen hundred,” and that the original meaning
of times was “divide.” High school teachers will
find out that asymptote means “not falling to- 
gether,” that an area used to be a threshing floor, 
and that focus is a Latin word meaning “fire- 
place.” College teachers who want to explain 
heteroscedastic, amphicheiral, and eigenvalue to 
their students will find the origins of those words 
in this book. 

This dictionary is easy to use. Although some of 
the entries are highly technical, the book explains 
them in plain English. The introduction gives an 
overview of how the ancient language known as 
Indo-European developed into Latin, Greek, 
French, and English, the languages from which 
most of our mathematical vocabulary has been 
derived. Another section discusses the many ways 
in which mathematicians have borrowed and cre- 
ated their specialized vocabulary over the centu- 
ries. A glossary explains historical and linguistic 
terms used throughout the book. 

As in any dictionary, the entries themselves are 
arranged alphabetically. The words are drawn 
from arithmetic, algebra, geometry, trigonom- 
etry, calculus, number theory, topology, statis- 
tics, graph theory, logic, recreational mathemat- 
ics, and other areas. Over 200 illustrations accom- 
pany the dictionary entries, especially some of the 
less familiar ones. Connections to related 
nonmathematical English words are often pointed 
out. Key numbers attached to many entries lead 
interested readers to an appendix which groups 
mathematical terms that come from a common 
source. 

This dictionary is an indispensable reference for 
every library that serves teachers and students of 
mathematics. It is a natural source of information
for courses in the history of mathematics and for 
mathematics courses intended for liberal arts stu- 
dents. 

262 pp., Paperbound, 1994 

ISBN 0-88385-511-9 

List: $27.00 MAA Member: $21.00 

Catalog Number WORDS 


Total $ 
Payment: [ ] Check  [ ] VISA  [ ] MASTERCARD


Credit Card No. 
Signature 


Exp. Date 


Memorabilia Mathematica 


The Philomath’s 
Quotation Book 


Robert Edouard Moritz 


When Robert Edouard Moritz compiled his book 
of quotations, Memorabilia Mathematica, which 
appeared in 1914, he stated that his primary 
objective was to seek out the exact statement of and 
exact references for famous passages about math- 
ematics. He searched the writing not only of 
mathematicians, but poets, philosophers, histori- 
ans, statesmen, and scientists as well. His sources 
ranged from the works of Plato to the writings of 
Hilbert and Whitehead. His second objective was 
to produce a volume that would be a source of 
pleasure, encouragement, and inspiration to both 
mathematicians and non-mathematicians alike. 

This work was a ten-year labor of love, and it is 
a tribute to his discerning eye that this selection of 
passages should remain one of the most stimulat- 
ing works about mathematics ever published. It 
was the first collection of its kind in English and it 
conveys a sense of the full range of mathematics, its
enormous accomplishments, and the living per- 
sonalities of great mathematicians. 

The more than eleven-hundred fully anno- 
tated selections in this book, gathered from the 
works of three hundred authors, cover a vast range 
of subjects pertaining to mathematics. Grouped 
in twenty-one chapters, they deal with such topics 
as the definitions and objects of mathematics; the 
teaching of mathematics; mathematics as a lan- 
guage or as a fine art; the relationship of math- 
ematics to philosophy, to logic, or to science; the 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-(800) 331-1622 Fax (202) 265-2384 
nature of mathematics, and the value of mathemat-
ics. Other sections contain passages referring to
specific subjects in the field such as arithmetic,
algebra, geometry, calculus, and modern math-
ematics. Of special interest is the extensive amount
of material on great mathematicians which pro- 
vides irreplaceable glimpses into the lives and 
personalities of mathematical giants. 

To mathematicians the book will be a great 
source of pleasure, inspiration, and encourage- 
ment. To teachers of mathematics and writers 
about mathematics, it will remain of inestimable 
value as a source of quotations and ideas. To the 
layperson, it will be a revelation. It should dispel 
forever the narrow notion that mathematics is a 
cut-and-dried affair, isolated from other compart- 
ments of life and thought. 


440 pp., Paperbound, 1993 
ISBN 0-88385-321-3 


List: $24.00 MAA Member: $19.00 

Catalog Number: MEMO 

Qty Catalog Number Price 
Total $ 


Payment: [ ] Check  [ ] VISA  [ ] MASTERCARD
Credit Card No. 
Signature 


Exp. Date 


Research Issues in 
Undergraduate 


Mathematics Learning 
Preliminary Analyses and Reports 


James J. Kaput and 
Ed Dubinsky, Editors 


Research in undergraduate mathematics edu- 
cation is important for all college and university 
mathematicians. If our students are to be more 
successful in understanding mathematics, then 
college faculty need to understand how math-
ematics is learned. This knowledge can guide us
in curriculum reform and in improving our own 
teaching. It can help us make mathematics 
accessible to all students and it can increase the 
number of graduate students in mathematics. 

This volume of research in undergraduate 
mathematics education informs us about the 
nature of student learning in some of the most 
important topics in the undergraduate curricu-
lum: sets, functions, calculus, statistics, abstract
algebra, and problem solving. Paying careful
attention to the trouble students have in learn- 
ing mathematics will help us to work with stu- 
dents so they can deal with those difficulties. 

A survey of the literature begins the volume. 
Becker and Pence have brought together an 
unusually complete list of references on research 
in collegiate mathematics. Their comments will 
guide those attempting to begin or to continue 
a program of research in student learning. 

The sad fact that even good calculus students 
stumble over nonroutine problems is the theme 
of Selden, Selden, and Mason. Their conclu- 
sions point to significant shortcomings in the 
curriculum. This study of student difficulties is 
continued by Ferrini-Mundy and Graham, who
investigate a single student’s interactions with 
the fundamental concepts of the calculus. 
Baxter studies a group of students to learn how 
they acquire the concept of set, while Cuoco 
does the same for the concept of function. 

Cooperative learning does help the student. 
That is the conclusion of Bonsangue, who inves- 
tigates how two carefully matched classes of 
students in a Statistics course perform on exams. 
How students learn to write proofs in group 
theory is the subject considered by Hart. 
Rosamond breaks new ground by comparing 
how emotions vary in their effect on the prob-
lem solving ability of novices and experts.

All college faculty should read this book to 
find how they can help their students learn 
mathematics. 


150 pp., Paperbound, 1994 
ISBN 0-88385-090-7 

List: $24.00 

Catalog Number NTE-33 
FOUR TEXTS


I think that in retirement I have become a publisher. I am proud to offer a new
text by a master of the field:

Donald Sarason, Notes on Complex Function Theory (list price $17),
intended for a first, undergraduate course of one semester. This book fills a need for a
straightforward, correct, elementary presentation.

I continue to offer three books by myself: Linear Algebra (first published by
Holden-Day) (list price $20), which presents the standard syllabus simply enough for
ordinary junior-level courses, but without compromising mathematical integrity;
Honors Calculus (list price $24), addressed to freshmen who want to do calculus
right the first time; and Harmonic Analysis (first published by Addison-Wesley,
reprinted by Wadsworth) (list price $24), a graduate-level introduction to Fourier
Series and the wonderful part of analysis that depends on this venerable subject.


Send orders or requests for examination copies to Henry Helson, 15 The Cres-
cent, Berkeley, CA 94708; Tel (510) 848-8629; email helson@math.berkeley.edu


Visualization in Teaching and 
Learning Mathematics 


Walter Zimmermann and 
Steve Cunningham, Editors 


Buy this book. If you can’t buy it, have the library order
it. If the library won't order it, ask to borrow a copy from
a friend. But do read this book.

—The Mathematics Teacher 


High school, community college, and university teachers
who use or are interested in using graphics to teach
calculus, deductive reasoning, functions, geometry, or
statistics will find valuable ideas for teaching... A 
must for every college or university library with a 
mathematics department.—CHOICE 


The twenty papers in this book give an overview 
of research, analysis, practical experience, and 
informed opinion about the role of visualization 
in teaching and learning mathematics, espe- 
cially at the undergraduate level. Visualization 
in its broadest sense is as old as mathematics, but
progress in computer graphics has generated a
renaissance of interest in visual representations 
and visual thinking in mathematics. 


230 pp., Paperbound, 1991 
ISBN 0-88385-071-0 

List: $24.00 

Catalog Number NTE-19 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-(800) 331-1622 Fax (202) 265-2384 


LURE OF THE 
INTEGERS 


Joe Roberts 


A joy to read and ponder, this book is a welcome 
addition to the body of mathematical literature. It 
belongs in every mathematical library. 

—Journal of Recreational Mathematics 


Will enrich library collections serving curricula 
with theory of numbers courses. —Choice 


In some small way, this book is an introduction to 
a mythical book which might go under the name of 
The Book of Integers. This mythical book has on
page n all of the interesting properties of the 
integer n. This introduction stems from many 
years’ casual accumulation of numerical facts. 
Most of the material presented belongs to elemen- 
tary mathematics in the sense that no deep or 
profound mathematical background is required in 
order to understand what is said. Much of the 
material is drawn from the theory of numbers. 


Many of the topics touch on contemporary re- 
search and most of the results are stated without 
proof. As a general rule, one cannot tell from the 
statements of the results whether or not their 
proofs will be elementary. Indeed, this is a hall- 
mark of mathematics and is one of the things that 
gives the subject a special flavor and interest. 
Until one knows that expert practitioners have 
been unable to solve a problem, one does not 
know that the problem is difficult. Even then it may 
turn out that there is an easy solution. 
Some of the material will be familiar to people
having only a small acquaintance with mathemat- 
ics. Even in those cases, the author provides 
something new. On the other hand, much of the 
material is sufficiently out of the main stream of 
concern that even professional mathematicians 
may be unfamiliar with the results. The many 
references to the literature will almost always 
enable a reader to track down further information. 
In Lure of the Integers the author has presented 
a body of material which will prove interesting to 
the enlightened layman as well as to the profes- 
sional. 


300 pp., Paperbound, 1992 
ISBN 0-88385-502-X

List: $31.50 MAA Member: $21.50 
Catalog Number LURE 


ORDER FROM: 


The Mathematical Association of America 
1529 Eighteenth Street, NW 

Washington, DC 20036 

1-800-331-1622 Fax (202) 265-2384 


Foreign Orders Please add $3.00 per item ordered 
to cover postage and handling fees. The order will be 
sent via surface mail. If you want your order sent by 
air, we will be happy to send you a proforma invoice 
for your order. 


Total $ 
Payment: [ ] Check  [ ] VISA  [ ] MASTERCARD


Credit Card No. 


Signature Exp. Date 
DERIVE is a powerful
computer algebra system 
for doing symbolic and 
numeric mathematics on 
your personal computer. 


DERIVE:

* Performs numerical operations exactly with no round-off error
* Approximates irrational expressions to thousands of digits of precision
* Algebraically simplifies, expands, and factors expressions; and solves equations
* Applies the rules of trigonometry, calculus, matrix algebra, and vector calculus
* Plots explicitly and implicitly defined functions in 2D with zooming and auto-scaling
* Generates 3D wire-frame function plots using hidden-line removal
* Displays and prints expressions using standard 2D mathematical notation
* Provides an easy to use, menu-driven interface with on-line help
* Is ideal for students, teachers, engineers, scientists, and mathematicians

DERIVE is a registered trademark of Soft Warehouse, Inc.


Soft Warehouse


HONOLULU*HAWAII 


VERSION 3!

DERIVE Requirements 
(regular memory version): 

A PC compatible running
MS-DOS with 512K memory
and a 3½ inch (720K)
diskette drive or an HP 95LX,
100LX or 200LX palmtop
computer with 1M memory
and connectivity pack for
downloading. List $125.


DERIVE XM Requirements 
(extended memory version): 
A 386, 486 or Pentium®
based PC compatible
running MS-DOS version 3.0
or later with at least 2M of
extended memory and a 3½
inch (1.4M) diskette drive.
List $250.


Soft Warehouse, Inc.* 3660 Waialae Ave. 
COMMENTS


Assessment is sweeping the country these days. Almost every section of the 
MAA has discussions of assessment. Accreditation agencies mandate that
universities implement it. Deans direct departments to formulate plans. Chairs form 
committees to study the problem. And faculty discover that they've been doing 
assessment for years — they just haven't been doing it right. 

It's hard to be against something so obviously sensible. Assessment means 
setting goals for programs and courses, and then measuring how effectively the goals 
are met. What knowledge and skills should be emphasized in undergraduate 
education? Are students learning what we think we are teaching? Those are pretty 
sensible questions, and they ought to be asked. I thought they were. 

At a recent meeting, one person began his presentation by saying that the 
terminology and ideas of assessment may seem unfamiliar to mathematicians, but 
these were all ideas that had been in education schools for years. I wasn't comforted. 
Another speaker displayed a chart that showed current assessment practices in his 
college. All departments were listed vertically, and across the page were various 
kinds of assessment activities. Most departments had a few things checked. One had 
twice as many as the others — it was Education. They seem to be more advanced 
than the rest of us. 

In much of the midwest, assessment is driven by the North Central 
Association. Universities and colleges must put in place elaborate assessment plans 
or risk their accreditation. They hire outside assessment experts who run workshops
and train local assessment experts. I wonder where these people first learned to be 
experts? 

Part of my uneasiness about assessment is the lack of precision. Here are 
some phrases from the Handbook of Assessment that my university provides to all 
departments. "Assessment focuses on student learning outcomes." Learning 
outcomes? "Assessment is research and inquiry into the improvement of teaching 
and learning." What's it mean to inquire into improvement? "Assessment is a process 
in which goals and learning objectives of a program or course are identified and data 
are collected from multiple sources to document student, teacher, or program 
achievement of those goals and objectives." It seems strange that the same handbook 
advises us when creating instruments to "choose words that have precise meanings" 
and to "avoid complex or awkward syntax." What is an instrument anyway? 

A basic goal of assessment seems to be measuring the value added to 
students by attending our classes. This emphasizes the manufacturing approach to 
education. I wonder how one would assess a trip to the museum or a child's visit to 
the playground or a night at the symphony. What are the goals? Were the objectives 
met? I'd like to think that attending a university is closer to a trip to a museum or a 
playground than a trip along an assembly line. 

One of the benefits of assessment is the "added responsibility teachers feel 
for student learning." Of course, teachers ought to be responsible. But I wish I knew 
of some technique to make students feel more responsible for student learning. 


John Ewing 
The Rectilinear Crossing Number of a
Complete Graph and Sylvester’s “Four
Point Problem” of Geometric Probability


Edward R. Scheinerman and Herbert S. Wilf 


The chance of ...the quadrilateral formed by joining four points, taken arbi- 
trarily within any assigned boundary, constituting a reentrant or convex quadri- 
lateral, will serve as types of the class of questions in view. 


—J. J. Sylvester [11] 


We prove that two fundamental constants of the geometry of the plane are 
equal. 

First, if $R$ is an open set in the plane with finite Lebesgue measure, let $q(R)$
denote the probability that if four points are chosen independently uniformly at
random in $R$, then their convex hull is a quadrilateral. Let $q_*$ be the infimum of
$q(R)$ over all such sets $R$.

Second, let $\overline{\nu}(K_n)$ denote the rectilinear crossing number of the complete graph
on $n$ vertices, i.e., the minimum number of intersections in any drawing of $K_n$ in
the plane that has straight-line-segment edges. It is well known that
$\overline{\nu}(K_n)/\binom{n}{4}$ increases steadily to some limit $\overline{\nu}^*$ as $n \to \infty$.

Our main result is that $q_* = \overline{\nu}^*$.


FOUR RANDOM POINTS. Let R be an open set in the plane with finite area. As 
such, we can consider R to be a sample space from which we select points 
independently uniformly at random (i.u.a.r.). Choose four points from R i.u.a.r.
Then with probability 1, no three of the points are collinear, so the convex hull of 
the four points is either a triangle (one point in the convex hull of the other three) 
or a quadrilateral. J. J. Sylvester [11] asked, what is the probability that the points 
determine a convex quadrilateral? We denote this probability by q(R). 

How large and how small can $q(R)$ be? When $R$ is restricted to being a convex
set we have the following result (see Blaschke [1, 2] and also [7]). 


Theorem 1. Let $R$ be an open, convex subset of the plane, of finite area. Then

$$\frac{2}{3} \le q(R) \le 1 - \frac{35}{12\pi^2} \approx 0.704.$$

Further, both inequalities are sharp. The lower bound is attained by the interior of a
triangle (any triangle), and the upper bound by the interior of an ellipse (any ellipse). □
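As a rough numerical check of Theorem 1, the following Python sketch estimates $q(R)$ by simulation for a triangle and for a disk. It is an illustration only: the function names, the sample size, and the seed are assumptions made here and do not come from the paper. Four points are taken to form a quadrilateral when none of them lies inside the triangle spanned by the other three.

```python
import math
import random


def orient(a, b, c):
    """Twice the signed area of triangle abc (positive if counterclockwise)."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def in_triangle(p, a, b, c):
    """True if p lies strictly inside triangle abc."""
    d1, d2, d3 = orient(a, b, p), orient(b, c, p), orient(c, a, p)
    return (d1 > 0 and d2 > 0 and d3 > 0) or (d1 < 0 and d2 < 0 and d3 < 0)


def is_convex_quad(pts):
    """True if none of the four points lies inside the triangle of the other three."""
    for i in range(4):
        others = pts[:i] + pts[i + 1:]
        if in_triangle(pts[i], *others):
            return False
    return True


def sample_triangle(rng):
    """Uniform point in the triangle with vertices (0,0), (1,0), (0,1)."""
    x, y = rng.random(), rng.random()
    if x + y > 1:  # reflect the upper half of the unit square into the triangle
        x, y = 1 - x, 1 - y
    return (x, y)


def sample_disk(rng):
    """Uniform point in the unit disk (square-root radius gives uniform area)."""
    r = math.sqrt(rng.random())
    t = 2 * math.pi * rng.random()
    return (r * math.cos(t), r * math.sin(t))


def estimate_q(sampler, trials, seed=0):
    """Monte Carlo estimate of q(R) for the region described by sampler."""
    rng = random.Random(seed)
    hits = sum(is_convex_quad([sampler(rng) for _ in range(4)])
               for _ in range(trials))
    return hits / trials


if __name__ == "__main__":
    print("triangle:", estimate_q(sample_triangle, 20000))  # should be near 2/3
    print("disk:    ", estimate_q(sample_disk, 20000))      # should be near 0.704
```

With a few tens of thousands of trials the estimates should agree with the two bounds to about two decimal places; the exact digits depend on the seed.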


This theorem, however, does not fully address the issue of the extreme values of 
q(R) because it considers only convex regions R. It is easy to see that if we relax 
the convexity requirement, then the supremum of $q(R)$ is 1; let $R$ be a very thin
open annulus and observe that we can make $q(R)$ arbitrarily close to 1. Thus it
remains to consider the infimum of $q(R)$; let $q_* = \inf q(R)$ where the infimum is
over all open sets $R$ with finite area.

We show below that $q_*$ is positive and strictly less than 2/3 (the lowest possible
result for convex $R$). We show further that $q_*$ is closely related to the rectilinear
crossing number of complete graphs. 


RECTILINEAR CROSSING NUMBER OF A GRAPH. Let G be a graph which 
we wish to draw in the plane. If G is planar, then we can find an embedding in 
which the edges do not cross. A result of Fary [4] shows that we can choose this 
embedding so that the edges are noncrossing straight line segments. 

Let $\overline{\nu}(G)$ denote the minimum number of crossings in a straight line drawing of
$G$ in the plane; the parameter $\overline{\nu}(G)$ is known as the rectilinear crossing number of
$G$. (For background on the rectilinear crossing number, see [6], [7] or [12].)

An important open problem in the study of graph embeddings is to determine
the rectilinear crossing number of the complete graph $K_n$. For $n = 5, 6, 7, 8, 9$ the
values are known (see [12]) and they are 1, 3, 9, 19, 36 respectively. For $n = 10$ it is
known [10] that $61 \le \overline{\nu}(K_{10}) \le 62$.

If we place the $n$ vertices of $K_n$ on a circle, then the number of crossings is
exactly $\binom{n}{4}$; certainly we can do better, but $\overline{\nu}(K_n)$ is on the order of $n^4$ as we now
explain.


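Theorem 2. There exists a constant $\overline{\nu}^*$ such that $0 < \overline{\nu}^* < \infty$ and

$$\lim_{n \to \infty} \overline{\nu}(K_n) \Big/ \binom{n}{4} = \overline{\nu}^*.$$

(This is well-known folklore, but for completeness we show the proof here.)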


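Proof: Let $m < n$ and consider a straight line embedding of $K_n$ in the plane with
the minimum number of crossings, $\overline{\nu}(K_n)$. For each $m$-element subset $A$
of $V(K_n)$, let $c(A)$ denote the number of crossings in this embedding in which the
endpoints of the crossing edges are all in $A$. If we sum $c(A)$ over all $m$-subsets of
$V(K_n)$, we count each possible crossing exactly $\binom{n-4}{m-4}$ times. Thus

$$\overline{\nu}(K_n) = \frac{1}{\binom{n-4}{m-4}} \sum_{A} c(A).$$

Now clearly $c(A) \ge \overline{\nu}(K_m)$, so it follows that

$$\overline{\nu}(K_n) \ge \frac{\binom{n}{m}}{\binom{n-4}{m-4}}\, \overline{\nu}(K_m), \quad\text{i.e.,}\quad \frac{\overline{\nu}(K_n)}{\binom{n}{4}} \ge \frac{\overline{\nu}(K_m)}{\binom{m}{4}}.$$

Thus $\overline{\nu}(K_n)/\binom{n}{4}$ is a nondecreasing function of $n$ which is bounded above by 1 and
below by $\overline{\nu}(K_5)/\binom{5}{4} = 1/5$. □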
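The monotonicity just proved can be checked against the small values of the rectilinear crossing number quoted earlier (1, 3, 9, 19, 36 for n = 5, ..., 9). The Python sketch below does this in exact arithmetic and also verifies the binomial identity that converts the counting bound into a statement about ratios. The names `KNOWN` and `ratio` are illustrative choices made here, not notation from the paper.

```python
from fractions import Fraction
from math import comb

# Known values of the rectilinear crossing number of K_n, as quoted in the text.
KNOWN = {5: 1, 6: 3, 7: 9, 8: 19, 9: 36}


def ratio(n, crossings):
    """Crossing number divided by C(n, 4), as an exact fraction."""
    return Fraction(crossings, comb(n, 4))


ratios = [ratio(n, KNOWN[n]) for n in sorted(KNOWN)]

# The sequence of ratios should never decrease.
assert all(a <= b for a, b in zip(ratios, ratios[1:]))

# Identity used in the proof: C(n, m) * C(m, 4) == C(n, 4) * C(n - 4, m - 4).
for n in range(5, 15):
    for m in range(4, n):
        assert comb(n, m) * comb(m, 4) == comb(n, 4) * comb(n - 4, m - 4)

for n, r in zip(sorted(KNOWN), ratios):
    print(n, r, float(r))
```

The ratios start at 1/5 for n = 5 and climb toward the limit; the lower bound 61/210 for n = 10 continues the same pattern.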


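Since $\overline{\nu}(K_{10}) \ge 61$, we see that $\overline{\nu}^* \ge 61/210 \approx 0.29$.
Singer [10] proves the following upper bound on the rectilinear crossing number
of $K_n$ when $n$ is a power of 3:

$$\overline{\nu}(K_n) \le \frac{1}{312}\left(5n^4 - 39n^3 + 91n^2 - 57n\right).$$

Thus $\overline{\nu}^* \le \frac{5}{312} \times 24 = \frac{5}{13} \approx 0.3846$.

(Jensen [6] gives a rectilinear embedding of $K_n$ with $\frac{7}{432}n^4 + O(n^3)$ crossings,
yielding an upper bound of 0.3888... on $\overline{\nu}^*$.)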
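To see how Singer's polynomial leads to the constant 5/13, one can evaluate it at powers of 3 and divide by $\binom{n}{4}$. The sketch below does so with exact fractions; `singer_bound` and `normalized` are names chosen here for illustration.

```python
from fractions import Fraction
from math import comb


def singer_bound(n):
    """Singer's upper bound as quoted in the text (stated for n a power of 3)."""
    return Fraction(5 * n**4 - 39 * n**3 + 91 * n**2 - 57 * n, 312)


def normalized(n):
    """Singer's bound divided by C(n, 4)."""
    return singer_bound(n) / comb(n, 4)


# C(n, 4) grows like n^4 / 24, so the limiting ratio is 24 * 5 / 312 = 5/13.
limit = Fraction(5, 312) * 24

for k in range(2, 8):
    n = 3**k
    print(n, singer_bound(n), float(normalized(n)))
print("limit:", limit, float(limit))
```

At n = 9 the polynomial evaluates to 36, which matches the known value of the rectilinear crossing number of $K_9$ quoted above.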


MAIN RESULT. Our main result is a simple relation between $q_*$, the smallest
probability of choosing a quadrilateral, and $\overline{\nu}^*$, the limit of $\overline{\nu}(K_n)/\binom{n}{4}$.


Theorem 3. With the preceding notation, $q_* = \overline{\nu}^*$.


Proof: Let $R$ be any open set in the plane with finite area. Choose $n$ points
$p_1, p_2, \ldots, p_n$ i.u.a.r. from $R$ and let those $n$ points be the vertices of a straight
line drawing of $K_n$. Let $c$ be the number of crossings in this drawing. Now $c$ is a
random variable whose value is always at least $\overline{\nu}(K_n)$. Further, let

$$X = \sum_{\{a,b,c,d\}} \mathbf{1}\{p_a, p_b, p_c, p_d \text{ form a quadrilateral}\}$$

where the sum is over all 4-element subsets of $\{1, \ldots, n\}$ and $\mathbf{1}\{\ldots\}$ is a 0,1
indicator random variable whose value is 1 just when the convex hull of the four
points $p_a, p_b, p_c, p_d$ is a quadrilateral. Each such quadrilateral contributes exactly
one crossing, that of its two diagonals, so $X$ equals the number of crossings. Since
the optimum drawing cannot have more crossings than the average, we get, by
taking expectations,

$$\overline{\nu}(K_n) \le E(X) = \binom{n}{4} q(R)$$

for all $n$. Dividing by $\binom{n}{4}$ and letting $n \to \infty$, we have $\overline{\nu}^* \le q(R)$ for all $R$. Thus
$\overline{\nu}^* \le q_*$.

For the opposite inequality, consider a straight line embedding of $K_n$ with the
minimum number, $\overline{\nu}(K_n)$, of crossings. Let $R_n$ be the (disconnected) open set
formed by placing a small open disc of radius $\epsilon$ centered at each vertex of the
embedding. See the cover of this issue. Here $\epsilon$ is chosen small enough so that for
every choice of $n$ points, one in each disc, if we connect all pairs of them by
straight line segments then the number of crossings is always equal to $\overline{\nu}(K_n)$, i.e.,
all such embeddings are optimal. Clearly such an $\epsilon$ exists.

We now consider the following question: choose four distinct discs of $R_n$, and
then choose i.u.a.r. a point from each of them. What is the probability $q$ that the
resulting quadrilateral is convex?

On the one hand, $q$ is the number of convex quadrilaterals in the original
embedding divided by $\binom{n}{4}$. But the former are in 1-1 correspondence with edge
crossings, so there are exactly $\overline{\nu}(K_n)$ of them, and we have $q = \overline{\nu}(K_n)/\binom{n}{4}$.


On the other hand, $q(R_n)$ is the probability that four points chosen i.u.a.r. in $R_n$
will form a convex quadrilateral. But four points so chosen will lie in four distinct
discs of $R_n$ with probability $1 - O(1/n)$. Hence

$$q = q(R_n) + O(1/n) \ge q_* + O(1/n).$$

Combining these two facts about $q$ we obtain

$$q = \overline{\nu}(K_n) \Big/ \binom{n}{4} \ge q_* + O(1/n).$$

If we now let $n \to \infty$, $\overline{\nu}^* \ge q_*$ follows, and the proof is complete. □
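The proof rests on the fact that, for points in general position, the crossings of a straight line drawing of $K_n$ correspond one-to-one with the 4-element subsets of vertices in convex position. The Python sketch below checks this on two small configurations by counting both quantities independently. All function names are illustrative assumptions, and integer points on a parabola stand in for points on a circle so that the arithmetic is exact.

```python
from itertools import combinations
from math import comb


def orient(a, b, c):
    """Twice the signed area of triangle abc."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def segments_cross(p, q, r, s):
    """True if segments pq and rs cross (points assumed in general position)."""
    return (orient(p, q, r) * orient(p, q, s) < 0
            and orient(r, s, p) * orient(r, s, q) < 0)


def count_crossings(points):
    """Number of crossing pairs of edges in the straight line drawing of K_n."""
    edges = list(combinations(range(len(points)), 2))
    total = 0
    for (a, b), (c, d) in combinations(edges, 2):
        if len({a, b, c, d}) == 4 and segments_cross(
                points[a], points[b], points[c], points[d]):
            total += 1
    return total


def in_triangle(p, a, b, c):
    """True if p lies strictly inside triangle abc."""
    d1, d2, d3 = orient(a, b, p), orient(b, c, p), orient(c, a, p)
    return (d1 > 0 and d2 > 0 and d3 > 0) or (d1 < 0 and d2 < 0 and d3 < 0)


def count_convex_quadruples(points):
    """Number of 4-subsets whose convex hull is a quadrilateral."""
    total = 0
    for quad in combinations(points, 4):
        if not any(in_triangle(quad[i], *(quad[:i] + quad[i + 1:]))
                   for i in range(4)):
            total += 1
    return total


# Seven points in convex position: every 4-subset gives one crossing.
convex_points = [(k, k * k) for k in range(7)]
# A square with one interior point, chosen so that no three points are collinear.
mixed_points = [(0, 0), (4, 0), (4, 4), (0, 4), (2, 1)]

print(count_crossings(convex_points), count_convex_quadruples(convex_points))
print(count_crossings(mixed_points), count_convex_quadruples(mixed_points))
```

For the points in convex position both counts equal $\binom{7}{4} = 35$, which is the circle drawing mentioned earlier; moving a vertex inside the hull lowers both counts together.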


Thus to summarize our principal results, we have

$$0.29 \approx \frac{61}{210} \le q_* = \overline{\nu}^* \le \frac{5}{13} \approx 0.385.$$


SOME COMMENTS 


1. In a spirit similar to ours, Moon [8] applies random methods in bounding
   the ordinary crossing number of $K_n$. He places $n$ points i.u.a.r. on a sphere
   and joins them pairwise by arcs of great circles. This gives an upper bound
   of $\frac{3}{8}\binom{n}{4}$ for the crossing number of $K_n$. It is not clear how to project Moon’s
   embedding into the plane and have the edges become line segments.

2. Our methods can be applied to arbitrary graphs $G$. Let $M$ denote the
   number of pairs of edges in $G$ which span four distinct vertices. Then
   $\overline{\nu}(G) \le \overline{\nu}^* M/3$; simply place the vertices of $G$ i.u.a.r. in a region $R$ and
   compute the expected number of crossings.

3. Given a subset $R$ of the plane together with a probability measure $\mu$
   defined on $R$, define $q(R, \mu)$ to be the probability that four points chosen
   i.u.a.r. with respect to $\mu$ form a convex quadrilateral. Without further
   restrictions, we see that the infimum of $q(R, \mu)$ is 0; let $R$ be a unit line
   segment together with Lebesgue measure: no four points can form a
   quadrilateral.

   If we restrict ourselves to those $(R, \mu)$ for which the probability that
   three points selected i.u.a.r. are collinear is 0, then the infimum of $q(R, \mu)$
   remains the same, namely $q_*$.

4. If we understand Sylvester’s problem (see quote above) to mean the
   interior of a Jordan curve, our results still don’t change; in the proof of
   Theorem 3 we can join the small open disks by even smaller tendrils so the
   domain is the interior of a simple closed curve.

5. Let $R$ be a bounded open set and embed the vertices of $K_n$ at $n$ points
   selected i.u.a.r. in $R$. We have seen that the expected number of crossings
   in this embedding is $q(R)\binom{n}{4}$. However, one might harbor hopes of doing
   better on occasion. It would seem natural to generate many embeddings of
   $K_n$ in, say, the interior of a square or a disk, and count the number of
   crossings in hopes of finding a good embedding. Regrettably, this is not at
   all likely, as we now explain.

   The number of crossings can be written

   $$X = \sum_{\{a,b,c,d\}} \mathbf{1}\{p_a, p_b, p_c, p_d \text{ form a quadrilateral}\}$$

   where the sum is over all 4-element subsets of $\{1, \ldots, n\}$ and the $p_i$’s are
   chosen i.u.a.r. in $R$. Thus $X$ is an example of a U-statistic; see [5] or [9] for
   an extensive discussion. Using “deviations” results (see [9] §5.6) one can
   show that the probability that the number of crossings is “significantly” less
   than the expectation is extremely small.
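The bound for arbitrary graphs stated in the comments above can be spelled out as a short expectation computation. The following is a sketch of the reasoning; the labels $e = ab$ and $f = cd$ for a pair of vertex-disjoint edges are introduced here for illustration and are not notation from the paper.

```latex
% Place the vertices of G i.u.a.r. in R. For edges e = ab and f = cd with
% four distinct endpoints, the segments can cross only if a, b, c, d are in
% convex position, and then exactly one of the three pairings
% {ab, cd}, {ac, bd}, {ad, bc} is the crossing pair of diagonals.
% By symmetry each pairing is equally likely to be that pair, so
\Pr[\, e \text{ and } f \text{ cross} \,] = \frac{q(R)}{3},
\qquad
E[\text{number of crossings}] = \frac{M\, q(R)}{3}.
% Some placement does at least as well as the average, hence
\overline{\nu}(G) \le \frac{M\, q(R)}{3} \ \text{ for every } R,
\quad\text{and so}\quad
\overline{\nu}(G) \le \frac{M\, q_*}{3} = \frac{M\, \overline{\nu}^*}{3}.
```

For $G = K_n$ there are $M = 3\binom{n}{4}$ such pairs of edges, and the computation reduces to the expectation $q(R)\binom{n}{4}$ used in the proof of Theorem 3.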
Biographical history, as taught in our pub-
lic schools, is still largely a history of 
boneheads: ridiculous kings and queens, 
paranoid political leaders, compulsive voy- 
agers, ignorant generals—the flotsam and 
jetsam of historical currents. The men who 


radically altered history, the great scien- 
tists and mathematicians, are seldom men- 
tioned, if at all. 


—Martin Gardner 


George F. Simmons, Calculus Gems. 
New York: McGraw Hill, Inc., 1992, p. 1 
String Matching for the Novice


Ora E. Percus and Jerome K. Percus 


A recurrent theme to gladden the hearts of jaded mathematicians is the frequent 
rediscovery, for the purposes of today’s technology, of elegant mathematical results 
from years, decades, or even centuries ago. The development of quantum mechan- 
ics via the “matrix mechanics” of Born and Heisenberg in (anecdotal?) ignorance 
of the very well developed theory of matrices, is a widely quoted example. But 
technology today often has the prefix ‘‘bio” attached to it, and so it is only fitting 
to find that statistical questions arising in the human genome project were 
considered, and solved in practical form, centuries ago. It is the purpose of this 
note to go one step further and show that even the naive mathematical level that 
must have been the norm in bygone days is sufficient for practical answers to these 
same questions.

The questions we have in mind are all versions of “the matching problem”. A
typical context is this. (See e.g. ref. 5.) A large linear molecular chain occurs as a
string of $l$ molecular units (amino acids for proteins, bases for DNA, ...); another
molecule of length $l$ when aligned with the first is found to have a subsequence of
$r$ units in common with it. What is the a priori probability of this event, to be used
for example in a statistical test of the hypothesis that this was a purely random
occurrence, and hence not a significant indication of some functional or evolutionary
relationship between the two molecules? Now the “randomness” assumption
has to be made more precise. For this purpose, we suppose that the relative
frequency $f_j$ of the jth type of unit, $j = 1,\ldots,n$ (proteins: n = 20, DNA: n = 4, ...)
is known, and that this is the frequency at which type j is to be found at a
given location or site in a molecular string, independently of the identity of the
units at other sites. If this is the case, the probability of units from two molecules
matching at a given site will be

\[ p = \sum_{j=1}^{n} f_j^2 \tag{1} \]

and the probability of their not matching, $q = 1 - p$.


PROTOTYPE. Let us return to the question at hand, now framed as: what is the 
probability that, under random selection, the two strings of units will match at least 
at r sites in a row? This was studied from a generating function viewpoint by 
de Moivre [1] back in the 1700’s, and an explicit highly computable series result is 
to be found, e.g. in Uspensky’s standard text [6]. The conceptual difficulty is that 
e.g. failure to match a given r-site subsequence biases the matching probability of 
another r-subsequence with overlapping sites. In other words, two successful 
matching events A and B for different subsequences will not exactly satisfy 


\[ P(A \cup B) = P(A) + P(B) \tag{2} \]
because they are not mutually exclusive. There are ingenious ways of bounding the
effect of such potential overlaps on the resulting probability, but here we simply
note that if the r-matching probability is very small to start with, in accord with
many practical situations, then we anticipate that the overlap correction will be
extremely small and can be neglected. In the present case, the implications are
immediate: the probability of at least r matches starting at the left end (designated
site 1) of the pair of molecules is clearly $p^r$; the probability of an r-match starting
at site $s = 2,\ldots,l+1-r$ is that of site $s - 1$ having a mismatch, followed by r
matches, i.e. $qp^r$, and there are $l - r$ such values of s. Hence the total probability
of at least one r-match under the assumption of additive probabilities is given by

\[ P^{l}_{\ge r} = p^r + (l - r)\,q\,p^r = (1 + (l - r)q)\,p^r, \quad \text{for } r \ge 1. \tag{3} \]


We can just as easily be a little more precise. Suppose that we ask for $P^{l}_{r}$, the
probability that the longest match has length r, i.e. that there are length r matches
and they do not extend to length r + 1. This means that unless we are at either
end, a mismatch followed by r matches followed by a mismatch is required, a
probability of $qp^rq$ for $s = 2,\ldots,l - r$; starting at s = 1, one needs only $p^rq$, and
finishing at s = l, only $qp^r$. Thus, $P^{l}_{r} = p^rq + (l - r - 1)qp^rq + qp^r$, or

\[ P^{l}_{r} = \begin{cases} (2 + (l - 1 - r)q)\,p^r q & \text{for } 1 \le r < l, \\ p^l & \text{for } r = l. \end{cases} \tag{4} \]


Of course, (3) and (4) are related. The probability of the longest match having
length r is that of at least r in a row minus that of at least r + 1 in a row, and
indeed $(1 + (l - r)q)p^r - (1 + (l - r - 1)q)p^{r+1} = (2 + (l - 1 - r)q)p^rq$ is an
identity for r < l, and holds trivially for r = l.
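
The relation between (3) and (4) is easy to confirm numerically. The sketch below simply transcribes the two formulas; the function names `p_at_least` and `p_longest` are ours, not the authors'.

```python
def p_at_least(l, r, p):
    """Equation (3): additive approximation to the probability of at least
    one run of r consecutive matches among l aligned sites."""
    q = 1.0 - p
    return (1 + (l - r) * q) * p**r

def p_longest(l, r, p):
    """Equation (4): additive approximation to the probability that the
    longest run of matches has length exactly r."""
    q = 1.0 - p
    if r == l:
        return p**l
    return (2 + (l - 1 - r) * q) * p**r * q
```

For instance, with l = 30 and p = 1/4, `p_at_least(l, r, p) - p_at_least(l, r + 1, p)` agrees with `p_longest(l, r, p)` for every r < l. For very small r the additive value can exceed 1, which is a reminder that the approximation is meant only for rare matches.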


NEXT STAGE. This was a warm-up. Since the long molecules are built up of bits
and pieces which get arranged and rearranged, there is no reason to expect a
common functional subsequence to be at the same relative set of locations in each
molecule, and indeed no reason to expect the two potentially related molecules to
have the same length. So let us suppose that the molecules are $l_1 \le l_2$ units long,
and that we ask for the probability of a match of at least r units in sequence of one
with the same number of units in sequence some place in the other. The counting
is most easily done by figuratively sliding one molecule along the other and
regarding the common region of sites as a pair of equal length molecules to which
we can apply (3). The length of this region will be $l_1$ any time the $l_1$ molecule can
fit inside the $l_2$ molecule, i.e. $l_2 + 1 - l_1$ times, each contributing $(1 + (l_1 - r)q)p^r$
to the desired probability (again with the assumption of independence). Then, as
the $l_1$ string moves partly outside the $l_2$ string at one end or the other, an overlap
of $r \le l < l_1$ will occur exactly twice, with a contribution of $(1 + (l - r)q)p^r$. The
total probability is then $(l_2 + 1 - l_1)(1 + (l_1 - r)q)p^r + 2\sum_{l=r}^{l_1-1}(1 + (l - r)q)p^r$,
which reduces without difficulty to

\[ P^{l_1,l_2}_{\ge r} = \left[1 + (l_1 - r) + (l_2 - r) + (l_1 - r)(l_2 - r)q\right]p^r, \quad \text{for } 1 \le r \le l_1. \tag{5} \]


For example [4], two fragments of DNA, of lengths $l_1 = 154$, $l_2 = 103$ with
nominal base matching probability p = 1/4 would have a match of at least nine
bases length with probability $P^{l_1,l_2}_{\ge 9} = 0.040$, very close to the exact 0.039.
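
As a check on (5), here is a short sketch that evaluates the closed form and, separately, the sliding sum from which it was derived. The function names are hypothetical, and the sliding version assumes $r \le l_1 \le l_2$.

```python
def p_two_at_least(l1, l2, r, p):
    """Equation (5): additive approximation for a common run of at least r
    units somewhere in two molecules of lengths l1 and l2."""
    q = 1.0 - p
    return (1 + (l1 - r) + (l2 - r) + (l1 - r) * (l2 - r) * q) * p**r

def p_two_by_sliding(l1, l2, r, p):
    """The same quantity summed alignment by alignment (requires r <= l1 <= l2)."""
    q = 1.0 - p

    def single(l):
        # equation (3) for an overlap region of l sites
        return (1 + (l - r) * q) * p**r

    full = (l2 + 1 - l1) * single(l1)                 # shorter string fully inside
    partial = 2 * sum(single(l) for l in range(r, l1))  # overhang at either end
    return full + partial
```

With the DNA figures quoted above (lengths 154 and 103, r = 9, p = 1/4) the closed form gives approximately 0.0399, which rounds to the stated 0.040; since (5) is symmetric in the two lengths, the order in which they are passed does not matter.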

Of course, the corresponding result for the longest match being of length r can
be found precisely as before, i.e. by computing $P^{l_1,l_2}_{r} = P^{l_1,l_2}_{\ge r} - P^{l_1,l_2}_{\ge r+1}$, and this
turns out to be

\[ P^{l_1,l_2}_{r} = \left[2 + 2(l_1 - r + l_2 - r - 1)q + (l_1 - r - 1)(l_2 - r - 1)q^2\right]p^r, \quad \text{for } 1 \le r \le l_1 - 1. \tag{6} \]


RELAXED CRITERIA. It is well known that some substitutions can be tolerated
without changing the functionality of biological subsequences. Suppose that the
matching frame is of length r or greater, but that only m sites of a subsequence
have to be identical, including of course the first and last sites that define the
length r. Now the elementary matching probability, instead of being $p^r$, is
determined by one match, m − 2 out of r − 2 matches, then another match, i.e. a
probability of $p\binom{r-2}{m-2}p^{m-2}q^{r-m}p$. We conclude that the only effect of the relaxed
criterion is the replacement

\[ \text{for } m \text{ out of } r, \text{ replace: } p^r \to \binom{r-2}{m-2} p^m q^{r-m} \tag{7} \]

in (2), (3), (4), or (5). Presumably more relevant would be the criterion that at least
m out of r match, leading at once to

\[ \text{for at least } m \text{ out of } r, \text{ replace: } p^r \to \sum_{k=m}^{r} \binom{r-2}{k-2} p^k q^{r-k} \tag{8} \]

Still more relevant might be a weighted probability in which the pattern of matches 
within an r-subsequence matters, so that e.g. one would bias against large gaps, 
but this is a matter of explicit relevant biochemistry and will not be discussed here. 


ASSESSMENT. The assumption that all successful r-matches are nonoverlapping 
is clearly an approximation that holds only if the elementary events have suffi- 
ciently small probabilities that one is confident that the overlap probability is fully 
negligible. But how small is small? In this respect, a number of comparisons have 
been made between the results of the independence assumption and sophisticated 
bounding techniques or numerical simulations, arriving at the conclusion that 
“sufficiently small” can be quite large, to the extent that it is automatically 
satisfied by virtually any situation in which matching is unusual enough to warrant 
its use to contradict a hypothesis of randomness. So a naive approach is justified. 
But suppose one really wants to assess the possible effect of non-empty intersec- 
tion of relevant events. How difficult is this? Not very, as a prototypical example 
shows. 

Consider the basic matching problem as approximated by (3). If $A_j$ is the event
that a match at $\ge r$ contiguous sites starts at site j, $j = 1, 2,\ldots,l + 1 - r$, then

\[ P^{l}_{\ge r} = P(A_1 \cup A_2 \cup \cdots \cup A_{l+1-r}). \tag{9} \]

One knows from Bonferroni’s inequalities [2] that, quite generally,

\[ \sum_{j=1}^{l+1-r} P(A_j) \;\ge\; P\Bigl(\bigcup_{j=1}^{l+1-r} A_j\Bigr) \;\ge\; \sum_{j=1}^{l+1-r} P(A_j) - \sum_{1 \le j < k \le l+1-r} P(A_j \cap A_k). \tag{10} \]

We have already used $P(A_1) = p^r$, $P(A_j) = qp^r$ for j > 1 in (3). Now if $j < k \le j + r$,
$A_k$ overlaps or abuts $A_j$; thus k cannot be the start of a match of exactly r,
and $P(A_j \cap A_k) = 0$. On the other hand, if $k > j + r$, $A_j$ and $A_k$ are not
mutually exclusive but independent, so that $P(A_j \cap A_k) = P(A_j)P(A_k)$. Thus the
nonvanishing contributions come from: $j = 1$, $r + 1 < k \le l + 1 - r$, a total of
$(l - 2r)p^rqp^r$, and: $1 < j \le l - 2r$, $r + j < k \le l + 1 - r$, a total of
$\sum_{j=2}^{l-2r}(l + 1 - 2r - j)(qp^r)^2 = \tfrac{1}{2}(l - 2r - 1)(l - 2r)(qp^r)^2$. It follows from (3) and (10) that

\[ (1 + (l - r)q)p^r \;\ge\; P^{l}_{\ge r} \;\ge\; (1 + (l - r)q)p^r - (l - 2r)\bigl(1 + \tfrac{1}{2}q(l - 2r - 1)\bigr)qp^{2r}. \tag{11} \]

Interestingly, the terms in (11) are precisely the first two in Uspensky’s expansion
(and the remaining terms can be obtained as well using inclusion-exclusion). A
quoted example [3] is in the context of the tossing of a fair coin, p = 1/2. For 2059
tosses, the probability of a run of at least 13 is given exactly by .1176. According to
(11), one finds $.1249 \ge P^{2059}_{\ge 13} \ge .1172$, illustrating as well the rapid convergence of
the alternating series that equation (10) is developing.
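
To see the bounds of (11) at work, one can compare them with an exact run probability. The sketch below is ours: `union_bounds` transcribes (11), with the factor $l - 2r$ clamped at zero for short strings (an assumption, since (11) is stated for the case where non-overlapping pairs exist), and `exact_run_prob` uses a simple dynamic program over the current trailing run length, which is a standard technique rather than the series method cited in the text.

```python
def union_bounds(l, r, p):
    """Lower and upper bounds of (11) on the probability of a run of at
    least r matches among l sites."""
    q = 1.0 - p
    upper = (1 + (l - r) * q) * p**r
    n = max(l - 2 * r, 0)  # clamp is our assumption for l < 2r
    correction = n * (1 + 0.5 * q * (n - 1)) * q * p**(2 * r)
    return upper - correction, upper

def exact_run_prob(l, r, p):
    """Exact probability of at least one run of r consecutive matches in l
    independent sites, each matching with probability p."""
    q = 1.0 - p
    # state[k]: probability that no run of r has occurred yet and the
    # current trailing run of matches has length k (k = 0, ..., r-1)
    state = [1.0] + [0.0] * (r - 1)
    for _ in range(l):
        alive = sum(state)
        # a mismatch resets the run; a match extends it; runs reaching r drop out
        state = [q * alive] + [p * x for x in state[:-1]]
    return 1.0 - sum(state)
```

Calling `union_bounds(l, r, p)` and `exact_run_prob(l, r, p)` with the same arguments should always place the exact value between the two bounds, and for rare events (small $lqp^r$) the gap between the bounds is small compared with the probability itself.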


REMARKS. The above argument suggests an alternative strategy, which is to start
by assuming that successful r-matches are independent events. Then (9), in the
form $P^{l}_{\ge r} = 1 - P(\bar{A}_1 \cap \bar{A}_2 \cap \cdots)$ where $\bar{A}$ is the complement of A, yields

\[ P^{l}_{\ge r} = 1 - \prod_{j}\bigl(1 - P(A_j)\bigr), \tag{12} \]

which takes on a Poisson distribution form for small $p^r$, coincides with the lower
bound of (11) when $r \ll l$, and can be sequentially corrected for correlations.
Establishing bounds in this fashion is not trivial. Of course, bounding correction
terms can be derived for other cases, just as they were for (11), with similar
qualitative results: the correction $\Delta P$ to the probability P scales as the square of
the probability (here $P \sim lqp^r$, $\Delta P \sim \tfrac{1}{2}(lqp^r)^2$), validating our initial presumption
as to the effect of additivity of probabilities. However, since we have for example
omitted correlations among sites on the same chain, which are known to exist, it is
not clear that we are justified in going beyond the mathematics of the early days of
probability, as illustrated by (3), (4), (5), and (6) which give us the upper bounds
that tell us whether an observation is likely to represent nonrandomness. And this
is the message which was to be delivered.
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Bernoulli Trials and Number Theory 


Don Rawlings 


1. INTRODUCTION. A coin may be used to generate a natural number by simply
tossing it until heads lands up: If the heads occurs on the nth toss, then identify
the sequence of tosses with n. For instance, the Bernoulli sequence TTTTTH may
be viewed as generating the natural number 6. The probability $P_q(n)$ that a given
number n is the outcome may be readily determined. If the coin lands tails up with
probability q < 1, then $P_q(n) = q^{n-1}(1 - q)$ since the trials are independent.

Of course, the set of natural numbers $Z^+$ along with the measure $P_q$ really
amounts to the geometrical distribution with success probability (1 − q). However,
the perspective from the measure space $(Z^+, P_q)$ is provocative; it serves as a
catalyst for the consideration of questions with a number theoretic flavor in the
context of a geometrical distribution.

Two examples illustrating the possibilities afforded by this perspective are
presented herein. In the first, a certain divisibility property considered on Bernoulli
generated s-tuples of natural numbers leads to a natural “q-analog” of Euler’s
product formula for the Riemann zeta function. The second example concerns the
probability of a sequence $n_1, n_2,\ldots,n_s$ of Bernoulli generated numbers having no
“fixed indices.” As $s \to \infty$, the likelihood of having no “fixed indices” will be
shown to approach the reciprocal of a “q-analog” of the real number e.


2. THE NOTION OF q-ANALOG. An object $\mathcal{O}(q)$ is said to be a q-analog of $\mathcal{O}$ if
$\mathcal{O}(q)$ reduces to $\mathcal{O}$ when q = 1. As an illustration, for $n \in Z^+$ the polynomials
defined by

\[ [n] = 1 + q + q^2 + \cdots + q^{n-1} \quad\text{and}\quad [n]! = [1][2]\cdots[n] \]

are q-analogs respectively of n and its factorial. These q-analogs are extended to
include 0 by adopting the conventions that [0] = 0 and [0]! = 1. The polynomial [n]
is just the nth partial sum of a geometric series in q.

An important example of a q-analog for the discourse at hand is the continuous
extension on congruence classes of $P_q$ to include the case q = 1. For $m \in Z^+$, let
$\bar{1}, \bar{2},\ldots,\bar{m}$ denote the set of congruence classes modulo m. Since $[m] = (1 - q^m)/(1 - q)$,
it follows for $0 \le q < 1$ that the probability of a Bernoulli generated
number n belonging to the class $\bar{r}$ is

\[ P_q\{n \in \bar{r}\} = \sum_{k=0}^{\infty} q^{r-1+km}(1 - q) = (1 - q)q^{r-1}\sum_{k=0}^{\infty}(q^m)^k = \frac{(1 - q)q^{r-1}}{1 - q^m} = \frac{q^{r-1}}{[m]}. \]

Since $q^{r-1}/[m]$ is continuous from the left at q = 1, $P_q$ may be continuously
extended on congruence classes to q = 1 by defining $P_1\{n \in \bar{r}\} = 1/m$ for
$1 \le r \le m$. Thus, the extended $P_q$ is a q-analog of the equiprobable measure.
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Actually, this extension makes some sense in terms of the Bernoulli scheme: If 
q = 1, then the coin almost surely never lands heads up. So, all numbers in Z* are 
equally likely (or, more aptly speaking, equally unlikely). For later reference, the 
remarks of this paragraph and the last are recorded in the following lemma. 


Lemma 1. For $0 \le q \le 1$, the probability of a Bernoulli generated number n
belonging to the congruence class $\bar{r}$ modulo m is $P_q\{n \in \bar{r}\} = q^{r-1}/[m]$.
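
Lemma 1 can be checked by summing the geometric probabilities over a congruence class directly. This is a small illustrative sketch; the names `q_bracket`, `class_prob_closed` and `class_prob_series` are ours, and the series is truncated after a fixed number of terms.

```python
def q_bracket(n, q):
    """The q-analog [n] = 1 + q + ... + q^(n-1)."""
    return float(n) if q == 1 else (1 - q**n) / (1 - q)

def class_prob_closed(r, m, q):
    """Lemma 1: probability that a Bernoulli generated number lies in class r mod m."""
    return q**(r - 1) / q_bracket(m, q)

def class_prob_series(r, m, q, terms=2000):
    """Direct sum of P_q(n) = q^(n-1) (1 - q) over n = r, r + m, r + 2m, ..."""
    return sum(q**(r - 1 + k * m) * (1 - q) for k in range(terms))
```

For q = 0.3 and m = 5 the two functions agree for each r = 1, ..., 5, and the five class probabilities add up to 1; at q = 1 the closed form returns 1/m, the equiprobable value used in the extension.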


3. A q-ANALOG OF THE RIEMANN ZETA FUNCTION. For $n \in Z^+$, suppose
that $n = p_1^{\alpha_1}p_2^{\alpha_2}\cdots p_k^{\alpha_k}$ where $p_1, p_2,\ldots,p_k$ are distinct primes and
$\alpha_1, \alpha_2,\ldots,\alpha_k$ are positive integers. The q-canonical factorization of n is then
defined to be

\[ \{n\} = [p_1]^{\alpha_1}[p_2]^{\alpha_2}\cdots[p_k]^{\alpha_k}. \]

Furthermore, let $v(n) = \alpha_1(p_1 - 1) + \alpha_2(p_2 - 1) + \cdots + \alpha_k(p_k - 1)$. By convention,
set {1} = 1 and v(1) = 0.

For s > 1 and $0 \le q \le 1$, a q-analog of the Riemann zeta function may be
defined by

\[ \zeta_q(s) = \sum_{n=1}^{\infty} \frac{q^{s\,v(n)}}{\{n\}^s}. \]

Note that $\zeta_q(s)$ does indeed reduce to the Riemann zeta function $\zeta(s)$ when
q = 1. As might be expected, $\zeta_q(s)$ has some properties that are analogous to those
of $\zeta(s)$. The relevant ones for the purposes of the next section are tabulated in the
next two theorems:


Theorem 1. For s > 1 and $0 \le q \le 1$, $\zeta_q(s)$ is convergent.


Theorem 2. For s > 1 and $0 \le q \le 1$, $\zeta_q(s) = \prod_p \bigl(1 - q^{s(p-1)}/[p]^s\bigr)^{-1}$ where the
product is over all primes p.


Theorem 2 is a q-analog of Euler’s classical product formula for the Riemann
zeta function. Rather than getting momentarily mired down in analytic details, the 
proofs of Theorems 1 and 2 are postponed until sections 6 and 7. Let’s now turn 
our attention to the context that served to motivate Theorem 2. 


4. p-DIVISIBILITY AND BERNOULLI GENERATED s-TUPLES. For p prime,
an s-tuple $(n_1, n_2,\ldots,n_s)$ of natural numbers is said to be p-divisible if p divides
each $n_i$. By considering the notion of p-divisibility in the context of Bernoulli
trials, a probabilistic application of $1/\zeta_q(s)$ arises.

To see how this comes about, consider repeating the Bernoulli scheme of the
introduction s times for each prime p. The result will be a sequence of s-tuples,
each of the form $(n_1, n_2,\ldots,n_s)_p$. Since the components of such an s-tuple are
determined independently, Lemma 1 implies that the probability of $(n_1, n_2,\ldots,n_s)_p$
not being p-divisible is

\[ 1 - \prod_{j=1}^{s} P_q\{n_j \in \bar{p}\} = 1 - \prod_{j=1}^{s}\frac{q^{p-1}}{[p]} = 1 - \frac{q^{s(p-1)}}{[p]^s}. \]

Furthermore, provided that s > 1, the probability of $(n_1, n_2,\ldots,n_s)_p$ not being
p-divisible for all primes p is given by

\[ \prod_{p}\Bigl(1 - \frac{q^{s(p-1)}}{[p]^s}\Bigr). \]

By Theorem 2, this infinite product converges to $1/\zeta_q(s)$. These considerations are
summarized in the following theorem.


Theorem 3. Suppose that s > 1 and that the Bernoulli scheme of the introduction is
repeated s times for each prime p so as to generate a sequence of s-tuples, each of the
form $(n_1, n_2,\ldots,n_s)_p$. The probability of no $(n_1, n_2,\ldots,n_s)_p$ being p-divisible is
$1/\zeta_q(s)$.
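
The product formula behind Theorem 3 can be explored numerically. What follows is only a sketch under stated assumptions: the series is truncated at `n_max`, the product at primes up to `p_max`, and the helper names (`factorize`, `zeta_q_series`, `zeta_q_product`) are ours. Trial division is used for simplicity, not efficiency.

```python
def q_bracket(n, q):
    """The q-analog [n] = 1 + q + ... + q^(n-1)."""
    return float(n) if q == 1 else (1 - q**n) / (1 - q)

def factorize(n):
    """Prime factorization by trial division, as {prime: exponent}."""
    factors, d = {}, 2
    while d * d <= n:
        while n % d == 0:
            factors[d] = factors.get(d, 0) + 1
            n //= d
        d += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors

def zeta_q_series(s, q, n_max):
    """Partial sum of q^(s v(n)) / {n}^s for n = 1, ..., n_max."""
    total = 0.0
    for n in range(1, n_max + 1):
        v, canon = 0, 1.0
        for p, a in factorize(n).items():
            v += a * (p - 1)               # v(n)
            canon *= q_bracket(p, q) ** a  # q-canonical factorization {n}
        total += q ** (s * v) / canon ** s
    return total

def zeta_q_product(s, q, p_max):
    """Partial Euler-type product over primes p <= p_max."""
    prod = 1.0
    for p in range(2, p_max + 1):
        if all(p % d for d in range(2, int(p**0.5) + 1)):  # p is prime
            prod /= 1 - q ** (s * (p - 1)) / q_bracket(p, q) ** s
    return prod
```

With s = 2 and q = 0.5 the truncated series and product agree closely, and `1 / zeta_q_product(2, 0.5, 2000)` then approximates the probability in Theorem 3. At q = 1 the series is the ordinary partial sum of $\zeta(2)$, which converges far more slowly.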


5. SEQUENCES WITH NO FIXED INDICES. Before addressing the subject of
this section, it is expedient to first dispose of a few technical details. In [4], a
q-analog of the real number e is defined as the series

\[ [e] = \sum_{n=0}^{\infty}\frac{1}{[n]!} \]

which converges for $0 < q \le 1$. If q = 0, then $[e] = \infty$. As demonstrated in [4], this
q-analog of e satisfies the following limit formula:


Theorem 4. If $0 < q \le 1$, then

\[ [e] = \lim_{n \to \infty}\prod_{j=1}^{n}\Bigl(1 - \frac{q^{j-1}}{[n]}\Bigr)^{-1}. \]

This is just a q-analog of the well-known identity $e = \lim_{n \to \infty}(1 - 1/n)^{-n}$. The
stage is now set.

A sequence $n_1, n_2,\ldots,n_s$ of natural numbers is said to have no fixed indices
modulo s if $n_j \notin \bar{j}$ for $1 \le j \le s$. Although not standard fare for number theory,
this notion leads to an identity involving the reciprocal of [e].

By making use of independence and of Lemma 1, the probability that a
Bernoulli generated sequence $n_1, n_2,\ldots,n_s$ has no fixed indices modulo s is

\[ \prod_{j=1}^{s} P_q\{n_j \notin \bar{j}\} = \prod_{j=1}^{s}\bigl(1 - P_q\{n_j \in \bar{j}\}\bigr) = \prod_{j=1}^{s}\Bigl(1 - \frac{q^{j-1}}{[s]}\Bigr). \]

In view of Theorem 4, letting $s \to \infty$ yields


Theorem 5. The probability that a Bernoulli generated sequence $n_1, n_2,\ldots,n_s$ has no
fixed indices approaches 1/[e] as $s \to \infty$.
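
Theorem 5 lends itself to a quick numerical illustration. The sketch below truncates the series for [e] after a fixed number of terms and evaluates the finite product for a chosen s; the names `q_e` and `no_fixed_index_prob` are hypothetical.

```python
def q_bracket(n, q):
    """The q-analog [n] = 1 + q + ... + q^(n-1)."""
    return float(n) if q == 1 else (1 - q**n) / (1 - q)

def q_e(q, terms=200):
    """Truncated series [e] = sum over n >= 0 of 1/[n]!, for 0 < q <= 1."""
    total, fact = 0.0, 1.0  # fact holds [n]!, starting from [0]! = 1
    for n in range(terms):
        if n > 0:
            fact *= q_bracket(n, q)
        total += 1.0 / fact
    return total

def no_fixed_index_prob(s, q):
    """Probability that n_1, ..., n_s has no fixed indices modulo s."""
    bracket_s = q_bracket(s, q)
    prob = 1.0
    for j in range(1, s + 1):
        prob *= 1 - q**(j - 1) / bracket_s
    return prob
```

For q = 0.5 the product with s = 200 already matches `1 / q_e(0.5)` to many digits, while at q = 1 it reduces to $(1 - 1/s)^s$ and approaches 1/e only at the familiar slow rate.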


Theorem 5 was inspired by and is equivalent to a result in [4] concerning 
“q-random mappings” with no fixed points. 


6. A PROOF OF THEOREM 1. Theorem 1 is clearly true for the extreme values
of q: On the one hand, $\zeta_0(s) = 1$. On the other, $\zeta_1(s)$ is just the Riemann zeta
function which is known to converge for s > 1.

So let’s now restrict our attention to the case 0 < q < 1. To get at this case, 
there are a few facts that are best isolated. They are presented in Lemma 2. 
Lemma 2. If 0 < q < 1 and m > 1, then

\[ \text{a) } \lim_{n \to \infty}[n] = (1 - q)^{-1} \qquad \text{b) } \sum_{j=0}^{\infty}\frac{1}{[m]^{sj}} = \frac{[m]^s}{[m]^s - 1} \qquad \text{c) } \sum_{j=1}^{\infty}\frac{1}{[m]^{sj}} = \frac{1}{[m]^s - 1} \]

All three parts readily follow from the standard formula for summing convergent
geometric series.

Since each term in the sum $\zeta_q(s)$ is positive, it suffices to show that the sum of
any rearrangement of these terms converges. Towards this end, suppose that $p_k$
denotes the kth smallest prime. Let

\[ p(k) = \{n \in Z^+ : p_k \text{ is the largest prime divisor of } n\}. \]

Since the union $\bigcup_{k=1}^{\infty}p(k)$ consists of pair-wise disjoint sets and is equal to
$Z^+ \setminus \{1\}$, the sum

\[ S = 1 + \sum_{k=1}^{\infty}\sum_{n \in p(k)}\frac{q^{s\,v(n)}}{\{n\}^s} \]

is a rearrangement of $\zeta_q(s)$.
Clearly, $v(n) \ge k - 1$ for all $n \in p(k)$. Furthermore, from the definition of
p(k) and from parts (b) and (c) of Lemma 2, it follows that

\[ \sum_{n \in p(k)}\frac{1}{\{n\}^s} = \sum_{\alpha_1,\ldots,\alpha_{k-1} \ge 0,\ \alpha_k \ge 1}\frac{1}{\bigl([2]^{\alpha_1}[3]^{\alpha_2}\cdots[p_{k-1}]^{\alpha_{k-1}}[p_k]^{\alpha_k}\bigr)^s} = \frac{[2]^s}{[2]^s - 1}\cdot\frac{[3]^s}{[3]^s - 1}\cdots\frac{[p_{k-1}]^s}{[p_{k-1}]^s - 1}\cdot\frac{1}{[p_k]^s - 1}. \]

For convenience, let $c_k$ denote this sum. Together, the comments of this paragraph
imply that

\[ S \le 1 + \sum_{k=1}^{\infty}q^{s(k-1)}\sum_{n \in p(k)}\frac{1}{\{n\}^s} = 1 + \sum_{k=0}^{\infty}c_{k+1}q^{sk}. \]

The proof will be complete if the last series on the above right can be shown to
converge. This is easy. Since 0 < q < 1 and s > 1, it follows from the definition of
$c_k$ and from Lemma 2(a) that

\[ \lim_{k \to \infty}\frac{c_{k+2}\,q^{s(k+1)}}{c_{k+1}\,q^{sk}} = \lim_{k \to \infty}\frac{q^s\,[p_{k+1}]^s}{[p_{k+2}]^s - 1} = \frac{q^s(1 - q)^{-s}}{(1 - q)^{-s} - 1} = \frac{q^s}{1 - (1 - q)^s} < \frac{q^s}{1 - (1 - q)} = q^{s-1} < 1. \]

The ratio test for convergence now implies the desired conclusion.


7. A PROOF OF THEOREM 2. As Theorem 2 may be established in a manner
completely analogous to the way in which Euler’s product formula for $\zeta(s)$ is
proven, only a brief outline of its proof will be given.
Assume that 0 < q < 1 and that s > 1. For $m \in Z^+$, let

\[ Q_m = \prod_{p \le m}\Bigl(1 - \frac{q^{s(p-1)}}{[p]^s}\Bigr)^{-1}. \]

Because $0 < q^{s(p-1)}/[p]^s < 1$ for all primes p, each factor of $Q_m$ may be
expanded into an absolutely convergent series:

\[ \Bigl(1 - \frac{q^{s(p-1)}}{[p]^s}\Bigr)^{-1} = \sum_{k=0}^{\infty}\frac{q^{sk(p-1)}}{[p]^{sk}}. \]

Moreover, viewed as a finite product of absolutely convergent series, $Q_m$ may also
be expressed as a convergent series. In fact,

\[ Q_m = \prod_{p \le m}\Bigl\{\sum_{k=0}^{\infty}\frac{q^{sk(p-1)}}{[p]^{sk}}\Bigr\} = {\sum}_1\,\frac{q^{s\,v(n)}}{\{n\}^s} \]

where $\sum_1$ is summed over the set of natural numbers n having no prime factors
greater than m. Comparing $Q_m$ with $\zeta_q(s)$ leads to the inequality

\[ |\zeta_q(s) - Q_m| \le {\sum}_2\,\frac{q^{s\,v(n)}}{\{n\}^s} \]

where $\sum_2$ is summed over all n having a prime factor greater than m. By Theorem
1, $\sum_2$ goes to 0 as $m \to \infty$. Thus, $Q_m$ converges to $\zeta_q(s)$.



8. CONCLUDING REMARKS. Many special functions (including the exponential,
gamma, beta, and the classical orthogonal polynomials of Jacobi, Legendre,
Laguerre, and Hermite) have well-known q-analogs. The book by George Gasper
and Mizan Rahman [1] contains an excellent exposition on special q-functions
from the perspective of basic hypergeometric series. Of interest from the standpoint
of probability is a recent article by Gilbert Labelle [2]: He considers a
q-analog of Euler’s gamma function within the context of a geometrical distribution.
The extent to which classical functional equations involving both the zeta and
gamma functions have well-behaved q-analogs appears to be an open question.

There are also several Bernoulli schemes in which the probability of tails
occurring may be directly identified with well-known q-analogs in combinatorics. A
survey of such schemes may be found in [3].
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What Is the Shape of a Mylar Balloon? 


William H. Paulsen 


Mylar balloons of different shapes and sizes have become popular as gifts or in 
bouquets. The most common balloons are comprised of two circular sheets of 
mylar, fused together at the circumference. A small opening on the circumference 
allows the balloon to be inflated with either air or helium. These balloons are not 
spherical, which is at first surprising, for it is well known that the sphere gives the 
maximal volume for a given surface area. Thus, these balloons suggest the 
following mathematical problem: Given a circular mylar balloon of deflated radius 
r, what will be the shape of the balloon when it is fully inflated? In particular, we 
could ask 


1) What is the radius of the inflated balloon? 
2) What is the thickness of the inflated balloon? 
3) What is its volume? 


In this paper, we will answer all three of these questions. 

We begin by forming a mathematical model of the balloon. We will ignore the 
small opening in the balloon which is used to inflate the balloon, so the deflated 
balloon will have circular symmetry. However, as we inflate the balloon, crimping 
occurs near the “equator” of the balloon. From this, it is apparent that the surface 
area is not the constraining factor for the volume. To understand what is constrain- 
ing the volume, let us look at a cross sectional view through the center of the 
balloon. 

Let a be the radius of the inflated balloon. We let y = f(x) describe the curve
of the cross section in the first quadrant, as in Figure 1. Because the mylar does not
stretch by a significant amount, we have that the length of the curve f(x) from 0 to
a is constrained by the original radius r. That is,

\[ \int_0^a \sqrt{1 + f'(x)^2}\,dx = r. \tag{*} \]

Thus, even though the actual mylar balloon has wrinkles, the basic shape, or
convex hull, of the balloon will be determined by the constraint (*).

We will assume that these wrinkles do not affect the volume of the balloon
significantly, so the total volume, by the shell method, is given by

\[ V = 4\pi\int_0^a x f(x)\,dx. \]

We want to find f(x) and a such that V is maximized subject to the subsidiary
condition (*) and the end point condition f(a) = 0.

Of course, the crimping will greatly affect the surface area of the balloon, which
in practice should remain constant as the balloon is inflated. Once the shape of the
balloon is determined, we will be able to determine just how much crimping is
taking place at different points of the balloon.

Figure 1

We proceed using the technique of the calculus of variations. In general, the
extremum function y(x) to the functional

\[ J[y] = \int_a^b F(x, y, y')\,dx, \]

with fixed boundary conditions y(a) = A and y(b) = B satisfies Euler’s equation
[3, p. 184]

\[ F_y - \frac{d}{dx}F_{y'} = 0. \]


In the current setting, we also have a subsidiary condition (* ) and neither of the 
two end points are fixed. We can work the problem by first considering the 
endpoints as fixed, and then use the transversality conditions at the endpoints to 
determine a and f(0). 

To incorporate the subsidiary condition, we will use a technique not unlike the
method of Lagrange multipliers. That is, the extremum function of $\int_a^b F(x, y, y')\,dx$
subject to the constraint $\int_a^b G(x, y, y')\,dx = l$ will also be an extremum of the
functional

\[ \int_a^b (F + \lambda G)\,dx \]

for some constant $\lambda$.


In the current setting, $F(x, y, y') = xy$, and $G(x, y, y') = \sqrt{1 + (y')^2}$. Thus,
we seek a maximal function y = f(x) of the functional

\[ \int_0^a \bigl(xy + \lambda\sqrt{1 + (y')^2}\bigr)\,dx. \]

By Euler’s equation, y = f(x) satisfies

\[ x - \lambda\frac{d}{dx}\Bigl[y'\bigl(1 + (y')^2\bigr)^{-1/2}\Bigr] = 0, \quad\text{or}\quad \frac{\lambda y'}{\sqrt{1 + (y')^2}} = \frac{x^2}{2} + C \]

for some constants $\lambda$ and C.
To find the value of these two constants, we note that the volume must be
maximized over all choices of a and f(0). Thus, we must come up with the
transversality conditions for the endpoints. In this case, it is easier to argue the
transversality conditions geometrically. Because of the nature of the problem,
the largest volume will be obtained when f(x) is orthogonal to both axes [2, p. 61].
That is,

\[ f'(0) = 0 \quad\text{and}\quad \lim_{x \to a^-} f'(x) = -\infty. \]

Plugging x = 0 into the above equation, and using the condition f'(0) = 0,
gives us C = 0. Thus,

\[ \frac{f'(x)}{\sqrt{1 + f'(x)^2}} = \frac{x^2}{2\lambda}. \]

Since we want f'(x) < 0 for 0 < x < a, let us write $2\lambda = -k^2$. Then, solving for
f'(x), we get

\[ f'(x) = \frac{-x^2}{\sqrt{k^4 - x^4}}. \]

Since we want f(a) = 0, we can write the solution as

\[ f(x) = \int_x^a \frac{t^2}{\sqrt{k^4 - t^4}}\,dt. \]

This is an elliptic integral, which has no closed form solution. But in spite of its
complexity, we will still be able to answer some questions.

Let us begin by solving for k and a. The volume will be the largest when a is
chosen such that the tangent at (a, 0) is vertical. That is, $\lim_{x \to a^-} f'(x) = -\infty$. This
will happen if a = k. Thus,

\[ f(x) = \int_x^a \frac{t^2}{\sqrt{a^4 - t^4}}\,dt, \quad (0 \le x \le a). \]

The graph of f(x) is given in Figure 1. To find a, we will use the constraint (*).
Since $f'(x) = -x^2/\sqrt{a^4 - x^4}$, this becomes

\[ \int_0^a \frac{a^2\,dx}{\sqrt{a^4 - x^4}} = r. \]

This is another elliptic integral, but it is a definite elliptic integral, which we can
put in closed form using a substitution. Let $x = at^{1/4}$, so $dx = at^{-3/4}/4\,dt$. The
integral becomes

\[ \frac{a}{4}\int_0^1 t^{-3/4}(1 - t)^{-1/2}\,dt = r. \]

This is a Beta integral [1, p. 288]. The general form of the Beta integral is

\[ \int_0^1 x^m(1 - x)^n\,dx = \frac{\Gamma(m + 1)\Gamma(n + 1)}{\Gamma(m + n + 2)}, \quad (m > -1,\ n > -1). \]

Here, m = −3/4, and n = −1/2. Thus,

\[ r = \frac{\Gamma(1/4)\Gamma(1/2)\,a}{4\,\Gamma(3/4)}. \]

Since $\Gamma(p)\Gamma(1 - p) = \pi/\sin(p\pi)$ for 0 < p < 1, we have that $\Gamma(1/2) = \sqrt{\pi}$ and
$\Gamma(3/4) = \pi\sqrt{2}/\Gamma(1/4)$. Thus,

\[ a = \frac{4\sqrt{2\pi}\,r}{\Gamma(1/4)^2}. \]

Hence, the radius of the inflated balloon is about 0.7627 times the radius of the
deflated balloon.
The thickness of the balloon can be determined as 2f(0), or

\[ 2\int_0^a \frac{x^2\,dx}{\sqrt{a^4 - x^4}}. \]

Making the same substitution as before, we get

\[ \frac{\Gamma(3/4)\Gamma(1/2)\,a}{2\,\Gamma(5/4)}. \]

We can use the fact that $\Gamma(p + 1) = p\Gamma(p)$ for all p to simplify $\Gamma(5/4) =
\Gamma(1/4)/4$. Thus, the thickness of the balloon is given by

\[ \frac{16\pi^2 r}{\Gamma(1/4)^4}, \]

or about 0.9139 times the radius of the deflated balloon.
Finally, the volume of the balloon is given by

\[ V = 4\pi\int_0^a x f(x)\,dx = 4\pi\int_0^a\!\!\int_x^a \frac{x t^2}{\sqrt{a^4 - t^4}}\,dt\,dx. \]

If we treat this as a double integral over the region $0 \le x \le a$ and $x \le t \le a$, we
can change the order of integration to get

\[ V = 4\pi\int_0^a\!\!\int_0^t \frac{x t^2}{\sqrt{a^4 - t^4}}\,dx\,dt = \int_0^a \frac{2\pi t^4}{\sqrt{a^4 - t^4}}\,dt. \]

Using $t = au^{1/4}$, we get

\[ V = \int_0^1 \frac{\pi a^3}{2}\,u^{1/4}(1 - u)^{-1/2}\,du = \frac{\pi a^3\,\Gamma(5/4)\Gamma(1/2)}{2\,\Gamma(7/4)} = \frac{64\pi^2 r^3}{3\,\Gamma(1/4)^4}. \]

Thus, the volume of the inflated balloon is about $1.2185r^3$, where r is the radius of
the deflated balloon.

At this point we can estimate how much crimping will take place on different
parts of the balloon. To do this, consider the effect on the balloon if the mylar was
able to shrink, but not stretch. The overall shape of the balloon would remain
unchanged, but the crinkles would be able to be smoothed out, forming the convex
shape given by f(x), which I will call the rectified shape of the balloon. Of course,
the true shape of the balloon is different, since the mylar does not shrink.

Let us define the crimping factor at a point on the balloon to be the ratio
between the surface area of a small patch of the balloon to the corresponding
patch on the rectified balloon, taken in the limit as the area of the patch
approaches 0. By this definition, the crimping factor must be greater than or equal to 1.
Since we have a model of the rectified balloon, given by f(x), we can use this to
compute the surface area of a small patch, and compare this to the same patch of
the deflated balloon.

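Consider the patch of the inflated balloon which lies over the polar interval
$x \le r \le x + \Delta x$, $\alpha \le \theta \le \alpha + \Delta\alpha$. The surface area of this patch of the rectified
balloon is given by

\[ \int_x^{x+\Delta x}\Delta\alpha\,s\sqrt{1 + f'(s)^2}\,ds \approx \Delta x\,\Delta\alpha\,x\sqrt{1 + f'(x)^2} = \frac{x a^2\,\Delta x\,\Delta\alpha}{\sqrt{a^4 - x^4}}. \]

Note that a point x units from the center of the inflated balloon came from a
point

\[ \int_0^x \sqrt{1 + f'(s)^2}\,ds = \int_0^x \frac{a^2\,ds}{\sqrt{a^4 - s^4}} \]

units away from the center of the deflated balloon. Thus, the same patch on the
deflated balloon would be the polar interval

\[ \int_0^x \frac{a^2\,ds}{\sqrt{a^4 - s^4}} \le r \le \int_0^{x+\Delta x}\frac{a^2\,ds}{\sqrt{a^4 - s^4}}, \quad \alpha \le \theta \le \alpha + \Delta\alpha. \]

The area is given approximately as

\[ \frac{a^2\,\Delta x\,\Delta\alpha}{\sqrt{a^4 - x^4}}\int_0^x \frac{a^2\,ds}{\sqrt{a^4 - s^4}}. \]

Thus, the ratio between these two areas is

\[ C(x) = \frac{1}{x}\int_0^x \frac{a^2\,ds}{\sqrt{a^4 - s^4}}. \]

Note that $\lim_{x \to 0} C(x) = 1$, so there is no crimping at the “poles” of the
balloon. The maximum crimping occurs at the equator, x = a, for which $C(a) =
r/a = \Gamma(1/4)^2/(4\sqrt{2\pi}) \approx 1.31$. For comparison, this is about the same amount of
crimping as the function 1.23 sin(x).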

It is interesting to compare the mathematical results to some experimental
measurements. Although the volume of a mylar balloon is difficult to measure, the
radii for the inflated and deflated balloons, as well as the thickness, are relatively
easy. Using the standard technique of measuring five times, and averaging the
middle three measurements, I found that the deflated diameter of a given mylar
balloon was 43.85 cm, while the diameter of the inflated balloon was 33.35 cm. The
thickness was measured to be 20.166 cm. Thus, r ≈ 21.925 cm, a ≈ 16.675 cm, and
f(0) ≈ 10.083 cm. The ratio of the radii is experimentally 0.7605, which is surprisingly
close to the theoretical result of 0.7627. The ratio of the thickness of the
balloon to the original radius is experimentally 0.9198, compared to the theoretical
result of 0.9139.
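
The theoretical ratios quoted here can be reproduced in a few lines. The sketch below evaluates the closed forms with the gamma function and, as an independent check, integrates the constraint numerically after the substitution x = a sin θ, which is our choice (it removes the endpoint singularity) and not the substitution used in the text. All function names are hypothetical.

```python
import math

G = math.gamma(0.25)  # Gamma(1/4)

def radius_ratio():
    """a / r = 4 sqrt(2 pi) / Gamma(1/4)^2."""
    return 4 * math.sqrt(2 * math.pi) / G**2

def thickness_ratio():
    """2 f(0) / r = 16 pi^2 / Gamma(1/4)^4."""
    return 16 * math.pi**2 / G**4

def volume_coefficient():
    """V / r^3 = 64 pi^2 / (3 Gamma(1/4)^4)."""
    return 64 * math.pi**2 / (3 * G**4)

def midpoint(func, lo, hi, n=2000):
    """Plain midpoint rule on [lo, hi] with n panels."""
    h = (hi - lo) / n
    return h * sum(func(lo + (i + 0.5) * h) for i in range(n))

def radius_ratio_numeric():
    """With x = a sin(t), the constraint (*) reads r = a * integral of
    dt / sqrt(1 + sin^2 t) over [0, pi/2]; return a / r."""
    integral = midpoint(lambda t: 1 / math.sqrt(1 + math.sin(t)**2), 0, math.pi / 2)
    return 1 / integral

def thickness_ratio_numeric():
    """2 f(0) / r using the same substitution in the integral for f(0)."""
    integral = midpoint(lambda t: math.sin(t)**2 / math.sqrt(1 + math.sin(t)**2),
                        0, math.pi / 2)
    return 2 * radius_ratio_numeric() * integral
```

The closed forms evaluate to roughly 0.7628, 0.9139 and 1.2185, and the numerical integrals agree with the first two; set beside the measured 0.7605 and 0.9198, the model is within about one percent of the experiment.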

This problem could be generalized by considering different shapes of balloons. 
That is, instead of starting with two circular disks, we fuse together two pieces of 
mylar of some other shape. The most common variant would be the heart-shaped 
balloons, seen mostly around Valentine’s Day. However, this problem would be 
very formidable analytically. A more reasonable problem would be to find the 
shape of a toroid balloon, formed by two annular rings. This shape is suggested by
the inflatable swimming rings. This problem could be solved using methods similar
to those above, except that the boundary conditions would make the problem much
more complex.
The Law and Mathematicians


From an article by Henry Louis Gates, Jr., in ACADEME (January- 
February, 1994, p. 17): 


One recalls Justice William O. Douglas’s 1973 remarks, “One of the
most offensive experiences in my life was a visit to a nation where
bookstalls were filled only with books on mathematics and books on
religion.”

Submitted by Raymond Greenwell

Hofstra University 


From a letter by Doug Jungreis (UCLA): 


I was on jury duty for 3 weeks... I was assigned to a few jury 
selections but never put on the jury. In one jury selection, the third 
question that the judge asked all the potential jurors was, “Are you or 
a family member a mathematician or engineer?” Apparently they have
trouble with us math people because we don’t understand the idea of 
“proof beyond a reasonable doubt.” 


Submitted by Joe Gallian 
Euler’s Theorem for Polyhedra:
A Topologist and Geometer Respond 


Peter Hilton and Jean Pedersen 


In their stimulating paper [1], to which we here make a friendly and constructive 
response, the authors, Branko Grünbaum and Geoffrey Shephard, introduce the
interesting geometric concept of polyhedral set, generalizing the familiar notion of
polyhedron, but confining themselves, for their present purposes, to subsets of $\mathbb{R}^3$.
They discuss dissections of such sets, especially relatively open convex dissections
of bounded polyhedral sets and show, by easily accessible arguments, the nice
properties of the Euler characteristic $\chi$ relative to such dissections. They are
thereby led to a formula for $\chi(P)$, namely, Theorem 4 of [1],


‘If $P$ is a polyhedral set, then $V - E + F - C = \chi(P)$.’


Here $V$, $-E$, $F$, $-C$ are defined as $\chi$($k$-scaffold of $P$), $k = 0, 1, 2, 3$, so the
resemblance of their Theorem 4 to the classical theorem of Euler might seem at 
first to owe more to a judicious choice of notation than to a conceptual closeness; 
however, careful inspection of their definition of a k-scaffold and the contempla- 
tion of examples show that their notation may, indeed, be justified. 

The paper is, in our judgment, a very real contribution to polyhedral geometry, 
clearly written, and enriched, as one would expect of these authors, by clear 
illustrative diagrams. Why then did we seek the approval of the Editor of the 
Monthly to publish a response? It is because the paper appears, in contrast to its 
obvious positive contribution, to both neglect and distort the contribution which 
algebraic and combinatorial topologists have made to the development of the
Euler characteristic. We claim that the ‘problem’, referred to on page 110 of [1],
that ‘a simple numerical identity¹ is now seen to be hedged with additional
conditions or exceptional cases’ is no real problem at all, and that topologists have 
completely and satisfactorily formalized the Euler characteristic (now generalized 
to the Euler-Poincaré characteristic and applied to compact polyhedra of any 
dimension) and in the process fully understood its topological—and hence combi- 
natorial—invariance. Moreover, we deny categorically that ‘this development led 
to a loss of the connection to the origins of Euler’s theorem as a relation involving 
the vertices, edges and faces of a polyhedral object.’ It is an irony that this 
extraordinary allegation should be almost the only reference to topological devel- 
opments in 5 pages devoted to ‘Historical Remarks and Comments.’ 

Specifically, we reject the calculation of $v - e + f$ for Figure 1(b) which we
here reproduce.² We claim that a polyhedron is made of cells and a closed $n$-cell is
the homeomorph of the $n$-dimensional ball $x_1^2 + x_2^2 + \cdots + x_n^2 \le 1$ in $\mathbb{R}^n$. In


¹The authors of [1] are, of course, referring to $v - e + f = 2$.
²Reproduced figures are accompanied by their original captions.
Figure 1. (a) A polyhedron to which Euler’s theorem in its elementary form $v - e + f = 2$ does not
apply. (b) A polyhedron of genus $g = 1$ to which Euler’s theorem in the form $v - e + f = 2 - 2g$ does
not apply.


particular, a 2-dimensional face must be a polygonal region. Indeed, one might 
legitimately comment that it is unreasonable to expect that one can count the 
number of holes by counting subsets which are themselves allowed to have holes! 
Moreover, if one allows strange (2-dimensional) faces as in Figure 1(b), then why 
not regard the entire surface as a face and thus obtain $v - e + f = 1$ for any
surface! Finally, we remark that it should never happen that scratching a few extra 
line segments on a surface—the only difference between Figures 1(a) and 1(b) of 
[1]—should alter a really significant invariant quantity. 

The error in permitting Figure 1(b) as a legitimate dissection of the torus is to 
be found in many parts of the world (including, for example, the UK and New 
Zealand); so much so that we have ourselves written an article [2] attempting to 
stem the tide of error.

We regard the theorem (referred to somewhat disparagingly in [1]) that the 
Euler-Poincaré characteristic equals the alternating sum of the Betti numbers as 
fundamental. It not only explains Euler’s original result (since orientable closed 
surfaces bounding convex regions are homeomorphic to the sphere), but establishes
the homotopy invariance (not merely the topological invariance) of $\chi$. Of
course it also explains formula (2) of [1], that is 


$$v - e + f = 2 - 2g, \qquad (2)$$


for an orientable closed surface of genus g, since the Betti numbers of such a 
surface are given by $p_0 = 1$, $p_1 = 2g$, $p_2 = 1$. Thus formula (2) is scarcely a great
stride in extending Euler’s Theorem and so should not be seen as ‘dealing with a
discrepancy’. Nor should Figure 2 of [1] create a problem (once one ‘has under- 
stood the instructions for the construction of the polyhedron P in question’; 
apparently the top and bottom faces of the cube are missing and the ‘drilling’ takes 
place through empty space; are we right?). Certainly $\chi(P) = 1$ for this polyhedron,
so formula (2) does not apply. But $P$ is not a closed orientable surface, so
how could formula (2) apply? 
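A small illustration of formula (2) for dissections whose faces are genuine polygonal regions: the sketch below counts vertices, edges and faces for a cube and for a grid of quadrilaterals wrapped onto a torus, and compares the result with the alternating sum of Betti numbers. The function names and the grid construction are illustrative assumptions, not taken from [1] or from the response.

```python
def euler_characteristic(v, e, f):
    """Alternating count v - e + f for a cell dissection of a surface."""
    return v - e + f

def characteristic_from_genus(g):
    """Alternating sum of Betti numbers p0 - p1 + p2 = 1 - 2g + 1."""
    return 1 - 2 * g + 1

def torus_grid_counts(m, n):
    """(v, e, f) for an m-by-n grid of quadrilaterals wrapped onto a torus.

    Each vertex is the lower-left corner of exactly one quadrilateral and the
    start of exactly two edges, so v = f = m*n and e = 2*m*n.
    """
    return m * n, 2 * m * n, m * n

cube = (8, 12, 6)               # surface of a cube: genus 0
torus = torus_grid_counts(3, 3)  # every face is a disk: genus 1

print(euler_characteristic(*cube), characteristic_from_genus(0))
print(euler_characteristic(*torus), characteristic_from_genus(1))
```

Refining the grid changes the three counts but not their alternating sum, which is the invariance the response insists on.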

In this century topologists have made great strides generalizing the properties of 
the Euler-Poincaré characteristic (the Hopf Trace Formula, the Lefschetz Fixpoint
Theorem, ...), and applying it to many problems like the vector field problem and 
studies of multiplicative structures on manifolds. We may also cite Gottlieb’s 
Theorem on aspherical finite complexes $X$: If $\chi(X) \ne 0$, then the fundamental
group of X has trivial center. Very recently, Ross Geoghegan and Andrew Nicas 


3We interpret this figure as obtained from a sphere by identifying two of its points; of course, the 
resulting space is not a manifold at this point. 
Figure 2. A funnel-shaped polyhedron for which the elementary forms of Euler’s theorem are not valid.


have been engaged in developing important higher-order analogs $\chi_1$ of $\chi$ which
are relevant, for example, to the study of homotopies of self-maps of manifolds.
They further show essentially that, if $X$ is aspherical, then $\chi_1(X) \ne 0$ implies that
the fundamental group of X is infinite cyclic. 

It is splendid that geometers as well as topologists are making progress in this 
field; let neither disparage the efforts and successes of the other. 
RESPONSE FROM GRÜNBAUM AND SHEPHARD


Our paper A New Look at Euler’s Theorem for Polyhedra seems to have caused 
some lively discussion. Naturally we are very pleased; it is far better for a paper to 
attract comment (even if critical) than to gather dust, unread, on the shelves of 
libraries. 

However, the “Response” by Peter Hilton and Jean Pedersen seems to us to be 
misleading and inappropriate. On a matter of detail, we never claimed that the 
“calculation” of the Euler characteristic of the polyhedron in Figure 1(b) was 
correct. It was presented as an example of how earlier authors, hampered by 
inadequate definitions, were misled in their treatment of the subject. Our intention 
is clear from the “Introduction”, as well as from the more detailed discussion in 
the “Historical Remarks and Comments.” The rejection of that “calculation”, and
of the similar one for the polyhedron in Figure 2, was our explicit aim, obviously 
missed by Hilton and Pedersen. 

However, our objection to the “Response” is much deeper. Their comment that 
“topologists have completely and satisfactorily formalized the Euler characteristic 
(now generalized to the Euler-Poincaré characteristic as applied to compact
polyhedra of any dimension)” is true, but irrelevant. In particular, we reject the
implication that the geometrical approach we propose is redundant. The purpose 
of our paper, as stated in the Introduction, is to show how the “traditional” 
approach of counting vertices, edges and faces, can, with care and appropriate 
modifications, lead to results which are valid and far-reaching. In particular, there 
is no need, as topologists generally do, to restrict attention to bounded polyhedra 
which can be expressed as finite unions of topological balls all of which are open, 
or all of which are closed. This restriction eliminates many of our examples in the 
paper, to which our methods apply. In particular, it seems unreasonable to us to 
reject a polyhedron such as that shown in Figure 9(a) by stating dogmatically that 
every “2-dimensional face must be a polygonal region” and then to define a
polygonal region in such a way as to eliminate multiply-connected regions. Euler 
himself, if he were alive today, would, we believe, feel far more comfortable with 
our approach to his theorem than that advocated in the “Response”.

Our purpose in writing the paper was to show that a simple geometrical 
treatment of a classical topic can lead to results that are more general, and more 
intelligible to the majority of mathematicians, than the recondite and much more 
abstract topological approach. Hence our “neglect” of topology. On the other 
hand, we would be very pleased if topological generalizations of our approach were 
to be developed. 

We would like to take this opportunity to point out some errors that crept into 
the captions of two of the figures in our paper; we are grateful to many colleagues 
for drawing our attention to these. In Figure 4, we should have: (c) 0 (8, 8, 0); (d) 6
(8, 0, −2); (e) 4 (6, 2, 0). In Figure 5, the correct values are: (c) −4 (3, 9, 2); (d) 2
(2, 5, 5).


(Added August 19, 1994) In “Die Euler-Poincaré-Charakteristik und ihre topologische
Weiterentwicklung” (Elemente der Mathematik, 49 (1994), pp. 66-76) P. J.
Hilton refers to an unnamed paper, which appears to be our paper in the 
“Monthly”. He presents there objections very similar to those in the above 
Response; in fact he goes further and describes parts of the paper as “complete
nonsense!” (“vollkommener Unsinn!”).


Department of Mathematics, GN-50
University of Washington
Seattle, WA 98195
grunbaum@math.washington.edu

School of Mathematics
University of East Anglia
Norwich NR4 7TJ, England
G.Shephard@uea.ac.uk


Common sense is the collection of prejudices acquired by 
age eighteen.


—Albert Einstein (1879-1955) 
E. T. Bell, Mathematics, Queen and Servant of the Sciences,
New York: McGraw Hill, 1951. 
The Role of Paradoxes in the Evolution
of Mathematics 


I. Kleiner and N. Movshovitz-Hadar 


A paradox has been described as a truth standing on its head to attract attention. 
Undoubtedly, paradoxes captivate. They also cajole, provoke, amuse, exasperate, 
and seduce. More importantly, they arouse curiosity, stimulate, and motivate. 

In this paper we present examples of paradoxes from the history of mathematics 
which have inspired the clarification of basic concepts and the introduction of 
major results. Our examples will deal with numbers, logarithms, functions, continu- 
ity, tangents, infinite series, sets, curves, and decomposition of geometric objects. 

We will use the term “paradox” in a broad sense to mean an inconsistency, a 
counterexample to widely held notions, a misconception, a true statement that 
seems to be false, or a false statement that seems to be true. It is in these various 
senses that paradoxes have played an important role in the evolution of mathemat- 
ics. Indeed, as Bell and Davis, respectively, put it: 


The mistakes and unresolved difficulties of the past in mathematics have always been the 
opportunities of its future ([1], p. 283).


One of the endlessly alluring aspects of mathematics is that its thorniest paradoxes have a way of 
blooming into beautiful theories ([6], p. 55).


Paradoxes can also serve a useful role in the classroom. The temporary confu- 
sion and insecurity which they may engender in students can be put to good use. 
Conflict and predicament are useful pedagogical devices (provided, of course, that 
they are dealt with). They may foster a positive attitude to “getting stuck,” provide 
the opportunity to participate in debate and controversy over mathematical issues, 
and promote the realization that mathematics often develops in this very way. 
Teachers may gain a better appreciation of students’ difficulties in coming to grips 
with concepts and results with which some of the greatest mathematicians of all 
time struggled. Such concepts and results, while paradoxical and challenging at the 
time, became commonplaces in subsequent generations. In the words of Kasner 
and Newman ([12], p. 193):


The testament of science is so continually in a flux that the heresy of yesterday is the gospel of 
today and the fundamentalism of tomorrow. 


PARADOXES INVOLVING NUMBERS. The evolution of the concept of number 
has been beset by paradoxes almost every step of the way. As P. J. Davis put it ([7],
p. 305): 


It is paradoxical that while mathematics has the reputation of being the one subject that brooks 
no contradictions, in reality it has a long history of successful living with contradictions. This 
is best seen in the extensions of the notion of number that have been made over a period of
2500 years. From limited sets of integers, to fractions, negative numbers, irrational numbers, 
complex numbers, transfinite numbers, each extension, in its way, overcame a contradictory set 
of demands. 


The first sentence in the above quotation may be thought of as a 
“metaparadox”: a nontechnical, paradoxical statement about technical, paradoxical
phenomena. We will point out a variety of such metaparadoxes; they are
interesting in their own right as issues for philosophical discussion or contempla- 
tion. But now to some paradoxes dealing with the evolution of various number 
systems. 


(a) The Pythagoreans of the 6th century B.C. believed that every line segment 
can be measured by a positive integer or the ratio of two such integers. This was to 
them not merely a very plausible fact, but an article of faith, an aspect of their 
philosophy. Moreover, the idea formed the basis of the pythagorean theory of 
proportion (see [23]). It was thus a great shock (paradox) to them when they 
discovered that the diagonal of a unit square cannot be measured by a whole 
number or by a ratio of whole numbers; or, as the Greeks put it, that the diagonal 
and side of a square are incommensurable. Their proof of this result is essentially 
the one we use today to show that $\sqrt{2}$ is irrational. The paradox was arrived at by
using the Pythagorean Theorem. Thus the 


Metaparadox: The Pythagorean Theorem was the undoing of the pythagorean 
philosophy and the pythagorean theory of proportion. 


The discovery of the incommensurability of the diagonal and side of a square 
had far-reaching consequences for Greek mathematics. On the positive side, it
inspired Eudoxus to found a sophisticated theory of proportion which applied to 
both commensurable and incommensurable magnitudes. This, in turn, motivated 
Dedekind more than two millennia later to define the real numbers via Dedekind 
cuts. On the debit side, it turned the direction of Greek mathematics (at least in its 
very productive, classical period) from a harmonious collaboration of number and 
geometry to an almost exclusive concern with geometry. 


(b) The introduction of negative numbers into mathematics and their subse- 
quent use occasioned considerable consternation and difficulties. A major concep- 
tual framework that had to be abandoned was the prohibition of subtracting a 
greater from a smaller number. As Wallis in the 17th century put it ([20], p. 438):
“[How can] any magnitude ... be less than nothing, or any number fewer than
none?”

Among other paradoxes having to do with negative numbers are the following 
two: 

(i) Wallis “proved” that negative numbers are greater than infinity. His argument
was that since (for positive $a$) $a/0 = \infty$, we have $a/(\text{negative number}) > \infty$; this is so because
decreasing the denominator increases the fraction.

(ii) In a letter to Leibniz, Arnauld (a 17th-century mathematician and philosopher)
objected to the equality $\frac{1}{-1} = \frac{-1}{1}$ on the grounds that the ratio of a greater
to a smaller quantity cannot equal the ratio of a smaller to a greater. Leibniz 
agreed this was a difficulty, but argued for the tolerance of negative numbers 
because they are useful and, in general, lead to consistent results. See [5], pp. 
39-40. 
Justification of otherwise inexplicable notions on the grounds that they yield
useful results has occurred frequently in the evolution of mathematics. This brings 
up the following 


Metaparadox: How can meaningless (or at best inexplicable) things be so useful? 


Of course, out of meaninglessness (or confusion) emerged, in time, clarity and 
understanding. 


(c) The solution by radicals of cubic equations was one of the great achievements
of 16th-century mathematics. Cardan’s solution of the cubic $x^3 = ax + b$
was given by the formula

$$x = \sqrt[3]{\frac{b}{2} + \sqrt{\left(\frac{b}{2}\right)^2 - \left(\frac{a}{3}\right)^3}} + \sqrt[3]{\frac{b}{2} - \sqrt{\left(\frac{b}{2}\right)^2 - \left(\frac{a}{3}\right)^3}}.$$

Bombelli applied it to the equation $x^3 = 15x + 4$ to obtain
$x = \sqrt[3]{2 + \sqrt{-121}} + \sqrt[3]{2 - \sqrt{-121}}$. Cardan had earlier denied the applicability of his formula to such
equations since it introduced square roots of negative numbers, which he rejected.
But Bombelli noted (by inspection) that $x = 4$ is a solution of $x^3 =
15x + 4$. (The other two roots, $-2 \pm \sqrt{3}$, are also real.) Here was a paradox: The
roots of $x^3 = 15x + 4$ are real, yet the formula yielding the roots involved
complex, and at the time meaningless, numbers. “The whole matter seemed to rest
on sophistry rather than on truth,” noted Bombelli ([15], p. 2). And he set himself
the task of resolving that sophistry, which resulted in the birth of complex
numbers.¹ Birth, however, did not entail legitimacy. It took another two and a half
centuries before complex numbers were accepted as bona fide mathematical
entities.
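With complex arithmetic now taken for granted, Bombelli's observation can be checked directly. The sketch below is only an illustration: the function name `cardano_root` is hypothetical, and choosing the second cube root as $(a/3)/u$ is a modern convenience to pair the roots consistently, not Bombelli's own procedure.

```python
import cmath

def cardano_root(a, b):
    """One root of x^3 = a*x + b via Cardan's formula in complex arithmetic."""
    disc = cmath.sqrt((b / 2) ** 2 - (a / 3) ** 3)
    u = (b / 2 + disc) ** (1 / 3)      # principal complex cube root
    v = (a / 3) / u if u != 0 else 0   # partner cube root, chosen so that u*v = a/3
    return u + v

x = cardano_root(15, 4)   # Bombelli's equation x^3 = 15x + 4
print(x)                  # numerically 4, up to rounding in the imaginary part
```

Here the intermediate quantities are $2 + 11i$ and its cube root $2 + i$, yet the sum of the two conjugate cube roots is real.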


PARADOXES INVOLVING LOGARITHMS. The issue of the meaning of loga- 
rithms of negative and complex numbers arose in the early 18th century in 
connection with integration. In analogy with the real case, Johann Bernoulli 
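integrated $1/(x^2 + a^2)$ as follows:

$$\int \frac{dx}{x^2 + a^2} = \int \frac{dx}{(x + ai)(x - ai)} = -\frac{1}{2ai}\int\left(\frac{1}{x + ai} - \frac{1}{x - ai}\right)dx$$

$$= -\frac{1}{2ai}\left[\log(x + ai) - \log(x - ai)\right] = -\frac{1}{2ai}\log\frac{x + ai}{x - ai}.$$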


In an exchange of letters (begun in 1702 and lasting sixteen months) Bernoulli and 
Leibniz argued about the meaning of $\log\frac{x + ai}{x - ai}$, and, in particular, about
the meaning of log(—1). Bernoulli asserted that log(—1) is real while Leibniz 
claimed it is imaginary, each advancing various arguments to support his view. For 


¹Bombelli developed rules for manipulating expressions of the form $a + b\sqrt{-1}$ and was thereby
able to show that (one of the values of) $\sqrt[3]{2 + \sqrt{-121}} + \sqrt[3]{2 - \sqrt{-121}}$ is indeed 4.
example, Bernoulli argued that since

$$\frac{dx}{x} = \frac{d(-x)}{-x}, \qquad \text{we have} \qquad \int \frac{dx}{x} = \int \frac{d(-x)}{-x},$$

hence $\log x = \log(-x)$. In particular, $\log(-1) = \log 1 = 0$. Among Leibniz’ arguments
were the following:

(i) Since the range of log a, for a > 0, is all real numbers, it follows that log a, 
for a <0, must be imaginary, because the real numbers have already been 
“spoken for’’. 

(ii) If $\log(-1)$ were real, then $\log i$ would also be real, since
$\log i = \log(-1)^{1/2} = \frac{1}{2}\log(-1)$. But this is clearly absurd, alleges Leibniz.


(iii) Putting $x = -2$ in the expansion

$$\log(1 + x) = x - \frac{x^2}{2} + \frac{x^3}{3} - \cdots$$

yields $\log(-1) = -2 - \frac{4}{2} - \frac{8}{3} - \cdots$. Since the series on the right diverges, it
cannot be real, hence must be imaginary.


The above are indeed interesting examples of the art (not to say “science”) of 
symbolic manipulation practiced by some of the greatest mathematicians of the 
17th and 18th centuries. The resulting paradoxes had “for a long time ... tormented
me,” noted Euler ([17], p. 72). He resolved them in a 1749 paper. We
quote from its interesting introduction ([14], p. 4):


Since logarithms are clearly part of pure mathematics it may well be surprising to learn that they 
have been until now the subject of an embarrassing controversy in which whatever side is taken 
contradictions appear that seem completely impossible to resolve. Meanwhile if truth is to be 
universal there can be no doubt that these contradictions, ..., however unresolved they seem can 
only be apparent. ...I will bring out fully all the contradictions involved so that it may be seen 
how difficult it is to discover truth and to guard against inconsistency even when two great men 
are working on the problem. 


The crux of Euler’s solution was the Euler-Cotes formula $e^{i\theta} = \cos\theta + i\sin\theta$.
It implies that $e^{i(\pi + 2n\pi)} = \cos(\pi + 2n\pi) + i\sin(\pi + 2n\pi) = \cos\pi + i\sin\pi = -1$,
so that $\log(-1) = i(\pi + 2n\pi)$, where $n = 0, \pm 1, \pm 2, \ldots$. Thus $\log(-1)$ is
multivalued (in fact, infinite-valued) and all its values are complex. Both Bernoulli
and Leibniz were wrong, the former “more so” than the latter. 
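Euler's resolution is easy to illustrate numerically. In the sketch below, `log_minus_one` is a hypothetical helper returning the $n$-th value $i(\pi + 2n\pi)$; each value is exponentiated to confirm that it lands on $-1$, and the library's principal logarithm is shown to be the $n = 0$ value.

```python
import cmath
import math

def log_minus_one(n):
    """The n-th value of log(-1) in Euler's resolution: i*(pi + 2*n*pi)."""
    return 1j * (math.pi + 2 * n * math.pi)

for n in range(-2, 3):
    w = log_minus_one(n)
    # Every listed value exponentiates back to -1 (up to rounding).
    print(n, w, cmath.exp(w))

principal = cmath.log(-1)   # the library returns only the n = 0 value
print(principal)
```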


PARADOXES INVOLVING FUNCTIONS. The concept of function originated in 
the early 18th century. Newton and Leibniz invented the calculus in the latter part 
of the 17th century. Here, then, is a 


Metaparadox: Calculus without functions. 


The calculus of Newton and Leibniz was a calculus of curves (given by equations) 
rather than of functions. 

A function was viewed at different times as a formula, a curve, or an arbitrary 
correspondence. Paradoxes turned up to dethrone one or another of these views of 
functionality. Even the very meaning of a formula, as well as its scope (i.e., the
functions that are representable by formulas), changed over time, and were often 
subjects of considerable controversy. For example: 
(a) To Euler and his contemporaries of the mid-18th century a function meant a
formula, where the latter concept, though not rigorously defined, was broadly 
construed to allow (among other things) infinite sums and products in its forma- 
tion. There were several implicit assumptions: 

(i) The function (formula) had to be given by a single expression. For example,

$$f(x) = \begin{cases} x, & x \ge 0 \\ -x, & x < 0 \end{cases}$$

was not considered a function.

(ii) The independent variable had to range over all real numbers (except
possibly for isolated points, as in $f(x) = 1/x$). For example, $f(x) = x$, $0 < x < 1$,
was not considered a function.

(iii) Two functions which agreed on an interval were assumed to agree every- 
where on the line. 

The significance of these assumptions was the fact that the algorithms of the 
calculus applied at that time only to such functions. 

Many of the 18th-century (mis)conceptions about functions were overturned by 
Fourier’s work on heat conduction in the early decades of the 19th century. As a 
result of this work Fourier claimed to have shown that any function defined on 
some interval can be represented on that interval as an infinite series of sines and 
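cosines, that is, a Fourier series. For example, if

$$f(x) = \begin{cases} -1, & -\pi < x < 0 \\ 0, & x = 0 \\ 1, & 0 < x < \pi, \end{cases}$$

then

$$f(x) = \frac{4}{\pi}\left(\frac{\sin x}{1} + \frac{\sin 3x}{3} + \frac{\sin 5x}{5} + \cdots\right) \quad \text{for all } x \in (-\pi, \pi).$$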


Several fundamental departures concerning functions resulted from Fourier’s 
work: 


1. It became legitimate, and important, to consider functions whose domain is 
an interval rather than the entire real line. 

2. Two functions could agree on an interval but differ outside the interval. 

3. A function given by two or more distinct expressions could equal a function 
given by a single expression. 


(b) In an 1829 paper on Fourier series Dirichlet introduced the so-called 
Dirichlet function

$$D(x) = \begin{cases} 1, & \text{if } x \text{ is rational} \\ 0, & \text{if } x \text{ is irrational.} \end{cases}$$


This function was neither a formula nor a curve. It was a new type of function, 
described by a correspondence. It was the first of many functions which came to be 
called “pathological”, but not for very long (see [25]).


²Although this result is, of course, incorrect (given our conception of functions) in the generality
which Fourier claimed for it, a large class of functions can be represented by Fourier series. In fact, 
Fourier’s contemporaries would have been hard put to find an exception. 

³In the latter part of the 18th century, following debates surrounding the famous vibrating-string
problem, it became legitimate (at least in some quarters) to consider functions defined by several 
expressions. See [16]. 
At the end of the 19th century Baire extended (again) the notion of formula. To
him it meant an expression obtained from a variable and constants by a (possibly 
countable) iteration of additions, multiplications, and the taking of limits. He 
called such a function analytically representable and showed that the Dirichlet 
function is of this type: $D(x) = \lim_{m \to \infty} \lim_{n \to \infty} (\cos(m!\,\pi x))^n$. Thus the “pathological”
Dirichlet function became a “tame” analytically representable function.

Is analytic representability a universal mode of representability of functions? 
That is, are there functions which are not analytically representable? Yes and no. 
If you are a formalist, you can show by a counting argument that the set of 
analytically representable functions has cardinality c, while the set of all functions 
(clearly) has cardinality $2^c$. Thus there are uncountably many functions which are
not analytically representable. But no one has given a constructive example of even 
one. 


PARADOXES INVOLVING CONTINUITY. Although the concept of continuity is 
nowadays fundamental in mathematics, its modern definition was not formulated 
until the 19th century, about 150 years after the invention of the calculus by 
Newton and Leibniz. In the 18th century, Euler did define a notion of continuity in 
response to the famous vibrating-string controversy ([8], p. 301). Thus a continuous
function was one given by a single expression (formula), while a function given by 
several expressions was considered discontinuous. For example, to Euler the 
function 


$$f(x) = \begin{cases} x, & x \ge 0 \\ -x, & x < 0 \end{cases}$$

was discontinuous, while the function comprising the two branches of a hyperbola
was considered continuous, since it is given by the single expression $f(x) = 1/x$ (see
[16], p. 301).

The work on Fourier series showed the untenability of the 18th-century notion 
of continuity. For example, the function 


$$g(x) = \begin{cases} -1, & -\pi < x < 0 \\ 0, & x = 0 \\ 1, & 0 < x < \pi \end{cases}$$


could (as we have seen) be represented by a single expression, namely its Fourier 
series, hence it was both continuous and discontinuous in the 18th-century sense of 
that concept. 

In an 1821 work Cauchy initiated a reappraisal and reorganization of the 
foundations of 18th-century calculus. In this work Cauchy defined continuity 
essentially as we understand the concept today, although he used the then-prevailing
language of infinitesimals rather than the now-accepted $\varepsilon$-$\delta$ formulation
given by Weierstrass in the 1850s. The shift in point of view from Euler’s to 
Cauchy’s conceptions of continuity was fundamental. In the former case continuity 
was a global property while in the latter case it was a local property. But the 
concept of continuity proved to be very subtle, and was not completely understood 
even by Cauchy and his contemporaries of the early to mid-19th century. For 
example: 

(a) Cauchy “proved” that an infinite sum (a convergent series) of continuous 
functions is a continuous function ([4], p. 110). This, of course, is incorrect. A
counterexample was given by Abel in the 1820s; it is essentially the series

$$\sum_{n=0}^{\infty} \frac{\sin(2n + 1)x}{2n + 1}$$

we encountered earlier, which is discontinuous at $x = k\pi$, $k = 0, \pm 1, \pm 2, \ldots$.
The error in Cauchy’s proof resulted from his failure to distinguish between
convergence and uniform convergence of a series of functions. In fact, “the
realization of the central role of the concept of uniform convergence in analysis
came about slowly in the last [19th] century” ([21], p. 97).
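The distinction Cauchy missed can be seen numerically in Abel's series. For $0 < x < \pi$ the sum is $\pi/4$, and at any fixed $x$ the partial sums approach it; but at the moving point $x = \pi/(2N)$ the $N$-term partial sum stays well above $\pi/4$, so the convergence is not uniform. The sketch below is an illustration only; the function names and the choice of test point are assumptions.

```python
import math

def abel_partial_sum(x, terms):
    """Sum of sin((2n+1)x)/(2n+1) for n = 0, ..., terms - 1."""
    return sum(math.sin((2 * n + 1) * x) / (2 * n + 1) for n in range(terms))

def overshoot(terms):
    """Error of the partial sum at the moving point x = pi/(2*terms)."""
    x = math.pi / (2 * terms)
    return abel_partial_sum(x, terms) - math.pi / 4

# Pointwise convergence at a fixed point ...
print(abel_partial_sum(1.0, 5000) - math.pi / 4)
# ... but the worst-case error near x = 0 does not shrink as terms are added.
for terms in (10, 100, 1000):
    print(terms, overshoot(terms))
```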

(b) Euler’s continuous functions were, in practice, differentiable (except possibly 
at isolated points). So were Cauchy’s—at least this is what Cauchy and his 
contemporaries believed, and what some of them “proved” (see [26]). It was 
therefore astonishing when Weierstrass in the 1860s gave an example of a continuous
function which is nowhere differentiable, namely $f(x) = \sum_{n=0}^{\infty} b^n \cos(a^n \pi x)$, $a$
an odd integer, $b$ a real number in $(0, 1)$, and $ab > 1 + (3\pi/2)$. This and similar
examples showed for the first time that the concept of continuity is considerably 
broader than that of differentiability, and thus established continuity as an impor- 
tant concept of investigation in its own right. The examples also showed the 
limitations of intuitive geometric reasoning in analysis, and thus the need for 
careful, analytic formulations of basic notions. 
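A numerical experiment cannot prove nowhere-differentiability, but it can show difference quotients blowing up at a single point. The sketch below uses the illustrative parameters $a = 13$, $b = 1/2$ (an assumption chosen to satisfy $ab > 1 + 3\pi/2$), truncates the series, and evaluates the quotient at $0$ along $h = a^{-m}$, where every term with $n \ge m$ contributes $-2b^n$.

```python
import math
from fractions import Fraction

A, B = 13, 0.5   # illustrative choice: a odd, 0 < b < 1, ab = 6.5 > 1 + 3*pi/2

def weierstrass(x, terms=40):
    """Partial sum of sum_n b^n * cos(a^n * pi * x) at a rational point x."""
    total = 0.0
    for n in range(terms):
        # Reduce a^n * x modulo 2 exactly before calling cos, so that large
        # arguments do not lose floating-point precision.
        phase = float((A ** n * Fraction(x)) % 2)
        total += B ** n * math.cos(math.pi * phase)
    return total

def difference_quotient_at_zero(m, terms=40):
    """(f(h) - f(0)) / h for h = a^(-m), using the truncated series."""
    h = Fraction(1, A ** m)
    return (weierstrass(h, terms) - weierstrass(0, terms)) / float(h)

for m in (1, 2, 3, 4):
    print(m, difference_quotient_at_zero(m))
```

The quotients grow in magnitude roughly like $(ab)^m$, so they have no finite limit as $h \to 0$ along this sequence.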

In a modern development of a different kind, Schwartz and Sobolev showed in 
the 1940s that every continuous function is, indeed, “differentiable”. But the
derivative is now a “generalized function” (a “distribution”). For example, if


$$f(x) = \begin{cases} 1, & x > 0 \\ \tfrac{1}{2}, & x = 0 \\ 0, & x < 0, \end{cases}$$

then

$$f'(x) = \begin{cases} 0, & x \ne 0 \\ \infty, & x = 0, \end{cases}$$

which is the Dirac delta “function” $\delta(x)$. As this example shows, there are even
discontinuous functions which are differentiable (in the Schwartz/Sobolev sense),
a shocking realization (it would have been) for mathematicians of the second
half of the 19th century.


PARADOXES INVOLVING ASPECTS OF THE CALCULUS (OTHER THAN 
CONTINUITY). (a) The calculus was invented (independently) by Newton and 
Leibniz in the last third of the 17th century. But many of its important ideas were 
foreshadowed in early-17th century works of prominent mathematicians, notably 
Fermat. In the late 1630s he devised a method for dealing with problems on 
tangents and on maxima and minima. The following example illustrates Fermat’s 
approach (see [8], p. 122): 

Suppose we wish to find the tangent to the parabola $y = x^2$ at some point
$(x, x^2)$. Let $x + e$ be a nearby point on the x-axis and let $s$ denote the subtangent
of the curve at the point $(x, x^2)$ (see accompanying diagram). Similarity of
triangles yields $x^2/s = k/(s + e)$, where $k$ is the height of the tangent line above
the point $x + e$. Fermat notes that $k$ is “approximately equal”
to $(x + e)^2$; writing this as $k \simeq (x + e)^2$ we get

$$\frac{x^2}{s} \simeq \frac{(x + e)^2}{s + e}.$$

Solving for $s$ gives

$$s \simeq \frac{ex^2}{(x + e)^2 - x^2} = \frac{ex^2}{x^2 + 2ex + e^2 - x^2} = \frac{ex^2}{e(2x + e)} = \frac{x^2}{2x + e},$$

hence $x^2/s \simeq 2x + e$. Note that $x^2/s$ is the slope of the tangent to the parabola
at $(x, x^2)$. Fermat now “deletes” $e$ and claims that the slope of the tangent is $2x$.
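Fermat's computation can be replayed in exact rational arithmetic. In this sketch the function name `fermat_slope` is hypothetical; it solves the adequality for the subtangent $s$ and returns $x^2/s$, which equals $2x + e$ exactly for every nonzero $e$.

```python
from fractions import Fraction

def fermat_slope(x, e):
    """Slope x^2/s obtained from x^2/s = (x+e)^2/(s+e), before e is 'deleted'."""
    s = e * x ** 2 / ((x + e) ** 2 - x ** 2)   # subtangent from the adequality
    return x ** 2 / s

x = Fraction(3)
for e in (Fraction(1, 10), Fraction(1, 1000), Fraction(1, 10 ** 6)):
    # The slope differs from 2x by exactly e.
    print(e, fermat_slope(x, e), fermat_slope(x, e) - 2 * x)
```

Calling the function with $e = 0$ raises a division-by-zero error, which is the critics' objection in computational form: $e$ must be nonzero to divide by it, yet it is set to zero at the end.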

Fermat’s method was severely criticized by some of his contemporaries. They 
objected to his introduction and subsequent suppression of the mysterious e. 
Dividing by e meant regarding it as not zero. Discarding e implied treating it as 
zero. This is inadmissible, they rightly claimed. In a somewhat different context, 
but with equal justification, Bishop Berkeley in the 18th century would refer to 
such e’s as “the ghosts of departed quantities,” arguing that “by virtue of a 
twofold mistake ...[one] arrive[d], though not at a science yet at the truth” ([13],
p. 428). 

The justification of 17th- and 18th-century algorithms of the calculus was that 
they yielded correct results—another important example of the utility of 
“meaningless” procedures (cf. p. 965). The end seemed to have justified the 
means. Rigorous justification of the calculus—of one kind—came with the 1821 
introduction of limits by Cauchy, and—of another kind—with the 1960 introduc- 
tion of infinitesimals by Robinson. 


Metaparadox: How can the calculus be founded on two distinct, and in some ways 
incompatible, theories: limits, based on the real numbers, and infinitesimals, based 
on the hyperreal numbers? Or, as Steen put it: “The epistemological foundation of
mathematical analysis is far from settled” ([22], p. 92).


(b) Power series were a potent tool in 17th- and especially 18th-century calculus. 
They were manipulated as polynomials, with little if any attention paid to ques- 
tions of convergence. In fact, Euler and others consciously used divergent series to 
great advantage. The results thus obtained were impressive and important, but 
errors and paradoxes became unavoidable. Here are two: 

(i) There is undoubtedly a touch of the metaphysical in the mathematical
infinite. The following example, due to Euler, confirms it ([13], p. 447): Letting
$x = -1$ in $(1 + x)^{-2} = 1 - 2x + 3x^2 - 4x^3 + \cdots$, he gets

$$\infty = 1 + 2 + 3 + 4 + \cdots \qquad (*).$$

Letting $x = 2$ in $(1 - x)^{-1} = 1 + x + x^2 + x^3 + \cdots$, one has

$$-1 = 1 + 2 + 4 + 8 + \cdots \qquad (**).$$

Since each term on the right side of (**) is greater than or equal to the
corresponding term on the right side of (*), $-1 > \infty$. But clearly $\infty > 1$. Hence
$-1 > \infty > 1$. Euler infers that $\infty$ must be a type of limit between the positive and
negative numbers, and in this sense resembles 0.

(ii) Occasionally 17th- and 18th-century mathematicians revelled in the art of
series-manipulation if for no better reason (it would seem) than to demonstrate
their prowess. For example, putting $x = 1$ in $\log(1 + x) = x - x^2/2 + x^3/3 - \cdots$
yields $\log 2 = 1 - \tfrac{1}{2} + \tfrac{1}{3} - \tfrac{1}{4} + \cdots$. So far so good. But now, the
argument went, the right side equals

$$\left(1 + \tfrac{1}{3} + \tfrac{1}{5} + \cdots\right) + \left(\tfrac{1}{2} + \tfrac{1}{4} + \tfrac{1}{6} + \cdots\right) - 2\left(\tfrac{1}{2} + \tfrac{1}{4} + \tfrac{1}{6} + \cdots\right)$$
$$= \left(1 + \tfrac{1}{2} + \tfrac{1}{3} + \tfrac{1}{4} + \tfrac{1}{5} + \cdots\right) - \left(1 + \tfrac{1}{2} + \tfrac{1}{3} + \tfrac{1}{4} + \tfrac{1}{5} + \cdots\right) = 0,$$

hence $\log 2 = 0$. It was only in the mid-19th century that Riemann resolved this
paradox by proving that the sum of a conditionally convergent series can assume,
upon rearrangement, any value. “The discovery of this apparent paradox contributed
essentially to a re-examination and rigorous founding... of the theory of
infinite series” ([21], p. 30).
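
Riemann's observation is easy to see numerically. The sketch below (function names are ours) sums the alternating harmonic series in its usual order and then in a rearranged order, one positive term followed by two negative ones; the partial sums settle near $\log 2$ and $\tfrac{1}{2}\log 2$ respectively, even though both orderings use exactly the same terms.

```python
import math


def alternating_harmonic(n_terms: int) -> float:
    """Partial sum of 1 - 1/2 + 1/3 - ... in the usual order."""
    return sum((-1) ** (k + 1) / k for k in range(1, n_terms + 1))


def rearranged(n_blocks: int) -> float:
    """Same terms reordered as 1 - 1/2 - 1/4 + 1/3 - 1/6 - 1/8 + ..."""
    total = 0.0
    for m in range(1, n_blocks + 1):
        # One positive (odd-denominator) term, then the next two negative terms.
        total += 1 / (2 * m - 1) - 1 / (4 * m - 2) - 1 / (4 * m)
    return total


print(alternating_harmonic(100000), math.log(2))
print(rearranged(100000), math.log(2) / 2)
```

Each block of the rearranged series simplifies to half of a consecutive pair of the original series, which is why the limit is halved.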


PARADOXES INVOLVING SETS. (a) During nearly the last three decades of the
19th century Cantor developed many important set-theoretic ideas using an
intuitive (“naive”) notion of set. Eventually his concept proved inadequate and led
to paradoxes. Perhaps the best known is Russell’s classic paradox of 1902: Let
$R = \{x : x \notin x\}$. Then $R \in R$ if and only if $R \notin R$. This paradox had a profound
effect on a number of mathematicians (see [19]). It devastated the logician Frege,
who had just completed a two-volume treatise on the foundations of arithmetic
which relied on set-theoretic notions. Learning of Russell’s paradox, he lamented
([13], p. 1192):


A scientist can hardly meet with anything more undesirable than to have the foundation give way 
just as the work is finished. I was put in this position by a letter from Mr. Bertrand Russell when 
the work was nearly through the press. 


On the other hand, the paradoxes of set theory had positive effects. In 
particular, they provoked mathematicians to give precise meaning to the notion of 
set by devising various axiomatizations of set theory (e.g., the Zermelo-Fraenkel 
axioms, the Russell and Whitehead theory of types, the Gödel-Bernays system).
Although such axiom-systems avoided the known paradoxes, they did not guarantee
that new ones would not emerge. As Poincaré put it picturesquely ([13],
p. 1186):


We have put a fence around the herd to protect it from the wolves but we do not know whether 
some wolves were not already within the fence. 


Here are two metaparadoxes resulting from Cantor’s work in set theory: 


Metaparadox 1: Infinity comes in different sizes; in fact, in infinitely many
different sizes.

The second metaparadox comes from juxtaposing the following two quotations by
Poincaré and Hilbert, respectively ([13], p. 1003):


Metaparadox 2: (a) “Later generations will regard Mengenlehre [set theory] as a 
disease from which one has recovered.” 
(b) “No one shall expel us from the paradise which Cantor created for us.” 


PARADOXES INVOLVING CURVES. The notion of curve is, of course, fundamental
in geometry. To Euclid it meant “breadthless length”. The collection of
curves known to his contemporaries was small: the conic sections, the conchoid,
the cissoid, the spiral, the quadratrix, and a very few others. The situation changed
dramatically with the invention of analytic geometry in the 17th century. Now any
equation in two variables came to represent a (plane) curve, although
“seventeenth-century mathematicians did not have a uniform definition of the concept of
curve (nor apparently did they feel the need for such a definition)” ([3], p. 296).
The study of curves was pursued vigorously for the next three centuries, attracting 
some of the best mathematicians who attacked it by geometric, analytic, algebraic, 
arithmetic, and topological means. 

“Pathological” functions introduced in the second half of the 19th century
raised questions about the nature of curves. For example, in what sense does a
continuous nowhere-differentiable function represent a curve? Jordan responded
(in 1887) with what came to be the first formal definition of a curve (other than
perhaps Euclid’s). To him a curve was the path of a continuously moving point.
More precisely, it was $\{(f(t), g(t)) \mid f, g : [0,1] \to \mathbb{R} \text{ are continuous functions}\}$.⁴ In
1890 Peano gave his famous and astounding example of a “space-filling curve”;
that is, he exhibited a continuous mapping of the unit interval onto a square
(including its interior). But that, according to Jordan’s definition, made the square
into a curve, a not very desirable state of affairs. “How was it possible that
intuition could so deceive us?”, wondered Poincaré ([24], p. 123). Jordan’s
definition was too broad and had to be modified.

But Jordan’s definition also turned out to be too narrow. For we would surely
want the graph of $y = \sin(1/x)$ and its limit points on the y-axis (i.e.,
$\{(x, \sin(1/x)) : x \in (-\infty, 0) \cup (0, \infty)\} \cup \{(0, y) : -1 \le y \le 1\}$) to be called a curve,
but it is not the image of a continuously moving point.⁵


Metaparadox: How can a definition be both too broad and too narrow? 


A satisfactory resolution of the dilemma was achieved (by Menger and Uryson)
only in the 1920s. First one had to clarify the notion of dimension ([18]).⁶ When
this was done, a curve was defined as a one-dimensional continuum (see [28]).⁷
The definition proved adequate until the 1970s when Mandelbrot introduced
curves (his fractals) whose dimensions are fractions. See [9].


⁴It was in this context that he stated, and proved (incorrectly, as it later turned out), the celebrated
“Jordan-Curve Theorem”.

⁵This is intuitively clear, although to prove it we need topological notions. See [11], p. 1968.

⁶That notion, too, was challenged by the paradoxical Peano curve which implied that a square is
one-dimensional since it is the continuous image of the unit interval. Cantor’s proof of the one-one
correspondence between an interval and a square also put to question the intuitive notion of dimension.

⁷A continuum is a closed, connected set of points.

PARADOXES INVOLVING DECOMPOSITION OF GEOMETRIC OBJECTS.

(a) In 1924 Banach and Tarski proved that a pea and the sun are equidecomposable.
That is, the pea may be cut up into finitely many pieces⁸ which can be
rearranged to yield the sun (in volume if not in substance). This is the celebrated
Banach-Tarski paradox (see [27]). In fact, Banach and Tarski have shown that any
two bounded sets in Euclidean space $\mathbb{R}^n$ are equidecomposable if they contain
interior points and if $n > 2$ (see [2], p. 351).⁹

Of course, the pieces into which the pea is cut in the Banach-Tarski decomposi- 
tion are not measurable; that is, they do not have a volume. They are not the kinds 
of pieces that can be obtained using scissors or other cutting devices. They are 
obtained using the axiom of choice.


Metaparadox: How can simple assumptions (e.g. the axiom of choice) have such 
formidable consequences (e.g. the Banach-Tarski paradox)? 


Of course, the axiom of choice may not be such a simple assumption after all
(see [19]). But it would have been very helpful to the Delians of Greek antiquity
([27], p. v):


Delians: “How can we be rid of the plague?”

Delphic Oracle: “Construct a cubic altar double the size of the existing altar.”

Banach and Tarski: “Can we use the axiom of choice?”


(b) At long last, the circle has been squared. This is no hoax. It is the title of an
article which appeared recently in the reputable Notices of the American
Mathematical Society ([10]). In 1988 the Hungarian mathematician Laczkovich showed
that the circle can be decomposed into finitely many pieces which can be
reassembled to give a square of equal area. But the pieces are not measurable (none has
an area) and the decomposition is secured using the axiom of choice. See [10].


CONCLUDING REMARKS. We have presented a variety of mathematical para- 
doxes from different historical periods. They resulted from (among other things) 
debates and controversies among mathematicians, counterexamples to what were 
thought to be immutable notions, failures to see the need for tightening (broad- 
ening) a concept or broadening (tightening) a result, and the application of a 
“principle of continuity” which suggested the transferability of procedures from a 
given case to what appeared to be like cases. We saw that such paradoxical 
phenomena have had a very substantial impact on the development of mathematics 
through the refinement and reshaping of concepts, the broadening of existing 
theories and the rise of new ones. Moreover, this process is ongoing. 

We have also suggested roles for paradoxes in the teaching and learning of 
mathematics. They can generate curiosity, increase motivation, create an effective 
environment for debate, encourage the examination of underlying assumptions, 
and show that faulty logic and erroneous arguments are not an uncommon feature 
of the mathematical enterprise. 


⁸It was shown in the 1940s that five pieces suffice; in fact, no number less than five will do.
⁹If one allows for denumerable decompositions, then this result holds also for n = 2 (see [2], p. 351).

Regions in the Complex Plane Containing
the Eigenvalues of a Matrix 


Richard A. Brualdi and Stephen Mellendorf 


The Geršgorin circle theorem gives a region in the complex plane which contains
all the eigenvalues of a square complex matrix. It is one of those rare instances of a 
theorem which is elegant and useful and which has a short, elementary proof. It is 
surprising that it hasn’t made its way into many introductory texts on linear 
algebra. One reason for this neglect may be the fact that many (even most) 
mathematicians still regard linear algebra as being only about algebra. But modern 
linear algebra is more than algebra. It’s linear systems, matrix theory and analysis, 
geometry, applications, numerics and, of course, algebra. The first course in linear 
algebra at most colleges and universities is to a great extent a service course for 
future scientists. 

Geršgorin’s theorem is not an algebraic theorem. It is a simple analytic theorem
involving elementary inequalities derived from the basic algebraic
eigenvalue/eigenvector equation. If $A = [a_{ij}]$ is a complex matrix of order $n$ and $\lambda$ is an
eigenvalue of $A$, then there exists a nonzero vector $x = (x_1, x_2, \ldots, x_n)^T$ in $\mathbb{C}^n$
such that

$$Ax = \lambda x. \qquad (1)$$

One of the components of $x$, say $x_k$, has the largest magnitude:

$$|x_k| = \max\{|x_i| : 1 \le i \le n\} > 0. \qquad (2)$$

Choosing the $k$th equation from among the $n$ linear equations making up (1), we
get

$$a_{k1}x_1 + a_{k2}x_2 + \cdots + a_{kn}x_n = \lambda x_k,$$

which, after rewriting, becomes

$$\sum_{j \ne k} a_{kj}x_j = (\lambda - a_{kk})x_k.$$

(Here, as later, the summation $\sum_{j \ne k}$ means the summation over all integers $j$
between 1 and $n$ excluding $k$.) Taking absolute values, and using the triangle
inequality and (2), we then obtain

$$|\lambda - a_{kk}|\,|x_k| = \Bigl|\sum_{j \ne k} a_{kj}x_j\Bigr| \le \sum_{j \ne k} |a_{kj}|\,|x_j| \le \Bigl(\sum_{j \ne k} |a_{kj}|\Bigr)|x_k|.$$

Hence

$$|\lambda - a_{kk}| \le \sum_{j \ne k} |a_{kj}|.$$

Defining

$$R_i = \sum_{j \ne i} |a_{ij}| = |a_{i1}| + \cdots + |a_{i,i-1}| + |a_{i,i+1}| + \cdots + |a_{in}| \quad (i = 1, 2, \ldots, n),$$

the sum of the magnitudes of the entries of $A$ in row $i$ other than the entry on the
main diagonal, we have proved the following theorem of Geršgorin.


Theorem 1. Each eigenvalue of the matrix $A$ of order $n$ is in at least one of the disks

$$\mathcal{G}_i(A) = \{z : |z - a_{ii}| \le R_i\} \quad (1 \le i \le n) \qquad (3)$$

in the complex plane. Equivalently, the $n$ eigenvalues of $A$ are contained in the region
in the complex plane determined by

$$\mathcal{G}(A) = \bigcup_{i=1}^{n} \mathcal{G}_i(A). \qquad (4)$$


We call (4) the Geršgorin row-region of $A$. By applying Theorem 1 to the
transpose $A^T$ of $A$ we also obtain a Geršgorin column-region that contains all the
eigenvalues of $A$. Let

$$S_j = \sum_{i \ne j} |a_{ij}| = |a_{1j}| + \cdots + |a_{j-1,j}| + |a_{j+1,j}| + \cdots + |a_{nj}| \quad (1 \le j \le n),$$

the sum of the magnitudes of the entries of $A$ in column $j$ other than the entry on
the main diagonal. Then each eigenvalue of $A$ also lies in

$$\mathcal{G}'(A) = \bigcup_{j=1}^{n} \mathcal{G}'_j(A)$$

where

$$\mathcal{G}'_j(A) = \{z : |z - a_{jj}| \le S_j\} \quad (1 \le j \le n). \qquad (5)$$
Example. Let

$$A = \begin{pmatrix} 4 - 3i & i & 2 & -2 \\ i & -1 + i & 0 & 0 \\ 1 + i & -i & 5 + 6i & 2i \\ 1 & -2i & 2i & -5 - 5i \end{pmatrix}. \qquad (6)$$

Then

$$R_1 = 5, \quad R_2 = 1, \quad R_3 = 3 + \sqrt{2}, \quad \text{and} \quad R_4 = 5.$$

By Theorem 1, the eigenvalues of $A$ are contained in the region $\mathcal{G}(A)$ shown in
Figure 1.¹ We also have

$$S_1 = 2 + \sqrt{2}, \quad S_2 = 4, \quad S_3 = 4, \quad \text{and} \quad S_4 = 4.$$

Hence the region $\mathcal{G}'(A)$ shown in Figure 2 also contains all the eigenvalues of $A$
(so the eigenvalues of $A$ all lie in $\mathcal{G}(A) \cap \mathcal{G}'(A)$). In fact, we computed² the
eigenvalues of $A$ to be:

$$5.0353 + 5.9539i, \quad 3.2940 - 2.8181i, \quad -0.7229 + 1.0948i, \quad -4.6065 - 5.2306i.$$

¹All figures were generated using Maple.
²The computation was done using MATLAB.

Figure 1


The eigenvalues correspond to the four points in Figures 1 and 2. The region
$\mathcal{G}(A)$ has two connected components $\mathcal{C}_1$ and $\mathcal{C}_2$ where $\mathcal{C}_1$ is one disk and $\mathcal{C}_2$ is
the union of three disks. It is a consequence of the fact that the eigenvalues of a
matrix depend continuously on its entries that $\mathcal{C}_1$ contains exactly one eigenvalue
of $A$ and $\mathcal{C}_2$ contains the other three eigenvalues.
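
The numbers in this example can be checked with a few lines of Python. The sketch below is ours, not the authors' Maple or MATLAB code: it takes the matrix (6) as we read it from the text, recomputes the row sums $R_i$ and column sums $S_j$, and confirms that each of the four quoted eigenvalues lies in both the row-region and the column-region. The helper names are illustrative assumptions.

```python
import math

# Matrix (6) as read from the example; rows are lists of complex entries.
A = [
    [4 - 3j, 1j, 2, -2],
    [1j, -1 + 1j, 0, 0],
    [1 + 1j, -1j, 5 + 6j, 2j],
    [1, -2j, 2j, -5 - 5j],
]


def row_radii(M):
    """R_i: sum of magnitudes of the off-diagonal entries in row i."""
    n = len(M)
    return [sum(abs(M[i][j]) for j in range(n) if j != i) for i in range(n)]


def column_radii(M):
    """S_j: sum of magnitudes of the off-diagonal entries in column j."""
    n = len(M)
    return [sum(abs(M[i][j]) for i in range(n) if i != j) for j in range(n)]


def in_region(z, M, radii):
    """True if z lies in at least one closed disk |z - a_ii| <= radii[i]."""
    return any(abs(z - M[i][i]) <= radii[i] for i in range(len(M)))


# Eigenvalues quoted in the text (four decimal places).
eigenvalues = [5.0353 + 5.9539j, 3.2940 - 2.8181j,
               -0.7229 + 1.0948j, -4.6065 - 5.2306j]

R, S = row_radii(A), column_radii(A)
print(R, S)
print(all(in_region(z, A, R) and in_region(z, A, S) for z in eigenvalues))
```

Pure Python complex arithmetic suffices here because only magnitudes and distances are needed; no eigenvalue solver is involved.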

More generally, if $A$ is a matrix of order $n$, and the Geršgorin row-region of $A$
has two components $\mathcal{C}_1$ and $\mathcal{C}_2$ where $\mathcal{C}_1$ is the union of $k$ disks and $\mathcal{C}_2$ is the
union of the other $n - k$ disks, then $\mathcal{C}_1$ contains $k$ of the eigenvalues of $A$ and $\mathcal{C}_2$
contains the other $n - k$ eigenvalues of $A$. This can be seen by writing $A$ in the
form

$$A = D + B$$

where $D$ is a diagonal matrix whose main diagonal is the same as the main
diagonal of $A$ and $B = A - D$. The Geršgorin row-regions of the matrices

$$A(\epsilon) = D + \epsilon B \quad (0 \le \epsilon \le 1)$$

are all contained in the Geršgorin row-region of $A$, with the centers of the disks
equal to the diagonal entries of $A$. Moreover, $\mathcal{C}_1$ contains $k$ disks of $A(\epsilon)$ and $\mathcal{C}_2$
contains the other $n - k$ disks. We have $A(0) = D$ for which the disks degenerate
to the $n$ points corresponding to the eigenvalues of $D$ (the $n$ diagonal entries of
$A$). As $\epsilon$ increases from 0 to 1, the continuity of the eigenvalues implies that $\mathcal{C}_1$
always contains exactly $k$ eigenvalues of $A(\epsilon)$. Since $A(1) = A$, $\mathcal{C}_1$ contains exactly
$k$ eigenvalues of $A$ and $\mathcal{C}_2$ contains the other $n - k$ eigenvalues. Note that
continuity does not imply that each disk always contains at least one eigenvalue
(unless the disk itself is a connected component of the Geršgorin region). For
example, let

$$A = \begin{pmatrix} 5 & -1 \\ 6 & 0 \end{pmatrix}.$$

Then the eigenvalues of $A$ are 2 and 3, and neither of these is in the disk of radius
1 with center 5.

Figure 2
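
The small counterexample can be verified directly. In the sketch below the entries 5, -1, 6, 0 are our reading of the 2-by-2 matrix (they are the entries consistent with a row disk of radius 1 centered at 5 and with eigenvalues 2 and 3); the function name is an assumption made for illustration.

```python
import cmath


def eigenvalues_2x2(a, b, c, d):
    """Eigenvalues of [[a, b], [c, d]] from the characteristic polynomial."""
    trace, det = a + d, a * d - b * c
    root = cmath.sqrt(trace * trace - 4 * det)
    return (trace - root) / 2, (trace + root) / 2


lam1, lam2 = eigenvalues_2x2(5, -1, 6, 0)
for lam in (lam1, lam2):
    in_first = abs(lam - 5) <= 1    # row 1: center 5, radius |-1| = 1
    in_second = abs(lam - 0) <= 6   # row 2: center 0, radius |6| = 6
    print(lam, in_first, in_second)
```

Both eigenvalues fall in the second disk, which contains the first disk's neighborhood of overlap, so the row-region is connected and the counting argument gives no information about the individual disks.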
The matrix $A$ of order $n$ is row-diagonally dominant provided that

$$|a_{ii}| > R_i \quad (i = 1, 2, \ldots, n), \qquad (7)$$

that is, the distances from the centers of the Geršgorin row-disks to the origin are
all greater than their radii. Thus if $A$ is row-diagonally dominant, the Geršgorin
row-region does not contain the origin and hence by Theorem 1, 0 is not an
eigenvalue of $A$. Therefore if a matrix $A$ is row-diagonally dominant, then $A$ is
invertible. Conversely, the fact that a row-diagonally dominant matrix $A$ is invertible
implies that the eigenvalues of $A$ are contained in its Geršgorin row-region: If $\lambda$
is an eigenvalue of $A$, then $\lambda I_n - A$ is not invertible and hence cannot be
row-diagonally dominant. Thus for some $i$ we have

$$|\lambda - a_{ii}| \le \sum_{k \ne i} |{-a_{ik}}| = \sum_{k \ne i} |a_{ik}| = R_i.$$

This is an instance of a correspondence that prevails between theorems implying
that a matrix is invertible and theorems giving a region in the complex plane
containing all the eigenvalues of a matrix. Similarly, if $A$ is column-diagonally
dominant, that is,

$$|a_{jj}| > S_j \quad (j = 1, 2, \ldots, n),$$

then $A$ is invertible.
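
Both dominance conditions are simple to test. The following sketch uses function names of our own choosing and a hypothetical 3-by-3 matrix that does not appear in the article; it is row-diagonally dominant (hence invertible) but not column-diagonally dominant.

```python
def is_row_diagonally_dominant(M) -> bool:
    """True if |a_ii| > R_i for every row i (strict dominance, as in (7))."""
    n = len(M)
    return all(
        abs(M[i][i]) > sum(abs(M[i][j]) for j in range(n) if j != i)
        for i in range(n)
    )


def is_column_diagonally_dominant(M) -> bool:
    """True if |a_jj| > S_j for every column j."""
    n = len(M)
    return all(
        abs(M[j][j]) > sum(abs(M[i][j]) for i in range(n) if i != j)
        for j in range(n)
    )


# Hypothetical example (not from the article).
M = [[4, 1, 2], [3, 5, 1], [2, 1, 4]]
print(is_row_diagonally_dominant(M), is_column_diagonally_dominant(M))
```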

In the next theorem we first obtain a condition that implies invertibility and
then use it to obtain a region in the complex plane which contains the eigenvalues
of a matrix. We use the fact that a matrix of order $n$ is not invertible if and only if
there is a nonzero vector $x$ such that $xA = 0$ (equivalently, 0 is an eigenvalue
of $A$).

Suppose that $x = (x_1, x_2, \ldots, x_n)$ is a nonzero vector such that $xA = 0$. Let $x_k$
be the component of $x$ with the largest magnitude. Since $((1/x_k)x)A = 0$, we may
assume that $|x_k| = 1$. The vector

$$(|x_1|, \ldots, |x_{k-1}|, |x_{k+1}|, \ldots, |x_n|) \qquad (8)$$

thus belongs to the $(n - 1)$-dimensional unit cube

$$Q_{n-1} = \{(q_1, q_2, \ldots, q_{n-1}) : 0 \le q_i \le 1 \ (1 \le i \le n - 1)\}.$$

The cube $Q_{n-1}$ is a convex set whose extreme points (vertices) are the $2^{n-1}$
$(n - 1)$-tuples of 0’s and 1’s.


Lemma 2. Let $A = [a_{ij}]$ be a matrix of order $n$ which is not invertible and let
$x = (x_1, x_2, \ldots, x_n)$ be a nonzero vector such that $xA = 0$. Assume that $1 = |x_k| \ge |x_i|$
for each $i = 1, 2, \ldots, n$. Then $(|x_1|, \ldots, |x_{k-1}|, |x_{k+1}|, \ldots, |x_n|)$ is a point in
$Q_{n-1}$ satisfying the inequality

$$\sum_{i \ne k} |a_{ik}|\,|x_i| \ge |a_{kk}|. \qquad (9)$$

Proof: Since $xA = 0$ we have

$$\sum_{i \ne k} a_{ik}x_i = -a_{kk}x_k.$$

Taking absolute values and using the triangle inequality and the fact that $|x_k| = 1$,
we get

$$|a_{kk}| = \Bigl|\sum_{i \ne k} a_{ik}x_i\Bigr| \le \sum_{i \ne k} |a_{ik}|\,|x_i|. \qquad \square$$


Lemma 3. Let $A = [a_{ij}]$ be a matrix of order $n$ which is not invertible and let
$x = (x_1, x_2, \ldots, x_n)$ be a nonzero vector such that $xA = 0$. Assume that $1 = |x_k| \ge |x_i|$
for each $i = 1, 2, \ldots, n$. Then

$$\sum_{i \ne k} (|a_{ii}| - R_i)\,|x_i| \le R_k - |a_{kk}|. \qquad (10)$$

Proof: Since $xA = 0$ we have

$$\sum_{i=1}^{n} a_{ij}x_i = 0 \quad (j = 1, 2, \ldots, n).$$

Using the triangle inequality, we get

$$|a_{jj}|\,|x_j| \le \sum_{i \ne j} |a_{ij}|\,|x_i| \quad (j = 1, 2, \ldots, n).$$

Adding these inequalities and interchanging the order of summation we get

$$\sum_{j=1}^{n} |a_{jj}|\,|x_j| \le \sum_{j=1}^{n} \sum_{i \ne j} |a_{ij}|\,|x_i| = \sum_{i=1}^{n} \sum_{j \ne i} |a_{ij}|\,|x_i| = \sum_{i=1}^{n} R_i |x_i|.$$

Subtracting the last sum from the first and using the fact that $|x_k| = 1$, we obtain
(10). $\square$


The equation

$$\sum_{i \ne k} |a_{ik}|\,y_i = |a_{kk}|,$$

corresponding to the inequality (9), is the equation of a hyperplane $H_k$ which
partitions real $(n - 1)$-dimensional space $\mathbb{R}^{n-1}$ into a closed half-space

$$H_k^+ = \Bigl\{(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n) : \sum_{i \ne k} |a_{ik}|\,y_i \ge |a_{kk}|\Bigr\} \qquad (11)$$

and an open half-space

$$H_k^- = \Bigl\{(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n) : \sum_{i \ne k} |a_{ik}|\,y_i < |a_{kk}|\Bigr\}. \qquad (12)$$

Similarly the equation

$$\sum_{i \ne k} (|a_{ii}| - R_i)\,y_i = R_k - |a_{kk}|,$$

corresponding to the inequality (10), is the equation of a hyperplane $J_k$ which
partitions $\mathbb{R}^{n-1}$ into a closed half-space

$$J_k^- = \Bigl\{(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n) : \sum_{i \ne k} (|a_{ii}| - R_i)\,y_i \le R_k - |a_{kk}|\Bigr\} \qquad (13)$$

and an open half-space

$$J_k^+ = \Bigl\{(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n) : \sum_{i \ne k} (|a_{ii}| - R_i)\,y_i > R_k - |a_{kk}|\Bigr\}. \qquad (14)$$

Thus by Lemmas 2 and 3, a vector $x$ with $x \ne 0$ such that $xA = 0$ and such that
$1 = |x_k| \ge |x_i|$ $(i = 1, 2, \ldots, n)$ determines a point (8) in the unit cube $Q_{n-1}$ which
is in both of the closed half-spaces $H_k^+$ and $J_k^-$. Hence if for each $k = 1, 2, \ldots, n$
we assume that there is no point in the cube $Q_{n-1}$ which is in both $H_k^+$ and $J_k^-$,
then $A$ is an invertible matrix. Thus we have the following result which is
equivalent to Theorem 3 of Pupkov [3].

Theorem 4. Let $A$ be a matrix of order $n$ such that

$$H_k^+ \cap J_k^- \cap Q_{n-1} = \emptyset \quad (k = 1, 2, \ldots, n).$$

Then $A$ is an invertible matrix.

Our goal is now to impose conditions on a matrix $A$ which guarantee that the
half-spaces $H_k^+$ and $J_k^-$ do not have any common points in $Q_{n-1}$ for each
$k = 1, 2, \ldots, n$. Such conditions will ensure the invertibility of the matrix. If our
conditions are such that they guarantee that none of the half-spaces $H_k^+$ contains a
point of $Q_{n-1}$, then $A$ is invertible. Similarly, if our conditions imply that none of
the half-spaces $J_k^-$ contains a point of $Q_{n-1}$, then $A$ is invertible.

For instance, assume that $A$ is column-diagonally dominant. Then for each
point $(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n)$ in $Q_{n-1}$ we have

$$\sum_{i \ne k} |a_{ik}|\,y_i \le \sum_{i \ne k} |a_{ik}| = S_k < |a_{kk}|$$

implying $H_k^+ \cap Q_{n-1}$ is empty for each $k = 1, 2, \ldots, n$. Hence, as already noted,
column-diagonal dominance implies invertibility.

Now assume that $A$ is row-diagonally dominant. Then for each point
$(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n)$ in $Q_{n-1}$ we have

$$\sum_{i \ne k} (|a_{ii}| - R_i)\,y_i \ge 0.$$

Since $R_k - |a_{kk}| < 0$, we conclude that $J_k^- \cap Q_{n-1}$ is empty for each
$k = 1, 2, \ldots, n$. Hence, as also already noted, row-diagonal dominance implies
invertibility.


We now show that a weakened form of row-diagonal dominance can be
combined with a weakened form of column-diagonal dominance to ensure that $H_k^+$
and $J_k^-$ have no common point in $Q_{n-1}$ and thus to ensure invertibility. This result
was obtained by Solov’ev [4] in a somewhat more complicated way. Before stating
his result, we remark that we use the convention (already used above) that for each
$k = 1, 2, \ldots, n$ the $n - 1$ coordinates of $Q_{n-1}$ can be written as

$$(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n).$$


Theorem 5. Let $A = [a_{ij}]$ be a matrix of order $n$ and let $r$ be an integer with
$1 \le r \le n$. Assume that $A$ satisfies both of the following conditions:

(i) For each $j = 1, 2, \ldots, n$

$$|a_{jj}| > S_j^{(r-1)}$$

where $S_j^{(r-1)}$ is the sum of the magnitudes of the $r - 1$ largest off-diagonal
elements in column $j$;

(ii) For each set of $r$ rows of $A$, the sum of the magnitudes of the diagonal
elements in those rows is strictly greater than the sum of the magnitudes of
all the off-diagonal elements in those rows.

Then $A$ is invertible.

If in Theorem 5 we have $r = 1$, then condition (i) is vacuous and (ii) asserts that
$A$ is row-diagonally dominant. If $r = n$, then condition (i) asserts that $A$ is
column-diagonally dominant and (ii) is implied by column-diagonal dominance.
Hence Theorem 5 generalizes the fact that row-diagonal dominance or
column-diagonal dominance implies invertibility.

We note that condition (i) is equivalent to the statement that the magnitude of
each diagonal element is strictly greater than the sum of the magnitudes of any
$r - 1$ or fewer of the off-diagonal elements in its column. Condition (ii) is
equivalent to the assertion that the sum of the $r$ smallest of the numbers
$|a_{11}| - R_1, |a_{22}| - R_2, \ldots, |a_{nn}| - R_n$ is positive. Hence if (ii) is satisfied, at most
$r - 1$ of these numbers are negative and hence any sum of $s \ge r$ of them is
positive.
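
The two conditions of Theorem 5 can be tested mechanically using the equivalent formulations just given. The sketch below is ours: the function name `satisfies_theorem5` and the 3-by-3 matrix are hypothetical. The matrix is neither row- nor column-diagonally dominant, yet it satisfies both conditions with $r = 2$ and is therefore invertible by the theorem.

```python
def satisfies_theorem5(M, r: int) -> bool:
    """Check conditions (i) and (ii) of Theorem 5 for an integer 1 <= r <= n."""
    n = len(M)
    # (i): |a_jj| exceeds the sum of the r-1 largest off-diagonal
    # magnitudes in column j.
    for j in range(n):
        col = sorted((abs(M[i][j]) for i in range(n) if i != j), reverse=True)
        if not abs(M[j][j]) > sum(col[: r - 1]):
            return False
    # (ii), in its equivalent form: the r smallest of the numbers
    # |a_ii| - R_i have a positive sum.
    gaps = sorted(
        abs(M[i][i]) - sum(abs(M[i][j]) for j in range(n) if j != i)
        for i in range(n)
    )
    return sum(gaps[:r]) > 0


# Hypothetical example: row 1 fails row dominance, column 1 fails column dominance.
M = [[4, 3, 2], [3, 10, 1], [3, 2, 10]]
print([satisfies_theorem5(M, r) for r in (1, 2, 3)])
```

Sorting the gaps avoids enumerating every set of $r$ rows, since the worst case for condition (ii) is always the set with the $r$ smallest gaps.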


Proof of Theorem 5: Let $k$ be an integer with $1 \le k \le n$ and consider the
$(n - 1)$-dimensional unit cube $Q_{n-1}$ with coordinates written as
$(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n)$. The hyperplane determined by the equation

$$y_1 + \cdots + y_{k-1} + y_{k+1} + \cdots + y_n = r - 1$$

determines two closed convex sets

$$Q_{n-1}^+ = \{(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n) \in Q_{n-1} : y_1 + \cdots + y_{k-1} + y_{k+1} + \cdots + y_n \ge r - 1\}$$

and

$$Q_{n-1}^- = \{(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n) \in Q_{n-1} : y_1 + \cdots + y_{k-1} + y_{k+1} + \cdots + y_n \le r - 1\}.$$

The extreme points of $Q_{n-1}^+$ are the $(n - 1)$-tuples of 0’s and 1’s with at least
$r - 1$ 1’s, while the extreme points of $Q_{n-1}^-$ are the $(n - 1)$-tuples of 0’s and 1’s
with at most $r - 1$ 1’s. We show that

(a) $Q_{n-1}^-$ is contained in $H_k^-$ (and hence $H_k^+$ has no points in common with
$Q_{n-1}^-$),

and that

(b) $Q_{n-1}^+$ is contained in $J_k^+$ (and hence $J_k^-$ has no points in common with
$Q_{n-1}^+$).

This implies that there is no point in $Q_{n-1}$ which is in $H_k^+ \cap J_k^-$. By Theorem 4
this will imply that $A$ is invertible.

By condition (i) we know that for each subset $I$ of $\{1, \ldots, k - 1, k + 1, \ldots, n\}$
of cardinality at most $r - 1$, we have

$$\sum_{i \in I} |a_{ik}| < |a_{kk}|.$$

Since the extreme points of $Q_{n-1}^-$ are all the $(n - 1)$-tuples of 0’s and 1’s with at
most $r - 1$ 1’s, it follows that

$$\sum_{i \ne k} |a_{ik}|\,y_i < |a_{kk}|$$

holds for all extreme points $(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n)$ of $Q_{n-1}^-$ and hence for all
points of $Q_{n-1}^-$. Therefore (a) holds.

By condition (ii) we know that for each subset $L$ of $\{1, \ldots, k - 1, k + 1, \ldots, n\}$
of cardinality at least $r - 1$, we have

$$\sum_{i \in L \cup \{k\}} (|a_{ii}| - R_i) > 0,$$

and hence

$$\sum_{i \in L} (|a_{ii}| - R_i) > R_k - |a_{kk}|.$$

Since the extreme points of $Q_{n-1}^+$ are all the $(n - 1)$-tuples of 0’s and 1’s with at
least $r - 1$ 1’s, we conclude that

$$\sum_{i \ne k} (|a_{ii}| - R_i)\,y_i > R_k - |a_{kk}|$$

holds for all extreme points $(y_1, \ldots, y_{k-1}, y_{k+1}, \ldots, y_n)$ of $Q_{n-1}^+$ and hence for all
points of $Q_{n-1}^+$. Therefore (b) holds. $\square$


Let $\lambda$ be an eigenvalue of a matrix $A$. Then $\lambda I_n - A$ is not invertible. By
applying Theorem 5 to the matrix $\lambda I_n - A$, we obtain another region in the
complex plane which contains the eigenvalues of a matrix [4].


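Theorem 6. Let $A = [a_{ij}]$ be a matrix of order $n$ and let $r$ be an integer with
$1 \le r \le n$. Then each eigenvalue of $A$ is either in one of the disks

$$\{z : |z - a_{jj}| \le S_j^{(r-1)}\} \quad (j = 1, 2, \ldots, n), \qquad (15)$$

or in one of the regions

$$\Bigl\{z : \sum_{i \in P} |z - a_{ii}| \le \sum_{i \in P} R_i\Bigr\} \quad (P \subseteq \{1, 2, \ldots, n\},\ |P| = r). \qquad (16)$$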


The region $\Gamma_1$ determined by the union of the disks in (15) is contained in the
Geršgorin column-region of $A$. There are $\binom{n}{r}$ regions in (16) and each region is an
‘average’ of $r$ of the disks making up the Geršgorin row-region: Let

$$\Bigl\{z : \sum_{i \in P} |z - a_{ii}| \le \sum_{i \in P} R_i\Bigr\} \qquad (17)$$

be one of the regions in (16). If $\mu$ is a complex number such that $|\mu - a_{ii}| > R_i$
for each $i$ in $P$, then

$$\sum_{i \in P} |\mu - a_{ii}| > \sum_{i \in P} R_i$$

and hence $\mu$ is not contained in (17). It follows that the region in (17) is contained
in the union of the $r$ disks with centers at $a_{ii}$ and radii equal to $R_i$ with $i \in P$. In
particular this implies that the union $\Gamma_2$ of the regions defined by (16) is contained
in the Geršgorin row-region of $A$. Thus Theorem 6 determines two regions $\Gamma_1$ and
$\Gamma_2$ whose union $\Gamma_1 \cup \Gamma_2$ contains all of the eigenvalues of $A$ where $\Gamma_1$ is contained
in the Geršgorin column-region of $A$ and $\Gamma_2$ is contained in the Geršgorin
row-region.
The regions in (16) are in general very hard to describe. But in the case $r = 2$
we obtain the regions

$$\{z : |z - a_{ii}| + |z - a_{jj}| \le R_i + R_j\} \quad (1 \le i < j \le n). \qquad (18)$$

If the two disks

$$|z - a_{ii}| \le R_i \quad \text{and} \quad |z - a_{jj}| \le R_j$$

are disjoint, then the region (18) is empty. Otherwise, the region (18) is bounded by
an ellipse with foci at $a_{ii}$ and $a_{jj}$ and with major axis $R_i + R_j$ and minor axis
$\sqrt{(R_i + R_j)^2 - |a_{ii} - a_{jj}|^2}$. This ellipse passes through the two points of
intersection of the circles bounding the two disks. In the case that one disk is contained in
the other, then the ellipse contains the smaller disk and is contained in the larger.

Figure 5
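
Membership in the regions of Theorem 6 is straightforward to test point by point. The following sketch is ours (the names `in_gamma1` and `in_gamma2` are illustrative): using the matrix (6) of the earlier example as we read it, together with the eigenvalues quoted there, it checks that each eigenvalue lies in the union of the disks (15) and the regions (16) for $r = 2$.

```python
from itertools import combinations

# Matrix (6) of the earlier example, as we read it.
A = [
    [4 - 3j, 1j, 2, -2],
    [1j, -1 + 1j, 0, 0],
    [1 + 1j, -1j, 5 + 6j, 2j],
    [1, -2j, 2j, -5 - 5j],
]


def in_gamma1(z, M, r):
    """z lies in some disk |z - a_jj| <= S_j^(r-1) of (15)."""
    n = len(M)
    for j in range(n):
        col = sorted((abs(M[i][j]) for i in range(n) if i != j), reverse=True)
        if abs(z - M[j][j]) <= sum(col[: r - 1]):
            return True
    return False


def in_gamma2(z, M, r):
    """z lies in some region of (16): sum of distances <= sum of radii over P."""
    n = len(M)
    R = [sum(abs(M[i][j]) for j in range(n) if j != i) for i in range(n)]
    return any(
        sum(abs(z - M[i][i]) for i in P) <= sum(R[i] for i in P)
        for P in combinations(range(n), r)
    )


eigenvalues = [5.0353 + 5.9539j, 3.2940 - 2.8181j,
               -0.7229 + 1.0948j, -4.6065 - 5.2306j]
print([in_gamma1(z, A, 2) or in_gamma2(z, A, 2) for z in eigenvalues])
```

For $r = 2$ each region tested by `in_gamma2` is the elliptical region (18); the test never needs the ellipse's axes explicitly, only the sum of the two focal distances.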

Let $A$ be the matrix in (6) and let $r = 2$. In Figure 3 we show the region $\Gamma_1$ as a
subset of the Geršgorin column-region of Figure 2, and in Figure 4 we show the
region $\Gamma_2$ as a subset of the Geršgorin row-region of Figure 1. In Figure 5 we show
the region $\Gamma_1 \cup \Gamma_2$ which by Theorem 6 contains the eigenvalues of $A$.

Other generalizations of Geršgorin’s theorem, and many references, can be
found in [1] and [2].

Characterization of Solvable Quintics
$x^5 + ax + b$


Blair K. Spearman and Kenneth S. Williams 


We consider the quintic equation

$$x^5 + ax + b = 0, \qquad (1)$$

where $a$ and $b$ are nonzero rational numbers. In general the roots of (1) cannot be
expressed as algebraic functions of the coefficients $a$ and $b$. We will characterize
completely those irreducible quintics $x^5 + ax + b$ which are solvable by radicals.
We do this by extending Cardano’s familiar method of solving the cubic equation
$x^3 + ax + b = 0$. We begin by recalling Cardano’s method in a way which enables
us to apply it to the quintic equation (1).

If $u_1, u_2$ are complex numbers and $\omega$ is a complex cube root of unity, expanding
the product

$$(x - (u_1 + u_2))(x - (\omega u_1 + \omega^2 u_2))(x - (\omega^2 u_1 + \omega u_2)), \qquad (2)$$

we obtain the polynomial

$$x^3 - 3u_1u_2x - (u_1^3 + u_2^3). \qquad (3)$$

As $x_j = \omega^j u_1 + \omega^{2j} u_2$ $(j = 0, 1, 2)$ is a root of the cubic polynomial (2),
substituting it into (3), we obtain the identity valid for $j = 0, 1, 2$

$$(\omega^j u_1 + \omega^{2j} u_2)^3 - 3u_1u_2(\omega^j u_1 + \omega^{2j} u_2) - (u_1^3 + u_2^3) = 0.$$

Thus the cubic $x^3 + ax + b = 0$ has the three solutions $x_j = \omega^j u_1 + \omega^{2j} u_2$
$(j = 0, 1, 2)$, where $u_1^3$ and $u_2^3$ are determined from $u_1^3 + u_2^3 = -b$, $u_1^3 u_2^3 = -(a/3)^3$.
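
As a check on this formulation of Cardano's method, the sketch below (ours, with an illustrative function name and test cubic) solves the quadratic for $u_1^3$, picks a cube root, and then chooses $u_2$ so that $u_1u_2 = -a/3$. That last choice is an assumption we make explicit: matching (3) with $x^3 + ax + b$ requires $-3u_1u_2 = a$, which fixes which cube root of $u_2^3$ to take.

```python
import cmath


def cardano_roots(a: complex, b: complex):
    """Roots of x^3 + a x + b = 0 as x_j = w^j u1 + w^(2j) u2, assuming a != 0."""
    w = cmath.exp(2j * cmath.pi / 3)  # primitive cube root of unity
    # u1^3 and u2^3 are the two roots of t^2 + b t - (a/3)^3 = 0.
    t1 = (-b + cmath.sqrt(b * b + 4 * (a / 3) ** 3)) / 2
    u1 = t1 ** (1 / 3)                # any cube root of t1 will do
    u2 = -a / (3 * u1)                # forces u1 * u2 = -a/3
    return [w**j * u1 + w ** (2 * j) * u2 for j in range(3)]


# Illustrative cubic x^3 - 6x - 9, which has the root 3.
for x in cardano_roots(-6, -9):
    print(x, abs(x**3 - 6 * x - 9))
```

Since $a \neq 0$, the product of the two roots of the quadratic is nonzero, so `u1` is never zero and the division is safe.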
An obvious generalization of this is to consider the quintic polynomial

$$\prod_{j=0}^{4} (x - (\omega^j u_1 + \omega^{4j} u_2)), \qquad (4)$$

where $\omega$ is now a complex fifth root of unity. Expanding the product (4), and
proceeding as above, we find that the quintic $x^5 + ax^3 + (a^2/5)x + b$ (sometimes
called DeMoivre’s quintic) has the solutions $x_j = \omega^j u_1 + \omega^{4j} u_2$, $j = 0, 1, 2, 3, 4$,
where $u_1^5$ and $u_2^5$ are determined from $u_1^5 + u_2^5 = -b$, $u_1^5 u_2^5 = -(a/5)^5$.
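
The same recipe can be tried on DeMoivre's quintic. In the sketch below (ours; the function name and the sample coefficients are illustrative) we again pin down the choice of fifth roots by requiring $u_1u_2 = -a/5$, an assumption that is consistent with $u_1^5u_2^5 = -(a/5)^5$.

```python
import cmath


def demoivre_roots(a: complex, b: complex):
    """Roots of x^5 + a x^3 + (a^2/5) x + b = 0, assuming a != 0."""
    w = cmath.exp(2j * cmath.pi / 5)  # primitive fifth root of unity
    # u1^5 and u2^5 are the two roots of t^2 + b t - (a/5)^5 = 0.
    t1 = (-b + cmath.sqrt(b * b + 4 * (a / 5) ** 5)) / 2
    u1 = t1 ** (1 / 5)                # any fifth root of t1 will do
    u2 = -a / (5 * u1)                # forces u1 * u2 = -a/5
    return [w**j * u1 + w ** (4 * j) * u2 for j in range(5)]


a, b = 5, 3                           # illustrative coefficients
for x in demoivre_roots(a, b):
    print(x, abs(x**5 + a * x**3 + (a * a / 5) * x + b))
```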

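We refine this method by considering instead of (4) the quintic polynomial

$$\prod_{j=0}^{4} (x - (\omega^j u_1 + \omega^{2j} u_2 + \omega^{3j} u_3 + \omega^{4j} u_4)), \qquad (5)$$

where $u_1, u_2, u_3, u_4$ are nonzero real numbers and $\omega$ is a complex fifth root of
unity. Multiplying out (5) is somewhat more challenging than (4), so MAPLE was
employed to do the work. Replacing $x$ by $\omega^j u_1 + \omega^{2j} u_2 + \omega^{3j} u_3 + \omega^{4j} u_4$ in the
expanded product, we obtain the identity valid for $j = 0, 1, 2, 3, 4$

$$\begin{aligned}
&(\omega^j u_1 + \omega^{2j} u_2 + \omega^{3j} u_3 + \omega^{4j} u_4)^5 \\
&\quad - 5U(\omega^j u_1 + \omega^{2j} u_2 + \omega^{3j} u_3 + \omega^{4j} u_4)^3 \\
&\quad - 5V(\omega^j u_1 + \omega^{2j} u_2 + \omega^{3j} u_3 + \omega^{4j} u_4)^2 \\
&\quad + 5W(\omega^j u_1 + \omega^{2j} u_2 + \omega^{3j} u_3 + \omega^{4j} u_4) \\
&\quad + 5(X - Y) - Z = 0,
\end{aligned} \qquad (6)$$

where

$$\begin{aligned}
U &= u_1u_4 + u_2u_3, \\
V &= u_1u_2^2 + u_2u_4^2 + u_3u_1^2 + u_4u_3^2, \\
W &= u_1^2u_4^2 + u_2^2u_3^2 - u_1^3u_2 - u_2^3u_4 - u_3^3u_1 - u_4^3u_3 - u_1u_2u_3u_4, \\
X &= u_1^3u_3u_4 + u_2^3u_1u_3 + u_3^3u_2u_4 + u_4^3u_1u_2, \\
Y &= u_1u_3^2u_4^2 + u_2u_1^2u_3^2 + u_3u_2^2u_4^2 + u_4u_1^2u_2^2, \\
Z &= u_1^5 + u_2^5 + u_3^5 + u_4^5.
\end{aligned}$$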

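The essential ingredient of the proof of our characterization of solvable quintic
trinomials is the determination of real algebraic numbers $u_1, u_2, u_3, u_4$ satisfying

$$u_1u_4 + u_2u_3 = 0, \qquad (7)$$

$$u_1u_2^2 + u_2u_4^2 + u_3u_1^2 + u_4u_3^2 = 0, \qquad (8)$$

$$5(u_1^2u_4^2 + u_2^2u_3^2 - u_1^3u_2 - u_2^3u_4 - u_3^3u_1 - u_4^3u_3 - u_1u_2u_3u_4) = a, \qquad (9)$$

and

$$\begin{aligned}
&5\bigl((u_1^3u_3u_4 + u_2^3u_1u_3 + u_3^3u_2u_4 + u_4^3u_1u_2) \\
&\quad - (u_1u_3^2u_4^2 + u_2u_1^2u_3^2 + u_3u_2^2u_4^2 + u_4u_1^2u_2^2)\bigr) \\
&\quad - (u_1^5 + u_2^5 + u_3^5 + u_4^5) = b,
\end{aligned} \qquad (10)$$

so that the quintic polynomial (5) becomes $x^5 + ax + b$ and has the roots

$$x_j = \omega^j u_1 + \omega^{2j} u_2 + \omega^{3j} u_3 + \omega^{4j} u_4 \quad (j = 0, 1, 2, 3, 4). \qquad (11)$$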
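Theorem. Let $a$ and $b$ be rational numbers such that the quintic trinomial $x^5 +
ax + b$ is irreducible. Then the equation $x^5 + ax + b = 0$ is solvable by radicals if
and only if there exist rational numbers $\varepsilon\,(= \pm 1)$, $c\,(\ge 0)$ and $e\,(\ne 0)$ such that

$$a = \frac{5e^4(3 - 4\varepsilon c)}{c^2 + 1}, \qquad b = \frac{-4e^5(11\varepsilon + 2c)}{c^2 + 1}, \tag{12}$$

in which case the roots of $x^5 + ax + b = 0$ are

$$x_j = e(\omega^{j}u_1 + \omega^{2j}u_2 + \omega^{3j}u_3 + \omega^{4j}u_4) \quad (j = 0, 1, 2, 3, 4), \tag{13}$$

where $\omega = \exp(2\pi i/5)$ and

$$u_1 = \left(\frac{v_1^2v_3}{D^2}\right)^{1/5}, \quad u_2 = \left(\frac{v_3^2v_4}{D^2}\right)^{1/5}, \quad u_3 = \left(\frac{v_2^2v_1}{D^2}\right)^{1/5}, \quad u_4 = \left(\frac{v_4^2v_2}{D^2}\right)^{1/5}, \tag{14}$$

$$v_1 = \sqrt{D} + \sqrt{D - \varepsilon\sqrt{D}}, \quad v_2 = -\sqrt{D} - \sqrt{D + \varepsilon\sqrt{D}}, \quad v_3 = -\sqrt{D} + \sqrt{D + \varepsilon\sqrt{D}}, \quad v_4 = \sqrt{D} - \sqrt{D - \varepsilon\sqrt{D}}, \tag{15}$$

$$D = c^2 + 1. \tag{16}$$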
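Formulas (12) to (16) are easy to evaluate numerically. The sketch below computes the coefficients $a, b$ and the real root $x_0$ for given parameters; the function names are illustrative, the fifth roots are taken to be the real ones, and floating-point values are used throughout, so the result only approximates the exact radical expression.

```python
import math

def real_fifth_root(t):
    """Real fifth root of a real number, keeping the sign of t."""
    return math.copysign(abs(t) ** 0.2, t)

def trinomial_from_params(eps, c, e):
    """Coefficients (a, b) of x^5 + a x + b given by (12)."""
    D = c * c + 1
    a = 5 * e**4 * (3 - 4 * eps * c) / D
    b = -4 * e**5 * (11 * eps + 2 * c) / D
    return a, b

def real_root(eps, c, e):
    """Real root x_0 = e (u1 + u2 + u3 + u4) from (13)-(16)."""
    D = c * c + 1
    s = math.sqrt(D)
    v1 = s + math.sqrt(D - eps * s)
    v2 = -s - math.sqrt(D + eps * s)
    v3 = -s + math.sqrt(D + eps * s)
    v4 = s - math.sqrt(D - eps * s)
    u = [real_fifth_root(t / D**2)
         for t in (v1 * v1 * v3, v3 * v3 * v4, v2 * v2 * v1, v4 * v4 * v2)]
    return e * sum(u)
```

For instance, `trinomial_from_params(1, 2, -1)` describes $x^5 - 5x + 12$, and substituting `real_root(1, 2, -1)` into that polynomial should give a value close to zero.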
Proof: We begin by supposing that the irreducible quintic polynomial $x^5 + ax + b$
is solvable by radicals. Thus the resolvent sextic of $x^5 + ax + b$, namely,

$$x^6 + 8ax^5 + 40a^2x^4 + 160a^3x^3 + 400a^4x^2 + (512a^5 - 3125b^4)x + (256a^6 - 9375ab^4),$$

has a rational root $r$ [1, Theorem 1]. Hence $r$ satisfies

$$(r + 2a)^4(r^2 + 16a^2) - 5^5b^4(r + 3a) = 0, \tag{17}$$

which shows that $r \ne -2a, -3a$ as $a \ne 0$. We define the nonnegative rational
number $c$ and the nonzero rational number $e$ by

$$c = \frac{\varepsilon(3r - 16a)}{4(r + 3a)}, \qquad e = \frac{-5b\varepsilon}{2(r + 2a)}, \qquad \text{where } \varepsilon = \pm 1 \text{ is chosen so that } c \ge 0. \tag{18}$$

Then

$$c^2 + 1 = \frac{25(r^2 + 16a^2)}{16(r + 3a)^2}, \qquad 3 - 4\varepsilon c = \frac{25a}{r + 3a}, \qquad 11\varepsilon + 2c = \frac{25(r + 2a)\varepsilon}{2(r + 3a)},$$

so that

$$\frac{5e^4(3 - 4\varepsilon c)}{c^2 + 1} = \frac{5^5ab^4(r + 3a)}{(r + 2a)^4(r^2 + 16a^2)} = a$$

and

$$\frac{-4e^5(11\varepsilon + 2c)}{c^2 + 1} = \frac{5^5b^5(r + 3a)}{(r + 2a)^4(r^2 + 16a^2)} = b,$$

giving the required parametrization.
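The passage from $(a, b)$ to the parameters $(\varepsilon, c, e)$ can be sketched in code for integer $a$ and $b$. The assumptions here are mine: the rational root of (17) is searched among the integer divisors of the constant term (legitimate because the sextic is monic with integer coefficients), $\varepsilon$ is taken as the sign that makes $c$ nonnegative, and the function names are hypothetical.

```python
from fractions import Fraction

def resolvent_value(r, a, b):
    """Left side of (17) evaluated at r."""
    return (r + 2*a)**4 * (r*r + 16*a*a) - 5**5 * b**4 * (r + 3*a)

def integer_divisors(n):
    """Positive divisors of |n| for n != 0."""
    n = abs(n)
    divs, d = set(), 1
    while d * d <= n:
        if n % d == 0:
            divs.update((d, n // d))
        d += 1
    return divs

def rational_root_of_resolvent(a, b):
    """An integer root of (17), or None if there is none."""
    const = resolvent_value(0, a, b)
    if const == 0:
        return 0
    for d in sorted(integer_divisors(const)):
        for r in (d, -d):
            if resolvent_value(r, a, b) == 0:
                return r
    return None

def parameters(a, b, r):
    """(eps, c, e) from (18), with eps chosen so that c >= 0."""
    eps_c = Fraction(3*r - 16*a, 4*(r + 3*a))   # this is eps * c
    eps = 1 if eps_c >= 0 else -1
    c = abs(eps_c)
    e = Fraction(-5 * b * eps, 2 * (r + 2*a))
    return eps, c, e
```

With $a = -5$, $b = 12$ the search returns $r = 40$ and `parameters(-5, 12, 40)` gives $\varepsilon = 1$, $c = 2$, $e = -1$; a quintic whose resolvent has no rational root yields `None`.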
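We now show that the irreducible quintic trinomial

$$x^5 + \frac{5e^4(3 - 4\varepsilon c)}{c^2 + 1}\,x - \frac{4e^5(11\varepsilon + 2c)}{c^2 + 1} \tag{19}$$

with $e = 1$ is solvable by radicals with roots given by (11). In fact it is not necessary
to assume that the quintic is irreducible. For general $e$ the transformation $x \to ex$
gives the required result (13). From (15) we see that

$$v_1 + v_4 = 2\sqrt{D}, \quad v_2 + v_3 = -2\sqrt{D}, \quad v_1v_4 = \varepsilon\sqrt{D}, \quad v_2v_3 = -\varepsilon\sqrt{D}, \tag{20}$$

and so

$$v_1 + v_2 + v_3 + v_4 = 0, \qquad v_1v_4 + v_2v_3 = 0. \tag{21}$$

Further, from (14), we obtain

$$u_1^5 = \frac{v_1^2v_3}{D^2}, \quad u_2^5 = \frac{v_3^2v_4}{D^2}, \quad u_3^5 = \frac{v_2^2v_1}{D^2}, \quad u_4^5 = \frac{v_4^2v_2}{D^2}. \tag{22}$$

Easy calculations making use of (20) and (22) yield

$$u_1u_4 = -\frac{\varepsilon}{\sqrt{D}}, \qquad u_2u_3 = \frac{\varepsilon}{\sqrt{D}}, \tag{23}$$

$$u_1u_2^2 = \frac{v_3}{D}, \quad u_2u_4^2 = \frac{v_4}{D}, \quad u_3u_1^2 = \frac{v_1}{D}, \quad u_4u_3^2 = \frac{v_2}{D}, \tag{24}$$

and

$$u_1^3u_2 = \frac{\varepsilon v_1v_3}{D\sqrt{D}}, \quad u_2^3u_4 = -\frac{\varepsilon v_3v_4}{D\sqrt{D}}, \quad u_3^3u_1 = -\frac{\varepsilon v_1v_2}{D\sqrt{D}}, \quad u_4^3u_3 = \frac{\varepsilon v_2v_4}{D\sqrt{D}}, \tag{25}$$

which give the required equations (7) and (8) in view of (21). From (15), (22), (23),
(24) and (25), we deduce that

$$5(u_1^2u_4^2 + u_2^2u_3^2 - u_1^3u_2 - u_2^3u_4 - u_3^3u_1 - u_4^3u_3 - u_1u_2u_3u_4) = \frac{5(3 - 4\varepsilon\sqrt{D - 1})}{D} = \frac{5(3 - 4\varepsilon c)}{c^2 + 1} \tag{26}$$

and

$$5\bigl((u_1^3u_3u_4 + u_2^3u_1u_3 + u_3^3u_2u_4 + u_4^3u_1u_2) - (u_1u_3^2u_4^2 + u_2u_1^2u_3^2 + u_3u_2^2u_4^2 + u_4u_1^2u_2^2)\bigr) - (u_1^5 + u_2^5 + u_3^5 + u_4^5) = -\frac{44\varepsilon + 8\sqrt{D - 1}}{D} = -\frac{4(11\varepsilon + 2c)}{c^2 + 1}, \tag{27}$$

which are the required equations (9) and (10). This proves that

$$x^5 + \frac{5(3 - 4\varepsilon c)}{c^2 + 1}\,x - \frac{4(11\varepsilon + 2c)}{c^2 + 1}$$

is solvable by radicals and has the roots given in (11).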


The discriminant of the trinomial quintic $x^5 + ax + b$ is $4^4a^5 + 5^5b^4$ [2, p. 259].
The equation $x^5 + ax + b = 0$ has exactly one real root if $4^4a^5 + 5^5b^4 > 0$ [3,
p. 113]. The discriminant of the quintic (19) is

$$\frac{4^4\,5^5\,e^{20}}{(c^2 + 1)^5}\,(4\varepsilon c^3 - 84c^2 - 37\varepsilon c - 122)^2 > 0 \tag{28}$$

so that the quintic (19) has exactly one real root. Suppose now that (19) is
irreducible over $\mathbb{Q}$. By the Theorem, (19) is solvable by radicals, and so its Galois
group is solvable. Hence its Galois group is isomorphic to the Frobenius group $F_{20}$
of order 20, the dihedral group $D_5$ of order 10, or to the cyclic group of order 5.
However (19) has complex roots, so its Galois group cannot be cyclic of order 5. By
[1, Theorem 2] the Galois group of (19) is the dihedral group $D_5$ of order 10 if and
only if $5D$ is a perfect square in $\mathbb{Q}$. Otherwise the Galois group is the Frobenius
group $F_{20}$ of order 20.
We close with five examples. 


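Example 1. We consider the quintic $f_1(x) = x^5 - 5x + 12$, which is irreducible as
$f_1(x - 2)$ is 5-Eisenstein. The resolvent sextic of $f_1$ is

$$x^6 - 40x^5 + 1000x^4 - 20000x^3 + 250000x^2 - 66400000x + 976000000,$$

which has the rational root $r = 40$. From (18) we see that $\varepsilon = 1$, $c = 2$, $e = -1$,
so that by (16) $D = 5$. Since $5D = 5^2$ the Galois group of $f_1$ is $D_5$. By the
Theorem the unique real root of $f_1$ is

$$x = -\left(\frac{(\sqrt5 + \sqrt{5 - \sqrt5})^2(-\sqrt5 + \sqrt{5 + \sqrt5})}{25}\right)^{1/5} - \left(\frac{(-\sqrt5 + \sqrt{5 + \sqrt5})^2(\sqrt5 - \sqrt{5 - \sqrt5})}{25}\right)^{1/5}$$

$$\qquad - \left(\frac{(\sqrt5 + \sqrt{5 + \sqrt5})^2(\sqrt5 + \sqrt{5 - \sqrt5})}{25}\right)^{1/5} - \left(\frac{(\sqrt5 - \sqrt{5 - \sqrt5})^2(-\sqrt5 - \sqrt{5 + \sqrt5})}{25}\right)^{1/5}.$$

A little manipulation shows that this root can be rewritten as

$$x = \tfrac15\bigl(R_1^{1/5} + R_2^{1/5} + R_3^{1/5} + R_4^{1/5}\bigr),$$

where $R_1, R_2, R_3, R_4$ are given at the bottom of page 399 of [1].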
Example 2. We take $f_2(x) = x^5 + 15x + 12$, which is irreducible as $f_2(x)$ is
3-Eisenstein. The resolvent sextic of $f_2$ is

$$(x + 30)^4(x^2 + 3600) - 2^8 \cdot 3^4 \cdot 5^5(x + 45),$$

which has the rational root $r = 0$. Hence, by (16) and (18), we have $\varepsilon = -1$,
$c = 4/3$, $e = 1$, $D = 25/9$. Since $5D$ is not the square of a rational number, the
Galois group of $f_2$ is $F_{20}$. By the Theorem the unique real root of $f_2$ is

$$x = \left(\frac{-75 - 21\sqrt{10}}{125}\right)^{1/5} + \left(\frac{225 - 72\sqrt{10}}{125}\right)^{1/5} + \left(\frac{225 + 72\sqrt{10}}{125}\right)^{1/5} + \left(\frac{-75 + 21\sqrt{10}}{125}\right)^{1/5},$$

in agreement with the more complicated formula given at the top of page 399
in [1].

Example 3. Here we take $\varepsilon = 1$, $e = 5/2$, $c = 7/24$, so $D = 1 + (7/24)^2 = (25/24)^2$,
and the quintic (19) is $f_3(x) = x^5 + 330x - 4170$, which is irreducible as $f_3(x)$ is
5-Eisenstein. Since $5D = 5^5/(2^6 \cdot 3^2)$ the Galois group of $f_3$ is $F_{20}$. By the
Theorem the unique real root of $f_3$ is

$$x = 54^{1/5} + 12^{1/5} + 648^{1/5} - 144^{1/5}.$$
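Example 4. Here we take $\varepsilon = -1$, $e = 1$, $c = 11/2$, so $D = 125/4$, and the
quintic (19) is $f_4(x) = x^5 + 4x$, which is clearly reducible. However, by the remark
preceding (20), the roots of $x^5 + 4x = 0$, namely $x = 0, \pm 1 \pm i$, are given by
(13). Here

$$v_1 = \tfrac12\bigl(5\sqrt5 + \sqrt5\sqrt{25 + 2\sqrt5}\bigr), \qquad v_3 = \tfrac12\bigl(-5\sqrt5 + \sqrt5\sqrt{25 - 2\sqrt5}\bigr),$$

$$\frac{v_1^2v_3}{D^2} = \frac{1}{5^5}\bigl(1000 - 500\sqrt5 + 180\sqrt{25 + 2\sqrt5} - 240\sqrt{25 - 2\sqrt5}\bigr) = \frac{1}{5^5}\bigl(1000 - 500\sqrt5 + 120\sqrt{5 + 2\sqrt5} - 660\sqrt{5 - 2\sqrt5}\bigr),$$

and

$$u_1 = \left(\frac{v_1^2v_3}{D^2}\right)^{1/5} = \tfrac15\bigl(-\sqrt5 - \sqrt{5 - 2\sqrt5}\bigr),$$

$$u_2 = \tfrac15\bigl(\sqrt5 - \sqrt{5 + 2\sqrt5}\bigr),$$

$$u_3 = \tfrac15\bigl(\sqrt5 + \sqrt{5 + 2\sqrt5}\bigr),$$

$$u_4 = \tfrac15\bigl(-\sqrt5 + \sqrt{5 - 2\sqrt5}\bigr).$$

Clearly $x_0 = u_1 + u_2 + u_3 + u_4 = 0$. Further, as

$$\omega = \exp(2\pi i/5) = \bigl((\sqrt5 - 1) + i\sqrt{10 + 2\sqrt5}\bigr)/4$$

we have

$$x_1 = u_1\omega + u_2\omega^2 + u_3\omega^3 + u_4\omega^4 = \tfrac{1}{20}\bigl((-x - y)(x - 1 + i(y + z)) + (x - z)(-x - 1 - i(y - z)) + (x + z)(-x - 1 + i(y - z)) + (-x + y)(x - 1 - i(y + z))\bigr),$$

where $x = \sqrt5$, $y = \sqrt{5 - 2\sqrt5}$, $z = \sqrt{5 + 2\sqrt5}$. Simplifying the expression for $x_1$,
we deduce

$$x_1 = \tfrac{1}{20}(-4x^2 - 2iy^2 - 2iz^2) = \frac{-20 - 20i}{20} = -1 - i.$$

We leave it to the reader to show that $x_2 = 1 + i$, $x_3 = 1 - i$, $x_4 = -1 + i$.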


Example 5. Let $p$ be a prime with $p \equiv 3 \pmod 4$. We show using the Theorem
that the quintic equation $x^5 + 2px + 2p^2 = 0$ is not solvable by radicals. We first
observe that $x^5 + 2px + 2p^2$ is 2-Eisenstein so that it is irreducible. Suppose
however that the equation is solvable by radicals. Then, by the Theorem, there
exist rational numbers $\varepsilon\,(= \pm 1)$, $c\,(\ge 0)$ and $e\,(\ne 0)$ such that

$$2p = \frac{5e^4(3 - 4\varepsilon c)}{c^2 + 1}, \tag{29}$$

$$2p^2 = -\frac{4e^5(11\varepsilon + 2c)}{c^2 + 1}. \tag{30}$$

Expressing the rational numbers $c$ and $e$ in the form $c = m/n$ and $e = r/s$, where
$m, n, r, s$ are integers with $\gcd(m, n) = \gcd(r, s) = 1$, and appealing to (29) and
(30), we obtain

$$2p(m^2 + n^2)s^4 = 5r^4(3n - 4\varepsilon m)n, \tag{31}$$

$$2p^2(m^2 + n^2)s^5 = -4r^5(11n\varepsilon + 2m)n. \tag{32}$$

As $p$ is a prime $\equiv 3 \pmod 4$ and $\gcd(m, n) = 1$, $p$ does not divide $m^2 + n^2$.
Further, as $\gcd(r, s) = 1$, it is clear from (31) that $p$ does not divide $r$. Let
$p^\alpha, p^\beta, p^\gamma, p^\delta$ be the exact powers of $p$ dividing $n$, $3n - 4\varepsilon m$, $11\varepsilon n + 2m$, $s$
respectively. As $p$ does not divide both of $n$ and $3n - 4\varepsilon m$ we see that $\alpha$ or
$\beta = 0$. Similarly $\alpha$ or $\gamma = 0$ and $\beta$ or $\gamma = 0$. Equating powers of $p$ on both sides
of (31) and (32), we obtain

$$1 + 4\delta = \alpha + \beta,$$

$$2 + 5\delta = \alpha + \gamma,$$

which contradicts that at least two of $\alpha, \beta, \gamma$ are 0. Hence the equation $x^5 +
2px + 2p^2 = 0$ is not solvable by radicals.


Other examples of solvable quintics are given below together with their Galois
groups.


$\varepsilon = 1$,   $e = -1$,  $c = 2/11$    $x^5 + 11x + 44$    $D_5$

$\varepsilon = 1$,   $e = -1$,  $c = 0$       $x^5 + 15x + 44$    $F_{20}$

$\varepsilon = -1$,  $e = 1$,   $c = 1/2$     $x^5 + 20x + 32$    $D_5$

$\varepsilon = 1$,   $e = -2$,  $c = 7$       $x^5 - 40x + 64$    $F_{20}$
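The rows of such a table can be recomputed exactly from (12) and from the criterion that $5D$ be a rational square. This is a hedged sketch with hypothetical function names; the group label it returns is meaningful only when the quintic is irreducible, which the code does not check.

```python
from fractions import Fraction
from math import isqrt

def is_rational_square(q):
    """True if the rational number q is the square of a rational number."""
    q = Fraction(q)
    if q < 0:
        return False
    n, d = q.numerator, q.denominator   # already in lowest terms
    return isqrt(n) ** 2 == n and isqrt(d) ** 2 == d

def quintic_and_group(eps, e, c):
    """Return (a, b, group) for x^5 + a x + b built from (eps, e, c)."""
    eps, e, c = Fraction(eps), Fraction(e), Fraction(c)
    D = c * c + 1
    a = 5 * e**4 * (3 - 4 * eps * c) / D
    b = -4 * e**5 * (11 * eps + 2 * c) / D
    group = "D5" if is_rational_square(5 * D) else "F20"
    return a, b, group
```

For example, `quintic_and_group(1, -1, Fraction(2, 11))` returns the coefficients 11 and 44 together with the label `"D5"`.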
A Halmos Problem and a Related Problem


John B. Cosgrave 


To my daughters Catherine and Marie for their 18th and 21st birthdays, and for my first year 
students Clare Connolly and Orla Walsh without whom there would have been no story to tell. 


The November ’92 issue of the Monthly has a review by Stan Wagon of Paul
Halmos’ ‘Problems for Mathematicians Young and Old,’ and one of those which
he quotes is:


‘Which positive integers are sums of three or more consecutive positive
integers?’, and then says: several of these problems have surprising answers
and I won’t spoil your fun too much, but to whet your appetite for the book
here is the answer to the above problem: all integers except for the primes
and the powers of 2 (surprising, and surprisingly easy to prove).


I first saw this problem in a British Sunday newspaper, the Sunday Times, about 
fifteen years ago—I don’t know who first thought of it, perhaps some British 
reader could provide a reference—but only last October did I present it, for the 
first time to any of my students, to my first year students, who are training to be 
primary school teachers. 

Then I was introducing them to Number Theory and had asked them to find
some primes which are a power of 2 plus 1. They quickly came up with 3, 5 and 17
which are $(2^1 + 1)$, $(2^2 + 1)$ and $(2^4 + 1)$, and one of them said ‘there’s a pattern!’
Good! Namely? ‘The next one will be $(2^8 + 1)$.’ They checked that it is prime, and
I asked how could they be sure that it is the next prime of this type. They tested
all the ‘missing’ ones, $(2^3 + 1)$, $(2^5 + 1)$, $(2^6 + 1)$ and $(2^7 + 1)$, all composite.
[Later I showed, for example, (2% + 1) can be seen to be composite: (2 + 1) = 
(23 + 1)(27° — 2% + 27 — 2 + 219 — 2° 4+ 1)). 

And the next one? I was, of course, given $(2^{16} + 1)$; and then? Well, I was
told, the next one is $(2^{32} + 1)$. It was then I let out that they had just rediscovered
Fermat’s false claim with respect to his primes (that the ‘Fermat numbers’
$F_n\,(= 2^{2^n} + 1)$ are prime for all $n = 0, 1, 2, 3, \ldots$) and got them to verify that 641
divides $(2^{32} + 1)$ (Euler). I also asked them to find primes that are a power of 2
less 1; here they quickly found 3, 7, 31 and 127 which are $(2^2 - 1)$, $(2^3 - 1)$,
$(2^5 - 1)$ and $(2^7 - 1)$. Again someone saw a ‘pattern,’ claiming the next to be
$(2^{11} - 1)$, but it is composite, being $23 \cdot 89$ (first noted in the 15th century).
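The arithmetic in this classroom episode is small enough to check by machine. The sketch below uses plain trial division, which is an assumption of convenience and adequate only for numbers of this size; the names are illustrative.

```python
def is_prime(n):
    """Trial-division primality test, fine for small n."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True

# Exponents k <= 16 for which 2**k + 1 is prime.
plus_one_exponents = [k for k in range(1, 17) if is_prime(2**k + 1)]

# Exponents 2 <= k <= 11 for which 2**k - 1 is prime.
minus_one_exponents = [k for k in range(2, 12) if is_prime(2**k - 1)]

# Euler's observation that 641 divides 2**32 + 1.
euler_divides = (2**32 + 1) % 641 == 0

# The factorization of 2**11 - 1 quoted in the text.
mersenne_11_factors = 2**11 - 1 == 23 * 89
```

The two lists show the patterns the students spotted: the exponents for "plus 1" are themselves powers of 2, and the exponents for "less 1" are primes, with 11 the first prime exponent that fails.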

Then I presented this problem (not the ‘related’ one of the title): Some natural 
numbers can/can’t be expressed as a sum of at least two consecutive natural 
numbers; find some of each kind. Check down to 20. Soon they had two lists: 


‘can’: 3,5, 6, 7, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20. 
‘can’t’: 1, 2, 4, 8, 16. 
Now, who could guess the next entry in the ‘can’t’ list? There was a tentative
suggestion that it might be 32. But is it correct? So we tested and found that 32 is 
in the ‘can’t’ list. And after 32? Is 64 the next one? And what about the ones in 
between, namely: 21, 22, 23,...,31,33,...,63,65,...? 

The problem now became: could they prove the conjectures: 


(i) any natural number that is NOT a power of 2 IS a sum of at least two 
consecutive natural numbers. 
(ii) any natural number that IS a power of 2 CANNOT be such a sum. 


I didn’t expect that any of them would be able to prove these and indeed none
could make a start. So I asked: how can one express notationally a sum of at least
two consecutive natural numbers? Someone suggested $a + (a + 1)$, but it was
quickly agreed that that wasn’t good enough and it was a while before we settled
on $a + (a + 1) + (a + 2) + \cdots + (a + b)$; $a, b \in \mathbb{N}$.

Could they now prove conjecture (ii), that is:


is $2^m = a + (a + 1) + (a + 2) + \cdots + (a + b)$ impossible for $a, b, m \in \mathbb{N}$
(and, trivially, $m = 0$)?


Could anyone tell me what $a + (a + 1) + \cdots + (a + b)$ summed to? After
a while (which entailed going over high school work in connection with
$1 + 2 + 3 + \cdots + b = b(b + 1)/2$, as some had said, from memory and without
reservation, that it came to $b(b + 1)$), we found the rearranged sum to be:


$(a + a + a + \cdots + a) + (1 + 2 + \cdots + b) = a(b + 1) + b(b + 1)/2$,


which comes to $(b + 1)(2a + b)/2$.
Could they now prove that:


$2^m = (b + 1)(2a + b)/2$, that is $2^{m+1} = (b + 1)(2a + b)$, is impossible for
$a, b, m \in \mathbb{N}$?


They were stuck again, and so I asked: if the product of two natural numbers is a
power of 2, what can you tell me about those two natural numbers? Can anyone
prove that the only factors of $2^n$, $n \in \mathbb{N}$, are $2^r$, $r = 0, 1, 2, \ldots, n$? And, while
we’re at it, what can one say about the factors of $3^n, 4^n, 5^n, 6^n, \ldots$?

It was seen that we really did have a problem here when they came up with
examples like: 12 is a factor of $6^2$, but 12 is not a power of 6; 2 is a factor of 4, but 2
is not a power of 4. Eventually, after much discussion, using only simple ideas
and not resorting to the unique factorization theorem (I will send details to anyone
who is interested), I proved that the only factors of $2^n$ are $2^r$ ($r = 0, 1, 2, \ldots, n$),
and a proof of conjecture (ii) easily followed:

Suppose $2^m = a + (a + 1) + \cdots + (a + b)$ for some $a, b, m \in \mathbb{N}$; then
$2^m = (b + 1)(2a + b)/2$, and so $2^{m+1} = (b + 1)(2a + b)$. Then $(b + 1)$ and $(2a + b)$
are $2^x$ and $2^y$, some $x, y \in \mathbb{N}$ (with, though it is not needed, $x + y = m + 1$),
observing that neither $x$ nor $y$ could be 0 as $(b + 1)$ and $(2a + b)$ are
both greater than 1. Addition gives $(2a + 2b + 1) = 2^x + 2^y$, which is clearly
impossible, since the left side is odd and the right side is even.

All of this had been quite a struggle, and so it was a relief that a solution to (i)
came more quickly. First we had some discussion as to the shape of a natural
number that is NOT a power of 2; it must be like this: $2^c \cdot (2d + 1)$, $c$ chosen
from $0, 1, 2, \ldots$; $d$ from $1, 2, 3, \ldots$. Now, (conjecture (i)) question:

Given $c$ and $d$ as above, are there $a$ and $b \in \mathbb{N}$ such that
$2^c \cdot (2d + 1) = a + (a + 1) + (a + 2) + \cdots + (a + b)$? That is: $2^c \cdot (2d + 1) = (b + 1)(2a + b)/2$,
which someone replaced by: $2^{c+1} \cdot (2d + 1) = (b + 1)(2a + b)$.

At first no one knew what to do, so I asked for values of $c$ and $d$ to be chosen;
someone suggested $c = 4$ and $d = 7$, and I asked if anyone could find $a, b \in \mathbb{N}$
with $2^5 \cdot 15 = (b + 1)(2a + b)$. Clare suggested choosing $b = 14$, making
$b + 1 = 15$. Then? She also suggested setting $2a + b = 2^5$, which gives $a = 9$. Good!
So, $2^4 \cdot 15 = (240) = 9 + 10 + 11 + \cdots + 22 + 23$.

Another example? Someone chose $c = 4$ and $d = 2$, which led in identical
manner to $2^4 \cdot 5 = (80) = 14 + 15 + 16 + 17 + 18$, but when I asked for another
example with $c$ held constant at 4 and $d$ chosen differently, I was offered $d = 31$,
and then Orla, with $2^5 \cdot 63$, suggested choosing $b + 1 = 63$, giving $b = 62$, and
then $2^5 = 2a + b = 2a + 62$ gave $a = -15$. The uniform reaction of the class
was to reject this value (“it’s not a natural number!”) and then Clare suggested not
choosing $b + 1$ to be 63, but to be $2^5$. Good! That gave $b$ to be 31, and then
setting $2a + b = 63$ gave $a = 16$, and so:


$2^4 \cdot 63 = (1008) = 16 + 17 + 18 + \cdots + 46 + 47$.


It then gave me great pleasure to point out that the ‘rejected’ value of $a$ ($-15$),
together with the original $b$ (62), gave the same conclusion! $a = -15$, $b = 62$ give
$a + b = 47$ and $2^4 \cdot 63 = -15 - 14 - \cdots - 1 + 0 + 1 + \cdots + 14 + 15 + 16 + \cdots + 47$.
Many of them seemed to be taken aback with that, but more
examples made them feel at home with it and it was soon clear that either
approach (with respect to the ‘choice’) now led to a proof of conjecture (i).
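The choice made in class can be turned into a small procedure: split $n$ into its odd part $2d + 1$ and the power $2^{c+1}$, and let the smaller of the two be $b + 1$. The following is a sketch of that idea; the function name and the return convention (`None` for powers of 2) are my own assumptions.

```python
def consecutive_run(n):
    """Return (a, b) with n = a + (a+1) + ... + (a+b), a >= 1, b >= 1,
    or None when n is a power of 2."""
    odd, power = n, 1
    while odd % 2 == 0:          # strip factors of 2: n = power * odd
        odd //= 2
        power *= 2
    if odd == 1:
        return None              # n is a power of 2
    big = 2 * power              # this is 2**(c+1)
    if odd < big:
        b = odd - 1              # b + 1 = odd part, 2a + b = 2**(c+1)
        a = (big - odd + 1) // 2
    else:
        b = big - 1              # b + 1 = 2**(c+1), 2a + b = odd part
        a = (odd - big + 1) // 2
    return a, b
```

`consecutive_run(240)` gives `(9, 14)` and `consecutive_run(1008)` gives `(16, 31)`, matching the two examples worked in class; taking the smaller factor as $b + 1$ is what keeps $a$ positive.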

It’s easy to see that if ‘at least two’ is replaced by ‘three or more’ (Halmos’
problem) then only the primes from the above ‘can’ list transfer to the ‘can’t’ list,
as $(b + 1)(2a + b)/2$ is clearly composite for $a, b \in \mathbb{N}$ and $b > 1$ (‘three or
more’); and every composite is so representable, and so stays in the ‘can’ list.

I then asked them to consider this question, the ‘related problem’ of the title:
Some natural numbers can/can’t be expressed as a sum of at least two consecutive
even/odd natural numbers. For example: $8 = 3 + 5$, $12 = 2 + 4 + 6$, etc.; but 7
and 13 are not so representable. Which can/can’t? Check to 20.

To the surprise of many these were the respective lists: 


can: 4, 6, 8, 9, 10, 12, 14, 15, 16, 18, 20. 
can’t: 1,2, 3,5, 7, 11, 13, 17, 19. (for them: 1 and the primes) 


But could they prove the following conjectures (having discussed how one might 
represent a sum of consecutive even/odd natural numbers): 


(iii) if $m \in \mathbb{N}$ and $m$ is composite, are there $a, b \in \mathbb{N}$ with
$m = a + (a + 2) + (a + 4) + \cdots + (a + 2b)$?
(iv) if $p \in \mathbb{N}$ and $p$ is prime (or 1), is this impossible:
$p = a + (a + 2) + (a + 4) + \cdots + (a + 2b)$, all $a, b \in \mathbb{N}$?
They were able to show that

$$a + (a + 2) + (a + 4) + \cdots + (a + 2b) = (a + a + a + \cdots + a) + (2 + 4 + \cdots + 2b)$$
$$= (b + 1)a + 2(1 + 2 + \cdots + b) = (b + 1)a + 2 \cdot b(b + 1)/2$$
$$= (b + 1)a + b(b + 1) = (b + 1)(a + b),$$

and then a proof of (iv) followed instantly. So did a proof of (iii), by the same
approach as before; not smoothly, but at least not with as many holdups as
earlier!

Finally, if you investigate which natural numbers can/can’t be expressed as
$a + (a + 3) + (a + 6) + \cdots + (a + 3b)$, $a, b \in \mathbb{N}$, you will find (the latter list is
complete up to 100):


can: 5, 7, 9, 11, 12, 13, 15, 17, 18, 19, 21, 22, 23, 24, 25, 26, 27, 29, 30,... 
can’t: 1, 2, 3, 4, 6, 8, 10, 14, 16, 20, 28, 32, 44, 52, 56, 64, 68, 76, 88,... 
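All three pairs of lists (steps 1, 2 and 3) can be regenerated by brute force. The sketch below assumes, as in the text, that $a \ge 1$ and that there are at least two terms; the parameter name `step` and both function names are illustrative.

```python
def representable(n, step):
    """True if n = a + (a+step) + ... + (a+step*b) with integers a >= 1, b >= 1."""
    terms = 2
    while True:
        # With `terms` summands the sum is terms*a + step*terms*(terms-1)/2.
        base = step * terms * (terms - 1) // 2
        if terms + base > n:       # even a = 1 overshoots: stop searching
            return False
        if (n - base) % terms == 0:
            return True
        terms += 1

def cant_list(limit, step):
    """All n <= limit that are not representable with the given step."""
    return [n for n in range(1, limit + 1) if not representable(n, step)]
```

`cant_list(20, 1)` returns the powers of 2 up to 16, `cant_list(20, 2)` returns 1 and the primes below 20, and `cant_list(100, 3)` reproduces the nineteen entries listed above for step 3.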


Things are a bit more jumbled up than before, but here are some 
observations /exercises which I leave to the reader: 


(0) (a trivial one) Every odd number from 5 is in the ‘can’ list (and so, of the 
primes, only 2 and 3 are in the other). 


(1) Every power of 2 is in the ‘can’t’ list (as with $c = 1$). In fact, if you consider
which natural numbers can/can’t be represented by general sums of the form:


$a + (a + c) + (a + 2c) + \cdots + (a + bc)$, $a, b, c \in \mathbb{N}$,


then NO power of 2 can be so represented when $c$ is odd (the above cases are
$c = 1$ and 3), but ALL powers of 2 can be so represented (apart from small, easily
explained exceptions) when $c$ is even.


(2) When c = 1 or 2 both ‘can’ lists are closed under product (the product of 
two non-powers of 2 is a non-power of 2, and the product of two composites is also 
composite). In fact the ‘can’ list is closed under product for all c. Prove it! 


(3) There are many other structural questions that occur on examining the lists
(and I don’t wish to spoil your fun of finding your own) but just to mention one:
consider $g$ from a ‘can’t’ list with $g > 1$.


$c = 1$: the product of two such $g$’s remains in the ‘can’t’ list.
$c = 2$: the product of two such $g$’s goes to the ‘can’ list.
$c = 3$: here some products remain in the ‘can’t’ list (e.g. $2 \cdot 3$, $2 \cdot 4$, $2 \cdot 8$, etc.)
while others move to the ‘can’ list (e.g. $2 \cdot 6$, $3 \cdot 6$, $3 \cdot 3$, etc.). Prove (or
disprove!) that if $g$ is NOT a power of 2 then all proper powers of $g$ are
in the ‘can’ list. This is vacuous in the $c = 1$ case, is trivial in the $c = 2$
case, and is false in the $c = 4$ case (e.g. $3, 5, 7, 9, 25, 49 \in$ ‘can’t’ list).


I wish to thank Professor Halmos for his very generous reply on receiving a first 
draft of this note, and record that in the problems section of his book he asks: 
which positive integers are sums of two or more consecutive integers?, and also 
that in the solutions section he asks (and answers): which positive integers are 
sums of two or more consecutive odd positive integers?, and: which positive 
integers are sums of three or more consecutive positive integers? 


Mathematics Department
Dublin 9

A “Popular” Class Number Formula


Kurt Girstmair 


1. A THEOREM ON THE DIGITS OF 1/p. Even if one does not know what the
class number is, one can understand the following consequence of our Theorem,
which is quite deep (in the mathematical sense):

Let $p \ge 7$ be a prime number, $p \equiv 3 \bmod 4$, and let

$$1/p = 0.x_1x_2x_3\ldots$$

be the decimal representation of $1/p$. Thus $x_k \in \{0, 1, \ldots, 9\}$ means the $k$th digit
of $1/p$. This representation of $1/p$ is recurring (or periodic), i.e.,

$$1/p = 0.x_1 \ldots x_n\,x_1 \ldots x_n\,x_1x_2\ldots$$

holds for a certain natural number $n$. The smallest possible number $n$ of this kind
is called the period length of $1/p$. Suppose now that $1/p$ has the period length
$n = p - 1$. This is the case, e.g., for $p = 7, 19, 23, 47, 59$, but not for
$p = 11, 31, 43, 67$. Then

$$x_1 + x_3 + x_5 + \cdots + x_{p-2} < x_2 + x_4 + \cdots + x_{p-1}, \tag{1}$$

and the difference of these numbers is a multiple of 11.
Note that the sums in (1) are equal if $p \equiv 1 \bmod 4$. This, however, is not
deep and is shown in Section 2. The inequality (1) means that an average digit of
$x_1, x_3, \ldots, x_{p-2}$ is smaller than an average digit of $x_2, x_4, \ldots, x_{p-1}$.
Example. Let $p = 47$. From

$$1/47 = 0.0212765957446808510638297872340425531914893617\ldots$$

we get $x_1 + x_3 + \cdots + x_{45} = 76$ and $x_2 + x_4 + \cdots + x_{46} = 131$, whose difference
is 55. Hence the average of $x_1, x_3, \ldots$ is $3\tfrac{7}{23}$, whereas the average of $x_2, x_4, \ldots$ is
$5\tfrac{16}{23}$.
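The two digit sums in this example can be reproduced by ordinary long division. The sketch below is illustrative only; the function names are hypothetical and the default base 10 is chosen to match the example.

```python
def period_digits(p, g=10):
    """The first p - 1 base-g digits x_1, ..., x_{p-1} of 1/p by long division."""
    digits, rem = [], 1
    for _ in range(p - 1):
        rem *= g
        digits.append(rem // p)
        rem %= p
    return digits

def odd_even_sums(p, g=10):
    """Return (x_1 + x_3 + ..., x_2 + x_4 + ...) over one block of p - 1 digits."""
    x = period_digits(p, g)
    return sum(x[0::2]), sum(x[1::2])
```

`odd_even_sums(47)` returns `(76, 131)`, and `odd_even_sums(7)` returns `(8, 19)`, whose difference is 11.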


The Theorem holds for the digit expansion of $1/p$ with respect to an arbitrary
basis $g \in \mathbb{Z}$, $g \ge 2$, provided that $g$ is a primitive root mod $p$. The property of
being primitive can be characterized as follows: For $k = 0, 1, 2, 3, \ldots$ define
$g_k \in \{0, 1, \ldots, p - 1\}$ by

$$g_k \equiv g^k \bmod p.$$

Then $g$ is primitive if, and only if, $g_k$ runs through all of $1, \ldots, p - 1$ if $k$ does.
The assumption “$g$ primitive” just means that the period length of the digit
expansion of $1/p$ with respect to the basis $g$ equals $p - 1$ (see [3]; [3] and [1] are
standard references for digit expansions).

The most important notion in the Theorem is the class number $h$ of the
discriminant $-p$ (which is identical with the class number of the field $\mathbb{Q}(\sqrt{-p})$).
A definition of $h$ is given in Section 2. For the reader not interested in details it
may suffice that $h$ is a positive integer and one of the fundamental number
theoretic data attached to $p$. That is why number theorists need a formula to
compute $h$ for a given $p$. The Theorem supplies such a “class number formula.”


Theorem. Let $p \ge 7$ be a prime, $p \equiv 3 \bmod 4$, and $g$ a primitive root mod $p$. Let

$$\frac{1}{p} = \sum_{k=1}^{\infty} x_k g^{-k}, \qquad x_k \in \{0, 1, \ldots, g - 1\}, \tag{2}$$

be the (uniquely determined) digit expansion of $1/p$ with respect to the basis $g$. Let $h$
be the class number of the discriminant $-p$. Then

$$\sum_{k=1}^{p-1} (-1)^k x_k = (g + 1)h.$$


In the case $g = 10$ the Theorem says

$$(x_2 + x_4 + \cdots + x_{p-1}) - (x_1 + x_3 + \cdots + x_{p-2}) = 11 \cdot h,$$

whence (1) follows. We shall see that (2) is a transformation of a famous class
number formula of G. P. L. Dirichlet (1805-1859). In order to state Dirichlet’s
formula, we need the Legendre symbol

$$\left(\frac{k}{p}\right)$$

of an integer $k$ not divisible by $p$. By its definition, $\left(\frac{k}{p}\right) = 1$ if $k$ is a square mod $p$,
i.e., $k \equiv j^2 \bmod p$ for some $j \in \mathbb{Z}$; otherwise $\left(\frac{k}{p}\right) = -1$. The Legendre symbol has
the following properties, which will be used in the proof of the Theorem (they are
easy to prove, see [4], p. 35 f.):

(A) For a primitive root $g$ mod $p$, $\left(\frac{g}{p}\right) = -1$.

(B) If $k \equiv j \bmod p$, $\left(\frac{k}{p}\right) = \left(\frac{j}{p}\right)$.

(C) For all $k$ and $j$ (not divisible by $p$), $\left(\frac{kj}{p}\right) = \left(\frac{k}{p}\right)\left(\frac{j}{p}\right)$.

Now Dirichlet’s class number formula says: For each prime $p \equiv 3 \bmod 4$, $p \ge 7$,

$$h = -\sum_{k=1}^{p-1} \left(\frac{k}{p}\right)\frac{k}{p}. \tag{3}$$

In Section 2 we give some indication of the ingredients of the proof of (3). A
complete proof can be found, e.g., in [4], p. 300 ff. Since it is a class number, the
expression on the right side of (3) must be positive. There seems to be no other
way to see this (except in special cases). That is why the positivity of the right side
of (3) belongs to the deep results of number theory; and so does (1), which is
equivalent to this result in an elementary manner.
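Dirichlet's formula (3) and the digit formula of the Theorem can be compared numerically. The sketch below makes two assumptions that are not in the text: the Legendre symbol is evaluated with Euler's criterion (a standard fact), and the function names are hypothetical. The digit version is only meaningful when $g$ is a primitive root mod $p$.

```python
def legendre(k, p):
    """Legendre symbol (k/p) for p an odd prime not dividing k, via Euler's criterion."""
    t = pow(k, (p - 1) // 2, p)
    return -1 if t == p - 1 else t

def class_number_dirichlet(p):
    """h from formula (3): h = -(1/p) * sum of (k/p) * k for k = 1..p-1."""
    total = sum(legendre(k, p) * k for k in range(1, p))
    return -total // p

def class_number_from_digits(p, g=10):
    """h from the Theorem: alternating digit sum of 1/p in base g, divided by g + 1."""
    rem, alternating = 1, 0
    for k in range(1, p):
        digit, rem = divmod(rem * g, p)      # k-th digit by long division
        alternating += digit if k % 2 == 0 else -digit
    return alternating // (g + 1)
```

For $p = 47$ both functions return 5, in line with the difference $55 = 11 \cdot 5$ found in the example of Section 1.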


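Proof of the Theorem: Since $g$ is primitive,

$$\sum_{k=1}^{p-1}\left(\frac{k}{p}\right)\frac{k}{p} = \sum_{k=1}^{p-1}\left(\frac{g_k}{p}\right)\frac{g_k}{p}.$$

By the properties (A)-(C) of the Legendre symbol we obtain

$$\left(\frac{g_k}{p}\right) = \left(\frac{g^k}{p}\right) = \left(\frac{g}{p}\right)^k = (-1)^k.$$

For these reasons (3) can be written as

$$h = \sum_{k=1}^{p-1} (-1)^{k+1}\,\frac{g_k}{p}.$$

Now

$$(g + 1)h = \sum_{k=1}^{p-1} (-1)^{k+1}\,\frac{g\,g_k}{p} - \sum_{k=1}^{p-1} (-1)^{k}\,\frac{g_k}{p} = \sum_{k=1}^{p-1} (-1)^{k}\,\frac{g\,g_{k-1} - g_k}{p}$$

(the index shift in the first sum is legitimate because $p$ is odd and $g_{p-1} = g_0 = 1$).
For integers $k \ge 1$ put

$$y_k = \frac{g\,g_{k-1} - g_k}{p}.$$

The proof is complete if we can show that $y_k = x_k$ for all $k$. Indeed, $y_k$ is an
integer since $g\,g_{k-1} \equiv g^k \equiv g_k \bmod p$. It is in the range $0, \ldots, g - 1$. For, on the
one hand, $g_{k-1}/p < 1$ yields $y_k < g$; on the other hand, $-g_k/p > -1$ yields
$y_k > -1$. Finally,

$$\sum_{k=1}^{\infty} \frac{y_k}{g^k} = \frac{1}{p}\sum_{k=1}^{\infty}\left(\frac{g_{k-1}}{g^{k-1}} - \frac{g_k}{g^k}\right) = \frac{g_0}{p} = \frac{1}{p}.$$

The uniqueness of the digit expansion shows $y_k = x_k$.
For generalizations of the Theorem and other results about the digits of $1/p$
see [2].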
2. REMARKS ON THE THEOREM. Clearly the identity

$$x_k = \frac{g\,g_{k-1} - g_k}{p} \tag{4}$$

also holds for a prime $p \equiv 1 \bmod 4$ if $g$ is a primitive root mod $p$. We show that in
this case

$$x_1 + x_3 + \cdots + x_{p-2} = x_2 + x_4 + \cdots + x_{p-1} = (g - 1)(p - 1)/4.$$

Let $n = (p - 1)/2$. Since $g^{p-1} \equiv 1 \bmod p$, $g^n \equiv \pm 1 \bmod p$. But $g^n \equiv 1 \bmod p$
contradicts the primitivity of $g$, so $g^n \equiv -1 \bmod p$. Therefore $g_{n+k} \equiv -g_k \bmod p$,
which implies $g_{n+k} = p - g_k$ for all $k \ge 0$. From (4) we obtain

$$x_{n+k} = \frac{g(p - g_{k-1}) - (p - g_k)}{p} = g - 1 - x_k, \qquad k \ge 1,$$

i.e., $x_k + x_{n+k} = g - 1$. Because of $p \equiv 1 \bmod 4$ the number $n$ is even, and

$$x_1 + x_3 + \cdots + x_{p-2} = (x_1 + x_{n+1}) + (x_3 + x_{n+3}) + \cdots + (x_{n-1} + x_{2n-1}) = (g - 1)\,\frac{n}{2}.$$

The same argument works for $x_2 + x_4 + \cdots + x_{p-1}$.
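The complement relation $x_k + x_{n+k} = g - 1$ and the resulting equality of the two half sums can be checked on small cases. This is a sketch with hypothetical function names; the pairs $(p, g) = (17, 10)$ and $(13, 2)$ used in the note below are my own choices of a prime $p \equiv 1 \bmod 4$ with primitive root $g$.

```python
def digits(p, g):
    """The first p - 1 base-g digits of 1/p by long division."""
    out, rem = [], 1
    for _ in range(p - 1):
        digit, rem = divmod(rem * g, p)
        out.append(digit)
    return out

def half_sums(p, g):
    """Return (x_1 + x_3 + ..., x_2 + x_4 + ...) over one block of p - 1 digits."""
    x = digits(p, g)
    return sum(x[0::2]), sum(x[1::2])

def complement_property_holds(p, g):
    """Check x_k + x_{n+k} = g - 1 for k = 1, ..., n with n = (p - 1)/2."""
    x = digits(p, g)
    n = (p - 1) // 2
    return all(x[k] + x[n + k] == g - 1 for k in range(n))
```

For $p = 17$, $g = 10$ both half sums equal $9 \cdot 16/4 = 36$; for $p = 13$, $g = 2$ both equal 3.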
Next we define the class number $h$ of the discriminant $-p$, $p \equiv 3 \bmod 4$. We
consider integral binary quadratic forms (henceforth simply called forms). By this
we mean polynomials of the shape

$$f = aX^2 + bXY + cY^2$$

with coefficients $a, b, c \in \mathbb{Z}$. Two forms $f, f'$ are said to be equivalent, if there is a
matrix

$$A = \begin{pmatrix} r & s \\ t & u \end{pmatrix}$$

with integral entries and determinant $ru - st = 1$ such that

$$f' = f(rX + sY,\, tX + uY).$$

Since the inverse of $A$ is of the same type (integral entries, determinant $= 1$), the
equivalence of forms is well defined; it is an equivalence relation, indeed. The
number $D(f) = b^2 - 4ac$ is called the discriminant of $f$. The discriminant remains
invariant under equivalence: If $f$ and $f'$ are equivalent, $D(f) = D(f')$.
Therefore it is quite natural to consider the equivalence classes of forms with a
fixed discriminant. In the sequel we always assume that $D(f) = -p$, $p$ a prime,
$p \equiv 3 \bmod 4$. There is at least one form with this property, namely

$$f_0 = X^2 + XY + \frac{p + 1}{4}\,Y^2.$$

Moreover, if $f = aX^2 + bXY + cY^2$ and $f' = a'X^2 + b'XY + c'Y^2$ are equivalent,
their first coefficients $a$ and $a'$ have the same sign. Thus the following
definition of the class number $h$ makes sense:


$h$ is the number of equivalence classes of forms with discriminant $-p$ and first
coefficient $> 0$.


Since $f_0$ is such a form, $h$ cannot be 0, so $h \ge 1$. It is not trivial, however, that $h$ is
finite.
Dirichlet was able to connect $h$ with the convergent series

$$\sum_{k=1}^{\infty}\left(\frac{k}{p}\right)\frac{1}{k}.$$

A thorough analysis of quadratic forms and a geometric argument enabled him to
show for $p \ge 7$

$$\sum_{k=1}^{\infty}\left(\frac{k}{p}\right)\frac{1}{k} = \frac{\pi h}{\sqrt{p}}. \tag{5}$$

On the other hand he proved

$$\sum_{k=1}^{\infty}\left(\frac{k}{p}\right)\frac{1}{k} = -\frac{\pi}{\sqrt{p}}\sum_{k=1}^{p-1}\left(\frac{k}{p}\right)\frac{k}{p}. \tag{6}$$

From (5) and (6) formula (3) follows immediately. The proof of (6) involves
complex analysis (Abel’s limit theorem) and the identity

$$\sum_{k=1}^{p-1}\left(\frac{k}{p}\right)e^{2\pi i k/p} = i\sqrt{p}.$$

The difficult part in the proof of this identity is the sign of the right side. It took
Gauss four years to show that the sign is always “+.”
Variations on Wolstenholme’s Theorem


Emre Alkan 


Wolstenholme’s Theorem is stated and proved in [1]. Our aim is to give similar 
results. 


Theorem 1 (Wolstenholme). If $p$ is a prime greater than 3, then the numerator of
the fraction $1 + \frac12 + \frac13 + \cdots + 1/(p - 1)$ is divisible by $p^2$.


We first give an equivalent theorem in the sense that Wolstenholme’s Theorem 
can be deduced from it. 
Theorem 2. If p > 3, then the numerator of 


1 1 1 


is divisible by p. 


Theorem 3. If $p > 3$, then the numerator of $1 + 1/2^2 + \cdots + 1/(p - 1)^2$ is
divisible by $p$.
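Theorems 1 and 3 are easy to test for small primes with exact rational arithmetic. The sketch below is a numerical check, not a proof; the function name and its `power` parameter are illustrative.

```python
from fractions import Fraction

def harmonic_numerator(p, power=1):
    """Numerator (in lowest terms) of 1 + 1/2**power + ... + 1/(p-1)**power."""
    total = sum(Fraction(1, k**power) for k in range(1, p))
    return total.numerator
```

For $p = 5$ the first sum is $25/12$, whose numerator is divisible by $5^2$, and the sum of inverse squares is $205/144$, whose numerator $205 = 5 \cdot 41$ is divisible by 5.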
(Mod p); so these congruences reduce to the set of congruences below.
1.x =(—1)¢?0/(-D(Mod p) 
2.x =(- 1)? ™M/e- (Mod p) 


(p — 1)x =(- 1) ¢0™/(P-D(Mod p) 


The solutions are clearly distinct (Mod p). Hence, they are 1,2,..., p — 1 in some 
order. Finally, we have: 
o(m) o(m) p-1 
C,= ——(1+ 2+-:: +p—-1) = —— - —— -p=0(Modp), 


where C > is the numerator of the fraction. 


ACKNOWLEDGMENT. I wish to thank C. K. Bayram for his help and encouragement in the 
preparation of this note. 
The Second-Partials Test for Local
Extrema of f(x, y)


Leonard Gillman 


The second-partials test for local extrema provides a sufficient condition that a 
critical point of f(x, y) be a local minimum (or a local maximum). Every college 
calculus book presents this test. This note concerns the proof. The standard proof 
is straightforward, and is the one usually given. Some texts however present an 
“advanced calculus” argument, containing epsilonic manipulations beyond the 
comfort level of most freshmen. Finally, there are texts that omit a proof alto- 
gether, as being a topic for advanced calculus. In a “random” sample of a dozen 
freshman texts (i.e., those within ten feet of my desk), the three types of presenta- 
tion were about equally divided. The two types of proof also appeared equally in 
my half-dozen advanced calculus texts. 

I believe that the standard proof is appropriate to the course and moreover 
furnishes an enlightening view of mathematics at work. 


1. THE ONE-VARIABLE CASE. For background, start with one variable. Assume
that $f(x)$ is twice differentiable on an open interval about a critical point $x = a$.
There are two versions of the test, which faute de mieux I’ll call geometric and
analytic.


Geometric. The standard second-derivative test argues that if $f''(x) > 0$ on the
interval then $f'$ is increasing there, so $f$ is concave up and $a$ is a local minimum.


Analytic. This version, not always mentioned, is based on Taylor’s formula for
$n = 1$. Since $f'$ vanishes at $a$, the formula reduces to

$$f(a + h) - f(a) = \tfrac12 h^2 f''(a + \theta h), \qquad 0 < \theta < 1.$$

It is then obvious that $a$ is a local minimum.


2. THE TWO-VARIABLE CASE. The test for a local minimum of $f(x, y)$ at a
critical point $(a, b)$ depends on the following fact:

$$AX^2 + 2BXY + CY^2 \text{ is positive definite provided that } A > 0 \text{ and } \Delta > 0, \text{ where } \Delta = AC - B^2. \tag{1}$$

(To prove (1), complete the square to write the form as $[(AX + BY)^2 + \Delta Y^2]/A$.)
In the test itself, $A$, $B$, and $C$ are given by

$$A = f_{xx}(x, y), \qquad B = f_{xy}(x, y), \qquad C = f_{yy}(x, y),$$

assumed continuous on a neighborhood of $(a, b)$. The condition for a local
minimum is that the inequalities

$$A > 0 \quad\text{and}\quad \Delta > 0$$

hold at $(a, b)$. By continuity, they then hold on some open disk $D$ about $(a, b)$.
Again there are two versions of the test. This time the analytic version is the
popular one.


Analytic. Since $f_x(a, b) = f_y(a, b) = 0$, Taylor’s formula for $n = 1$ reduces to

$$f(a + h, b + k) - f(a, b) = \tfrac12\bigl[Ah^2 + 2Bhk + Ck^2\bigr], \tag{2}$$

where $A$, $B$, and $C$ are evaluated at some point $(a + th, b + tk)$, $0 < t < 1$. Any
such point lies in the disk $D$ whenever $(a + h, b + k)$ does. By (1), the right side
of (2) is positive for all points $(a + h, b + k) \ne (a, b)$ in $D$, and therefore $f$ has a
local minimum at $(a, b)$.
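Fact (1) and the completed-square identity behind it can be illustrated with a few lines of code. This is a minimal sketch: the function names are hypothetical, and the sample values of $A$, $B$, $C$ are chosen by me (for instance $A = 2$, $B = 1$, $C = 2$ are the second partials of $x^2 + xy + y^2$).

```python
def quadratic_form(A, B, C, X, Y):
    """Value of A X^2 + 2 B X Y + C Y^2."""
    return A * X * X + 2 * B * X * Y + C * Y * Y

def completed_square(A, B, C, X, Y):
    """The same value written as [(A X + B Y)^2 + Delta Y^2] / A, for A != 0."""
    delta = A * C - B * B
    return ((A * X + B * Y) ** 2 + delta * Y * Y) / A

def satisfies_minimum_condition(A, B, C):
    """The sufficient condition of (1): A > 0 and Delta = AC - B^2 > 0."""
    return A > 0 and A * C - B * B > 0
```

With $A = 2$, $B = 1$, $C = 2$ the condition holds and both expressions give 38 at $(X, Y) = (3, -5)$; with $A = 2$, $B = 0$, $C = -2$ (the second partials of $x^2 - y^2$) the condition fails.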

How can such a nice proof get to be complicated? The standard way seems to
be to use the form $\varphi(x, y) + R$ instead of $\varphi(a + \theta h, b + \theta k)$ and then get
involved making $R$ small. The epsilonics are not only forbidding in their own right
but add twenty lines to the argument. No student is going to struggle through that.


Geometric. This version casts the problem in terms of a function of a single
variable. Introduce polar coordinates (r, α) relative to (a, b):


$$(x, y) = (a + r\cos\alpha,\; b + r\sin\alpha),$$
and define
$$F(r) = f(x, y) = f(a + r\cos\alpha,\; b + r\sin\alpha).$$
Now differentiate twice with respect to r:
$$F'(r) = f_x(x, y)\cos\alpha + f_y(x, y)\sin\alpha,$$
$$F''(r) = A\cos^2\alpha + 2B\cos\alpha\sin\alpha + C\sin^2\alpha.$$
(These are $\nabla_\alpha f$ and $\nabla_\alpha\nabla_\alpha f$, the first and second directional derivatives in the
direction α.)

Since the sine and cosine are never zero together, (1) shows that F''(r) > 0 at
all points of D. Then the trace of f in every direction from (a, b) is concave up
throughout D. Therefore f has a local minimum at (a, b).

The derivation of the two-variable Taylor expansion also starts out with a single 
variable, setting F(t) = f(a + th,b + tk) and differentiating twice; and in fact, 
except for notation, this is identical to finding F''(r). Thus in the two-variable
setting, the analytic and geometric proofs are very closely related. Still, I think the 
geometric version has a certain charm to it and should be better known. 


ACKNOWLEDGMENT. I am indebted to the Notes editor for a helpful mix of prodding and advice. 
PICTURE PUZZLE
(from the collection of Paul Halmos) 


This is not an algebraic Hawaiian picture but an analytic one. 
(see page 1012.) 
Answer to Picture Puzzle
(p. 1006) 


Richard Courant, Peter Lax, and Michael Atiyah relaxing in Honolulu 
in 1969. 
UNSOLVED PROBLEMS
Edited by: Richard Guy and Richard Nowakowski 


In this department the MONTHLY presents easily stated unsolved problems dealing 
with notions ordinarily encountered in undergraduate mathematics. Each problem 
should be accompanied by relevant references (if any are known to the author) and by 


a brief description of known partial or related results. Typescripts should be sent to 
Richard Guy, Department of Mathematics & Statistics, The University of Calgary, 
Alberta, Canada T2N 1N4.


Mousetrap 


Richard K. Guy and Richard J. Nowakowski 


Cayley [2, 3] introduced a permutation problem he called Mousetrap which is 
loosely based on the card game Treize [1]. Suppose that the numbers 1, 2, ..., n
are written on cards, one to a card. After shuffling (permuting) the cards, start 
counting the deck from the top card down. If the number on the card does not 
equal the count, transfer the card to the bottom of the deck and continue counting. 
If the two are equal then set the card aside and start counting again from 1. The 
game is won if all the cards have been set aside, but lost if the count reaches 
n + 1.

For example the permutation 1243 will win, the cards being set aside in the 
order 1,3,4,2, while 1324 only sets aside cards 1 and 2. 
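
The rules above translate directly into a short simulation. This is a minimal sketch, with the function name `mousetrap` chosen here for illustration; it returns the cards in the order they are set aside, so a permutation wins exactly when all n cards are returned.

```python
from collections import deque


def mousetrap(perm):
    """Play Mousetrap on a deck (top card first); return the cards set aside, in order."""
    n = len(perm)
    deck = deque(perm)
    hits = []
    count = 1
    # The game is lost once the count would reach n + 1.
    while deck and count <= n:
        card = deck.popleft()
        if card == count:
            hits.append(card)   # set the card aside and restart the count
            count = 1
        else:
            deck.append(card)   # move the card to the bottom of the deck
            count += 1
    return hits


print(mousetrap([1, 2, 4, 3]))  # [1, 3, 4, 2]
print(mousetrap([1, 3, 2, 4]))  # [1, 2]
```

The two printed results reproduce the worked example in the text: 1243 wins, while 1324 sets aside only cards 1 and 2.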

Cayley proposed two questions. 


1. For each n find all the winning permutations of 1, 2, ..., n.
2. For each n find the number of permutations that eliminate precisely i cards
for each i, 1 ≤ i ≤ n.


An answer for question 2 would illuminate question 1, but very little is known 
about either. 

In [3], Cayley lists all the possible outcomes for n = 4. Steen [7] notes that 
Cayley made some errors. He goes on to calculate, for any n, the number of 
permutations that have i, 1 ≤ i ≤ n, as the first card set aside. This number he
denotes by $a_{n,i}$; and he used $b_{n,i}$ ($c_{n,i}$) to denote the number of permutations that
have 1 (respectively 2) as the first hit and i as the second. He obtained the
recurrence relations


$$a_{n,i} = a_{n,i-1} - a_{n-1,i-1}, \qquad b_{n,i} = a_{n-1,i-1}$$


(i-1-k) 


i—2 if 
Cri = np ~ (i- Ve,-1.14+ Le (-1) 5 C,-x,., forn >i+1 
k=2 
and used them to show that for 0 < i ≤ n,


$$a_{n,0} = n\,a_{n-1,0} + (-1)^n, \quad a_{0,0} = 1; \qquad a_{n,i} = \sum_{k=0}^{i-1} (-1)^k \binom{i-1}{k}(n-1-k)!$$
By ij = An-1,i-1 ~ 4n-2,i1-2 ~ 4n-3,i-2 
k(k +3 
= - 5 (- 1st 1 KC 5 OF?) on i +k — 1 I=(i — 1)(n — 3)!4(n — 2)! 


Steen denoted the sums of the $a_{n,i}$, $b_{n,i}$, $c_{n,i}$, taken over 0 ≤ i ≤ n (but omitting
i = 0, i = 1, i = 2 respectively) by $a_n$, $b_n$, $c_n$ and further showed that


$$a_n = n\,a_{n-1} + (-1)^{n+1}, \qquad b_n = a_{n-1}$$


and deduced a complicated expression for $c_n$ from his formula for $c_{n,i}$ which
unfortunately holds for neither i = n nor i = n - 1. In [4] there are corrected
versions of Steen's formulas:


cana = E(-1)*{" ¢3)(n k= 2) 


k=0 


o) 
I 


ceo Boots 


k=0 


+(R + on =k —~k—3)i+2(-1)"~° 


$$c_n = (n-2)(n-2)! - \left[\!\left[\tfrac{1}{e}\big((n-1)! - (n-2)! - 2(n-3)!\big)\right]\!\right]$$


where [[x]] is the nearest integer to x. 

If i = 0, so that no card is set aside then the permutation is a derangement ([6],
sequence 766), that is, card j is in place j for no value of j. Sloane [6] also contains
some related sequences. Steen's $a_n$, $c_n$ and $(n-2)(n-2)! - c_n$ are sequences
1423, 1186 and 1635, respectively. The first edition of [6] cited Steen’s sometimes 
erroneous values of the last two for 3 < n < 10. They have been corrected in the 
forthcoming second edition. They can be calculated from the formula above. 
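
The first-hit counts can be checked by brute force for small n. The sketch below rests on two assumptions made here for illustration: that the first card set aside is the smallest i with card i in place i (no card moves relative to the count before the first hit), and that the closed form for $a_{n,i}$ reads $\sum_{k=0}^{i-1}(-1)^k\binom{i-1}{k}(n-1-k)!$, which is our reading of the display. It also compares the number of permutations with at least one hit against the nearest integer to n!(1 - 1/e).

```python
from itertools import permutations
from math import comb, e, factorial


def first_hit_counts(n):
    """counts[i] = number of permutations whose first hit is card i (index 0: no hit)."""
    counts = [0] * (n + 1)
    for p in permutations(range(1, n + 1)):
        # The first hit is the smallest fixed point; 0 marks a derangement.
        first = next((k for k, card in enumerate(p, start=1) if card == k), 0)
        counts[first] += 1
    return counts


def a_formula(n, i):
    """Assumed closed form for a_{n,i}, valid for 1 <= i <= n."""
    return sum((-1) ** k * comb(i - 1, k) * factorial(n - 1 - k) for k in range(i))


for n in range(1, 8):
    counts = first_hit_counts(n)
    assert all(counts[i] == a_formula(n, i) for i in range(1, n + 1))
    # Permutations with at least one hit versus the nearest integer to n!(1 - 1/e)
    assert factorial(n) - counts[0] == round(factorial(n) * (1 - 1 / e))

print(first_hit_counts(4))
```

Enumeration is only practical for small n; the loop stops at n = 7 (5040 permutations).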

Notice that the number of permutations which eliminate at least one card is the
nearest integer to n!(1 - 1/e). This is the special case j = n - 1 of a more general
problem [5]: find the minimum number of permutations of 1, 2, ..., n which
contain all permutations of a given j-element subset. More generally for n > 2k - 1
there is an expression of the form 


IG — 1)!—(k — 1)(n — 2)!+(k? — 5k + 2)(n — 3)!+ + + 


(—1)""'(2k — 3)(m — 2k + 2)!+(-1)* '2(n - 2k + 1)!) 


F 


A third question arose during our investigations about which we also know very 
little. Consider a permutation for which every number is set aside. The list of 
numbers in the order that they were set aside is another permutation. Any 
permutation obtained in this way we call a reformed permutation. 
3. Characterize the reformed permutations.


Not all permutations are reformed permutations. For example, permutations on
n objects are not reformed permutations if they start with n; or with x, n - 1, y
where y ≠ n ≠ x. On the other hand, the identity permutation is always a
reformed permutation; the permutations yielding this are 1, 12, 132, 1423, 13254, 
142563, 1527436, 16245378, 142863795,.... 

The permutation 4213 is a winning permutation which gives rise to the permuta- 
tion 2134; this in turn gives the reformed permutation 3214 which is not a winning 
permutation. 


4. For a given n, what is the longest sequence of reformed permutations? 


Table 1, whose column sums are n!, gives the numbers of permutations yielding
sequences of length l.


TABLE 1. NUMBERS OF SEQUENCES OF REFORMED PERMUTATIONS.


n =   1   2   3    4     5     6      7       8        9
      1   1   4   18   105   636   4710   38508   352902
      -   1   2    4    14    72    316    1730     9728
      -   -   -    2     1    11     14      81      242
      -   -   -    -     -     1      -       1        8


5. Are there sequences of arbitrary length? Are there any cycles other than
1 → 1 → 1 → 1 ... and 12 → 12 → 12 → 12 ...?


MODULAR MOUSETRAP. We can play Mousetrap, but instead of counting
..., n, n + 1, ..., we can start again: ..., n, 1, 2, .... Now at least as many cards get
set aside. In fact if n is prime, then either the initial deck is a derangement, or all
cards get set aside, so every sequence cycles or terminates in a derangement. The
identity permutation 123... will always form a 1-cycle and now there are also
examples of nontrivial cycles.

For n = 2, 12 → 12 → 12 → ... cycles and 21 terminates.

For n = 3, 132 → 123 → 123 → ... cycle, while 321 → 213 → 312 and 231
terminate.

If n is composite, the number of cards set aside may be strictly between 0 and
n. As before, exactly n - 1 cards cannot be set aside; and it's easy to see that
neither can just one card. For example, with n = 4 there are 9 derangements with 
the permutations 2431 and 4132 at distance 1 from two of them; 7 permutations 
which set aside just 2 cards and 4213 at distance 1 from one of them; and the 
1-cycle 1234 with two permutations at distance 1 from it and 1243 and 1432 at 
distance 2. 

For n = 5 there is a 2-cycle, 21345 and 32145; and there is also a path of length 
4: 

54321 → 34215 → 52143 → 21435 → 51423.
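
The modular variant can be simulated in the same way as the original game. The sketch below is an illustration under one stated assumption: play stops once n times the current deck size consecutive counts have passed without a hit, since by then every combination of deck rotation and count has recurred and no further hit is possible. The function name is hypothetical.

```python
from collections import deque


def modular_mousetrap(perm):
    """Modular Mousetrap: the count wraps from n back to 1. Returns the cards set aside."""
    n = len(perm)
    deck = deque(perm)
    hits = []
    count = 1
    misses = 0
    # After n * len(deck) straight misses the (rotation, count) state has repeated.
    while deck and misses < n * len(deck):
        card = deck.popleft()
        if card == count:
            hits.append(card)
            count = 1
            misses = 0
        else:
            deck.append(card)
            count = count % n + 1
            misses += 1
    return hits


print(modular_mousetrap([3, 2, 1]))        # [2, 1, 3]
print(modular_mousetrap([2, 1, 3]))        # [3, 1, 2]
print(modular_mousetrap([3, 1, 2]))        # []
print(modular_mousetrap([5, 4, 3, 2, 1]))  # [3, 4, 2, 1, 5]
```

Iterating the function on its own output traces the chains quoted in the text, for example 321 → 213 → 312, which ends in a derangement.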


6. Are there k-cycles for every k? What is the least value of n which yields a
k-cycle? 


We are grateful to Sherwood Washburn for bringing Mousetrap to our attention and for supplying 
copies of the early literature. 
December and June


cold 
winds howl 
geese go south 
nights long June waits 
temperatures fall low
ponds freeze snowmen grow 
toboggans slide down hillsides 
sun hides ice coats June waits 
wood-fires flame snowballs fly 
winds howl groundhogs hibernate


sun glows raspberries ripen 
catbird sings iris blooms 
days bright streams play June dreams 
holiday picnics catch flies 
wheat thrives crickets chirp 
tomato plants climb 
streams dance June plays 
catbird sings 
sun glows 
warm 
From Intersections: Poems by JoAnne Growney,
Kadet Press, Bloomsburg, PA, 1993, p. 50. 


The numbers of syllables in the phrases of this poem 
follow the patterns of factorization of the integers from 1
to 10, then 10 to 1, into prime factors. 


1012 THE AUTHORS [December 


PROBLEMS AND SOLUTIONS 


Edited by: 
Richard T. Bumby, Fred Kochman and Douglas B. West 


Proposed problems should be sent to the MONTHLY PROBLEMS address given on the
inside front cover. If possible, solutions should be
typed with double spacing. Two copies suffice. Several solutions may be mailed
together, but they should be on separate sheets of paper. The problem number and the
solver's name and mailing address should appear on each solution. A mailing label
should be included if an acknowledgment is desired.


The published solution is likely to be based on a solution that is complete and correct.
Additional information, such as references to other appearances of the problem or its 
solution, is also welcome. 


An asterisk (*) after the number of a problem, or part of a problem, indicates that no 
solution is currently available. 


PROBLEMS 


10417. Proposed by Charles Vanden Eynden, Illinois State University, Normal, IL. 
Characterize the positive integers m such that 


$$m^n \equiv 1 \pmod n \implies m \equiv 1 \pmod n.$$


10418. Proposed by Razvan Satnoianu, A. S. E., Bucharest, Romania. 


Given the acute triangle ABC, let $h_a$, $h_b$, and $h_c$ denote the altitudes and s the semiperimeter.
Show that


$$\sqrt{3}\,\max\{h_a, h_b, h_c\} \ge s.$$
10419. Proposed by Bill Correll, Jr. (student), Denison University, Granville, OH.


Let k be an integer greater than or equal to 3. Let S(k) be the set of nonnegative real 
numbers x for which 


er} i ere) 


(a) Determine the largest integer in S(k). 


(b) Show that S(k) is the union of a finite number of intervals with the sum of the lengths
of those intervals equal to $(k^2 - 3k + 6)/2$.


10420. Proposed by C. R. Selvaraj and S. Selvaraj, Penn State University — Shenango, 
Sharon, PA. 


Let 


gi(n) = \- k=i+1 ( (n + 2)* — 2e(n + 1)* + e*n*). 
k=i 


Prove that, for all i > 1, $g_i(n)$ is a polynomial in n of degree i - 2 and $g_i(n) > 0$ for all
n ∈ N and i ∈ N.


10421. Proposed by Gigel Militaru, University of Bucharest, Bucharest, Romania. 


Let n be an integer, n > 3, and let $z_1, \ldots, z_n$ and $t_1, \ldots, t_n$ be complex numbers. Prove
that there exists an integer i, 1 ≤ i ≤ n, with


$$4\,|z_i t_i| \le \sum_{j=1}^{n} |z_i t_j + z_j t_i|.$$


10422. Proposed by Adam Fieldsteel, Wesleyan University, Middletown, CT. 


Let $f : [0,1] \to \mathbb{R}$ be a $C^1$ strictly increasing function with f(1) = L, where L is the
length of the graph of f.


(a) Show that $\int_0^1 f(x)\,dx \ge \pi/4$.
(b) Show that $\int_0^1 f(x)\,dx = \pi/4$ only if the graph of f is a quarter circle.


10423. Proposed by M. Filaseta & C. Nicol, University of South Carolina, Columbia, SC. 


For a positive integer n, let
$$P_n(x) = \sum \{\, x^{j-1} : 1 \le j \le n,\ \gcd(j, n) = 1 \,\}.$$


For example: $P_1(x) = P_2(x) = 1$; $P_3(x) = x + 1$; $P_4(x) = x^2 + 1$; $P_5(x) = x^3 + x^2 + x + 1$;
and $P_6(x) = x^4 + 1$. Prove that $P_n(x)$ is reducible over the rationals for every n > 7.
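
The listed examples pin down the definition: $P_n(x)$ is read here as the sum of $x^{j-1}$ over 1 ≤ j ≤ n with gcd(j, n) = 1. The sketch below (the helper name `p_coefficients` is hypothetical) builds the coefficient list under that reading and reproduces the six examples; it does not test reducibility.

```python
from math import gcd


def p_coefficients(n):
    """Coefficients of P_n, constant term first, under the assumed definition."""
    coeffs = [0] * n
    for j in range(1, n + 1):
        if gcd(j, n) == 1:
            coeffs[j - 1] = 1          # contributes the monomial x^(j-1)
    while len(coeffs) > 1 and coeffs[-1] == 0:
        coeffs.pop()                   # drop trailing zero coefficients
    return coeffs


for n in range(1, 7):
    print(n, p_coefficients(n))
```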
SOLUTIONS


Asymptotic Formulas from Chebyshev Polynomials 


6332 [1981, 150]. Proposed by Petter E. Bjørstad, Stanford University and Henry E. Fettis,
Mountain View, CA.


In analyzing an accelerated numerical integration method for the biharmonic equation, 
the finite sum 


$$S_N = \sum_{k=1}^{N-1} \frac{\sin^2(k\pi/N)}{\left[1 + a^2 - 2a\cos(k\pi/N)\right]^2}$$


arises ($0 < a^2 < 1$). Find a closed algebraic expression for $S_N$ that exhibits its asymptotic
behavior in terms of a and N.


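Solution by Heinz-Jürgen Seiffert, Berlin, Germany. We shall prove that
$$S_N = \frac{N}{2(1-a^{2N})}\left[\frac{1+a^{2N-2}}{1-a^2} - 2N\,\frac{a^{2N-2}}{1-a^{2N}}\right]. \tag{$*$}$$
Consider the Chebyshev polynomials of the first and the second kind defined by
$$T_0(x) = 1,\quad T_1(x) = x,\quad T_{n+1}(x) = 2xT_n(x) - T_{n-1}(x) \quad\text{for } n \ge 1,$$
$$U_0(x) = 1,\quad U_1(x) = 2x,\quad U_{n+1}(x) = 2xU_n(x) - U_{n-1}(x) \quad\text{for } n \ge 1,$$
respectively. It is well-known (I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series
and Products, prepared by A. Jeffrey, Academic Press, 1980, p. 1032) that


$$T_n(x) = \tfrac12\left[\left(x + \sqrt{x^2-1}\right)^n + \left(x - \sqrt{x^2-1}\right)^n\right], \tag{1}$$
$$U_n(x) = \frac{\left(x + \sqrt{x^2-1}\right)^{n+1} - \left(x - \sqrt{x^2-1}\right)^{n+1}}{2\sqrt{x^2-1}}, \tag{2}$$
and
$$U_n(\cos\theta) = \frac{\sin((n+1)\theta)}{\sin\theta}. \tag{3}$$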


In particular, (3) shows that $U_{N-1}$ has the N - 1 distinct roots $\cos(k\pi/N)$, k = 1, ..., N - 1.
Since $U_{N-1}$ has leading coefficient $2^{N-1}$ we have
$$\prod_{k=1}^{N-1}\left(x - \cos\frac{k\pi}{N}\right) = 2^{1-N}\,U_{N-1}(x). \tag{4}$$
Differentiating (4) logarithmically gives
$$\sum_{k=1}^{N-1}\frac{1}{x - \cos(k\pi/N)} = \frac{U'_{N-1}(x)}{U_{N-1}(x)}.$$
Differentiating the latter equation we get
$$\sum_{k=1}^{N-1}\frac{1}{\left(x - \cos(k\pi/N)\right)^2} = \frac{\left(U'_{N-1}(x)\right)^2 - U_{N-1}(x)\,U''_{N-1}(x)}{U_{N-1}(x)^2}. \tag{5}$$
Replacing x by 1/x in (4) we find
$$\prod_{k=1}^{N-1}\left(1 - x\cos\frac{k\pi}{N}\right) = \left(\frac{x}{2}\right)^{N-1} U_{N-1}\!\left(\frac{1}{x}\right).$$


The same procedure as above produces 


(6) 


Now, in (6) we replace x by 1/x, divide the resulting equation by $x^2$, and subtract this result
from (5). Since $1 - \cos^2(k\pi/N) = \sin^2(k\pi/N)$, we obtain
$$S_N(x) = \sum_{k=1}^{N-1}\frac{\sin^2(k\pi/N)}{\left(x - \cos(k\pi/N)\right)^2} = \frac{A_{N-1}(x)}{U_{N-1}(x)^2} - N + 1, \tag{7}$$
where
$$A_{N-1}(x) = (x^2-1)\,U_{N-1}(x)\,U''_{N-1}(x) + 2x\,U_{N-1}(x)\,U'_{N-1}(x) - (x^2-1)\left(U'_{N-1}(x)\right)^2. \tag{8}$$
It is well-known (F. G. Tricomi, Vorlesungen über Orthogonalreihen, Springer-Verlag, 1955,
p. 188) that
$$(x^2-1)\,U'_{N-1}(x) = (N-1)\,x\,U_{N-1}(x) - N\,U_{N-2}(x), \tag{9}$$
and
$$(x^2-1)\,U''_{N-1}(x) = (N^2-1)\,U_{N-1}(x) - 3x\,U'_{N-1}(x). \tag{10}$$
Combining (8), (9) and (10) with a tedious calculation yields
$$(x^2-1)\,A_{N-1}(x) = (N-1)(x^2-N-1)\,U_{N-1}(x)^2 + N(2N-1)\,x\,U_{N-1}(x)\,U_{N-2}(x) - N^2\,U_{N-2}(x)^2. \tag{11}$$
Using
$$x\,U_{N-1}(x) = \left(U_N(x) + U_{N-2}(x)\right)/2$$
and
$$U_N(x)\,U_{N-2}(x) = U_{N-1}(x)^2 - 1,$$
(11) becomes
$$(x^2-1)\,A_{N-1}(x) = \left((N-1)(x^2-1) + \tfrac{N}{2}\right)U_{N-1}(x)^2 - \tfrac{N}{2}\,U_{N-2}(x)^2 - \tfrac{N(2N-1)}{2}.$$
Substituting the latter equation in (7) gives
$$S_N(x) = \frac{N}{2}\cdot\frac{U_{N-1}(x)^2 - U_{N-2}(x)^2 - 2N + 1}{(x^2-1)\,U_{N-1}(x)^2}$$
or, recalling that
$$2(x^2-1)\,U_{N-1}(x)^2 = T_{2N}(x) - 1$$
and
$$U_{N-1}(x)^2 - U_{N-2}(x)^2 = U_{2N-2}(x),$$
we finally obtain
$$S_N(x) = N\,\frac{U_{2N-2}(x) - 2N + 1}{T_{2N}(x) - 1}. \tag{12}$$

A straightforward calculation shows that (∗) is true for a = 0. Since cos(π - x) = -cos x,
it is easily seen that the value of $S_N$ does not change if a is replaced by -a. Hence, it
suffices to prove (∗) for 0 < a < 1. In (12) we take $x = (1 + a^2)/(2a)$. Then we have
$S_N(x) = 4a^2 S_N$. From (1) and (2) it follows that
$$T_{2N}(x) = \tfrac12\left(a^{2N} + a^{-2N}\right) \quad\text{and}\quad U_{2N-2}(x) = \frac{a}{1-a^2}\left(a^{-(2N-1)} - a^{2N-1}\right).$$
Now, (∗) easily follows from (12).
It should be noted that, except for $x \in \{\cos(k\pi/N) : k = 1, \ldots, N-1\}$, (12) is valid for
all complex numbers x. Taking $x = \cos\varphi$ in (12), using (3) and $T_{2N}(\cos\varphi) = \cos(2N\varphi)$, a
simple calculation yields
$$\sum_{k=1}^{N-1}\frac{\sin^2(k\pi/N)}{\left(\cos\varphi - \cos(k\pi/N)\right)^2} = N\left(N\csc^2(N\varphi) - \cot(N\varphi)\cot\varphi - 1\right),$$
valid for all φ with $\cos\varphi \notin \{\cos(k\pi/N) : k = 1, \ldots, N-1\}$. In particular, if N is odd, we
may take φ = π/2 to get
$$\sum_{k=1}^{N-1}\tan^2\frac{k\pi}{N} = N(N-1) \qquad (N = 3, 5, 7, \ldots).$$


There is no record of any other solution being received. 
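
The closed form can be compared numerically with the defining sum. The sketch below assumes the sum is $S_N = \sum_{k=1}^{N-1} \sin^2(k\pi/N)/[1 + a^2 - 2a\cos(k\pi/N)]^2$ and the closed form is $\frac{N}{2(1-a^{2N})}\left[\frac{1+a^{2N-2}}{1-a^2} - \frac{2N a^{2N-2}}{1-a^{2N}}\right]$, which is our reading of the solution; the function names are illustrative. It also checks the tangent identity for odd N.

```python
from math import cos, isclose, pi, sin, tan


def s_direct(N, a):
    """The finite sum S_N evaluated term by term (assumed form)."""
    return sum(
        sin(k * pi / N) ** 2 / (1 + a * a - 2 * a * cos(k * pi / N)) ** 2
        for k in range(1, N)
    )


def s_closed(N, a):
    """The closed form (*) as read from the solution (assumed form)."""
    q = a ** (2 * N)
    return N / (2 * (1 - q)) * (
        (1 + a ** (2 * N - 2)) / (1 - a * a) - 2 * N * a ** (2 * N - 2) / (1 - q)
    )


for N in range(2, 9):
    for a in (0.1, 0.5, 0.9, -0.3):
        assert isclose(s_direct(N, a), s_closed(N, a), rel_tol=1e-9)

for N in (3, 5, 7):
    total = sum(tan(k * pi / N) ** 2 for k in range(1, N))
    assert isclose(total, N * (N - 1), rel_tol=1e-9)

print(s_direct(5, 0.5), s_closed(5, 0.5))
```

For fixed a with |a| < 1 the powers of a die off, so the closed form behaves like N/(2(1 - a^2)) for large N, which is the asymptotic behavior the problem asks to exhibit.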


Everywhere Unimodal 


10243 [1992, 675]. Proposed by Michel Balazard, Université Bordeaux I, Talence, France. 


Define a sequence of functions $f_k(t)$ for t ≥ k recursively by
$$f_1(t) = 1, \qquad f_{k+1}(t) = \int_k^{t-1} f_k(u)\,\frac{du}{u}.$$
Prove that, for every real number t ≥ 1, the sequence $(f_k(t) : 1 \le k \le t)$ is unimodal.


Solution by Robin J. Chapman, University of Exeter, U. K. It is clear that $f_k(t) > 0$ for all
t > k. We claim that for each k ≥ 1 there exists $t_k$ in (k + 1, ∞] such that $f_k(t) > f_{k+1}(t)$
if $k + 1 \le t < t_k$ and $f_k(t) < f_{k+1}(t)$ if $t > t_k$. To establish this by induction on k, first
observe that $f_2(t) = \log(t-1)$, so $t_1 = 1 + e$. Then assume that k > 1 and that the inductive
hypothesis holds for k - 1. Consider the function $g_k$ defined by $g_k(t) = f_k(t) - f_{k+1}(t)$.
We have


$$g_k(t) = \int_{k-1}^{k} f_{k-1}(u)\,\frac{du}{u} + \int_k^{t-1}\left[f_{k-1}(u) - f_k(u)\right]\frac{du}{u}.$$


The function $g_k(t)$ is increasing for $k + 1 \le t \le t_{k-1} + 1$ and decreasing when $t > t_{k-1} + 1$.
Since it is positive for $k + 1 \le t \le t_{k-1} + 1$, it has at most one zero, at a value $t_k > t_{k-1} + 1$.
(If $g_k$ has no zero, then we put $t_k = \infty$ and $t_\ell = \infty$ for all ℓ > k.) This proves the claim
and also that $t_{k+1} > t_k + 1$ for all k.

To show unimodality it suffices to show "once a decrease always a decrease." Consider
a fixed t. If $f_k(t) > f_{k+1}(t)$, then $t < t_k$, and hence $t < t_{k+1}$. Thus $f_{k+1}(t) > f_{k+2}(t)$,
and the result follows.
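
A numerical look at the sequence can make the statement concrete. The sketch below assumes the recursion reads $f_1(t) = 1$, $f_{k+1}(t) = \int_k^{t-1} f_k(u)\,du/u$ (our reading of the problem) and tabulates the functions on a uniform grid with the trapezoid rule. The grid step, the function names, and the choice t = 6 are illustrative assumptions; the values are approximations only.

```python
from math import log


def f_table(t_max, k_max, steps_per_unit=200):
    """Tabulate f_1, ..., f_{k_max} on the grid t_j = 1 + j*h, with h = 1/steps_per_unit."""
    m = steps_per_unit
    h = 1.0 / m
    size = int(round((t_max - 1) * m)) + 1
    grid = [1 + j * h for j in range(size)]
    table = [[1.0] * size]                      # f_1 = 1
    for k in range(1, k_max):
        prev = table[-1]                        # values of f_k
        start = (k - 1) * m                     # grid index of t = k
        cum = [0.0] * size                      # cum[j] = integral of f_k(u)/u from k to t_j
        for j in range(start + 1, size):
            cum[j] = cum[j - 1] + 0.5 * h * (prev[j - 1] / grid[j - 1] + prev[j] / grid[j])
        nxt = [0.0] * size                      # entries with t < k + 1 are placeholders
        for j in range(k * m, size):            # t_j >= k + 1
            nxt[j] = cum[j - m]                 # upper limit of integration is t_j - 1
        table.append(nxt)
    return grid, table


def is_unimodal(seq):
    """True if seq is nondecreasing up to its maximum and nonincreasing afterwards."""
    peak = seq.index(max(seq))
    up = all(seq[i] <= seq[i + 1] for i in range(peak))
    down = all(seq[i] >= seq[i + 1] for i in range(peak, len(seq) - 1))
    return up and down


grid, table = f_table(6, 6)
values_at_6 = [row[-1] for row in table]        # approximations to f_1(6), ..., f_6(6)
print(values_at_6, is_unimodal(values_at_6))
print(table[1][2 * 200], log(2))                # f_2(3) against log 2
```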


1994] PROBLEMS AND SOLUTIONS 1017 


Editorial comment. The proposer's solution worked with the function $r_k$ defined by $r_k(t) =
f_{k+1}(t)/f_k(t)$, and showed that $r_k(t)$ is strictly increasing in t for t > k + 1. This led to
$f_{k+1}(t)^2 > f_k(t)\,f_{k+2}(t)$ for t > k + 2, hence the sequence $f_k(t)$ is logarithmically concave
for 1 ≤ k ≤ t.

The proposer introduced these functions to study the unimodality of the distribution of
the number of prime divisors of an integer. In fact, let $F_k(x)$ be the number of positive
integers n not exceeding x that have exactly k prime divisors and such that each of these
exceeds $x^{1/t}$. Then


$$f_k(t) = \lim_{x\to\infty} F_k(x) / (x/\log x).$$


For k = 1 and t > 1 this is the prime number theorem.

He also remarks that if $k_0(t)$ is any function such that $r_k(t) > 1$ for $k < k_0(t)$ and
$r_k(t) < 1$ for $k > k_0(t)$, then there is an elementary proof that $k_0(t) < \log t + 1$. The theory
outlined here leads to a lower bound on $k_0(t)$ that is also of the form log t + O(1) but he
claims that, "the proof is no longer elementary".


Solved also by the proposer.
An Odd Square 


10263 [1992, 873]. Proposed by J. G. Mauldon, Amherst College, Amherst, MA. 


Let m and n be odd integers, and suppose that $n^2 - 1$ is a multiple of $|m^2 + 1 - n^2|$.
Prove or disprove that this requires that $|m^2 + 1 - n^2|$ be the square of an integer.


Solution by the proposer. The implication holds, and it also holds if m, n are nonzero even
integers. Furthermore, $m^2 \ge n^2$, so that $m^2 + 1 - n^2$ itself is a square.

By hypothesis, we have $n^2 - 1 = t(m^2 + 1 - n^2)$ for some integer t, which we rewrite
as $m^2 = (t + 1)(m^2 + 1 - n^2)$. Let k = t + 1; this integer is nonzero. It suffices to show
that k is a square, since that will show that the integer $m^2 + 1 - n^2$ is a rational square.
However, the only integers that are squares of rationals are squares of integers.

Let $S_k$ be the set of integer ordered pairs (x, y) such that $(x + y)^2 = k(1 + 4xy)$. Because
$((m+n)/2, (m-n)/2) \in S_k$, the set is nonempty. Let $a = \min\{|x| : (x, y) \in S_k\}$. It suffices
to show that a = 0.

The set $S_k$ is invariant under negation and under interchange of coordinates. Hence a
belongs to a pair in $S_k$. The other member of such a pair must solve the quadratic equation
$(x + a)^2 = k(1 + 4ax)$. Hence the two roots $b_1, b_2$ must both be integers and have absolute
value at least a. From the quadratic equation, we have $b_1 + b_2 = 4ak - 2a$ and $b_1b_2 = a^2 - k$.
By direct computation, these yield $(a + b_1)(a + b_2) = (4a^2 - 1)k$.

If k < 0, then the equations above yield $b_1b_2 > 0$ and $b_1 + b_2 \le 0$. Hence each of
$b_1, b_2$ is negative. Since $a \le |b_i|$, this implies $a + b_i \le 0$, and hence $(4a^2 - 1)k \ge 0$. With
k < 0, this requires a = 0, but we have noted that a = 0 implies k is a square. Hence we
may assume k > 0.

If k > 0 and a > 0, the equations yield $b_1 + b_2 > 0$ and $(a + b_1)(a + b_2) > 0$, so
each of $b_1, b_2$ is positive. By the choice of a, we have $a^2 \le b_1b_2$, but also k > 0 implies
$b_1b_2 = a^2 - k < a^2$. The contradiction implies a = 0, and hence k is a square.


Editorial comment. The other correct solutions reduced the problem to solving a Pell
equation (i. e., a Diophantine equation of the form $u^2 - Dv^2 = 1$ for some non-square
positive integer D). In this case, D is the squarefree part of k(k - 1), and the special form
of D is exploited in the solution. The incorrect solution claimed that the conditions hold
only when m = ±n or n = ±1, which omits the solution (m, n) = (105, 99).
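
A brute-force search over small odd values illustrates both the statement and the example (105, 99). This is only a sketch: the function name and the search limit are arbitrary choices, and the search restricts to positive m and n, relying on the fact that only squares appear in the condition.

```python
from math import isqrt


def odd_square_pairs(limit):
    """Odd pairs (m, n) up to limit satisfying the divisibility, excluding m = n and n = 1."""
    found = []
    for m in range(1, limit + 1, 2):
        for n in range(1, limit + 1, 2):
            d = abs(m * m + 1 - n * n)      # nonzero: d is congruent to 1 mod 8 for odd m, n
            if (n * n - 1) % d == 0:
                assert isqrt(d) ** 2 == d   # the claimed conclusion: d is a perfect square
                if m != n and n != 1:
                    found.append((m, n))
    return found


pairs = odd_square_pairs(120)
print((105, 99) in pairs)   # True
```

For (105, 99) the divisor is 105^2 + 1 - 99^2 = 1225 = 35^2, and 99^2 - 1 = 9800 = 8 · 1225.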


Solved also by J. C. Binz (Switzerland), R. J. Chapman (U. K.), I. Kastanas, J. P. Robertson, and the GCHQ Problem 
Solving Group (U. K.). One incorrect solution was received. 
Polygons with Inscribed Circles


10303 [1993, 401]. Proposed by David E. Gurarie, Case Western Reserve University, Cleve- 
land, OH. 


Let $a_1, \ldots, a_n$ (n ≥ 3) be positive real numbers.
(a) Find necessary and sufficient conditions on $a_1, \ldots, a_n$ for there to exist a convex
n-gon which admits an inscribed circle and whose sides, in cyclic order, are $a_1, \ldots, a_n$.


(b) Find the radius of the inscribed circle. 


Solution by Richard Holzsager, The American University, Washington, DC. Suppose that
$V_0, V_1, \ldots, V_n = V_0$ are the vertices of a convex polygon with an inscribed circle, and with
side $V_{j-1}V_j$ of length $a_j$, j = 1, ..., n. Let $d_j$ be the distance from $V_{j-1}$ to the point of
tangency of the inscribed circle on $V_{j-1}V_j$. Then


$$a_j = d_j + d_{j+1} \tag{1}$$


(with $d_{n+1} = d_1$), and
$$d_j > 0. \tag{2}$$


The existence of such a set of numbers $d_j$ is also sufficient for the existence of an inscribed
circle. In fact, consider the function $f(r) = \sum \arctan(d_j/r)$, which is monotonically
decreasing for r > 0, from nπ/2 down to 0. If we choose the unique r with f(r) = π,
and form the polygon circumscribed about a circle of radius r, tangent at points around the
circle at successive angles $2\arctan(d_j/r)$, then the successive sides are the required $a_j$. It
remains to give criteria for the existence and positivity of the $d_j$.

If the $d_j$ exist, then the alternating sum $\sum (-1)^{j-1} a_j$ collapses: to 0 if n is even; to $2d_1$
if n is odd. In the odd case, $d_1$, and similarly all the $d_j$'s, are determined from the $a_j$. So
the solution, if it exists, is unique in this case. Furthermore, these alternating sums lead to
expressions for $d_j$ that clearly satisfy (1). Positivity of these alternating sums is therefore
necessary and sufficient for the existence of the circle.

In the even case, we can choose any $d_1$, and satisfy (1) by setting $d_{j+1} = a_j - d_j$ for
j = 1 to n - 1. The successive requirements for positivity can then be stated $d_1 > 0$,
$d_1 < a_1$, $d_1 > a_1 - a_2$, $d_1 < a_1 - a_2 + a_3$, etc. A necessary and sufficient condition is
therefore that the maximum of $0,\ a_1 - a_2,\ a_1 - a_2 + a_3 - a_4, \ldots$ be less than the minimum
of $a_1,\ a_1 - a_2 + a_3, \ldots$.

In a sense we already have the answer to part b, namely $r = f^{-1}(\pi)$, where $f(r) =
\sum \arctan(d_j/r)$. To make this a bit more tractable, note that f(r) is an integral multiple
of π iff $\Im\left[\prod (r + i d_j)\right] = 0$, or $\sigma_1 r^{n-1} - \sigma_3 r^{n-3} + \sigma_5 r^{n-5} - \cdots = 0$, where the $\sigma_j$ are the
symmetric functions of the $d_j$. To get f(r) = π, i.e., the smallest positive integral multiple
of π, the fact that f is decreasing shows that we need the largest root of the equation.

Also note that there is an obvious relation $A = r \sum a_j / 2$ between r and the area A of the
polygon, shown by cutting up into triangles $V_{j-1}V_jC$, where C is the center of the circle.
For a triangle, this relation, joined with the solution $r = \sqrt{\sigma_3/\sigma_1}$, gives Heron's formula
for the area in terms of the sides. 
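
For odd n the construction in the solution is easy to carry out numerically: recover the tangent lengths from the alternating sum, then solve $\sum \arctan(d_j/r) = \pi$ for r. The sketch below does this by bisection. The function names and the 3-4-5 test triangle are illustrative assumptions, and the code presumes the tangent lengths come out positive.

```python
from math import atan, pi, sqrt


def tangent_lengths_odd(sides):
    """Tangent lengths d_1, ..., d_n for an odd number of sides a_1, ..., a_n."""
    assert len(sides) % 2 == 1
    d1 = sum((-1) ** j * a for j, a in enumerate(sides)) / 2   # alternating sum equals 2*d_1
    d = [d1]
    for a in sides[:-1]:
        d.append(a - d[-1])                                    # d_{j+1} = a_j - d_j
    return d


def inradius(d, iterations=200):
    """Solve sum(arctan(d_j / r)) = pi for r by bisection (the sum decreases in r)."""
    lo, hi = 0.0, sum(d)
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if sum(atan(x / mid) for x in d) > pi:
            lo = mid        # angle sum too large: the radius must grow
        else:
            hi = mid
    return (lo + hi) / 2


d = tangent_lengths_odd([3, 4, 5])
r = inradius(d)
print(d, r)
# For a triangle the solution also gives r = sqrt(sigma_3 / sigma_1).
print(sqrt(d[0] * d[1] * d[2] / sum(d)))
```

The upper bracket sum(d) works because arctan(x) < x for x > 0, so the angle sum there is below 1 and hence below π.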


Editorial comment. Several solvers noted that a rhombus of side 2 allows 0 < r ≤ 1.
The strict inequality 0 < r is a consequence of the strict inequality in (2). This suggests
investigating the possible degenerate cases introduced by requiring only that the $d_j$ be
nonnegative. Clearly, $d_{j+1} = 0$ corresponds to both $V_{j-1}V_j$ and $V_jV_{j+1}$ lying on the
tangent to the circle at $V_j$. The angle at $V_j$ then degenerates to a straight angle and the
resulting polygon has fewer than n sides. On the other hand, Ilias Kastanas noted that the
case of n = 2k can be viewed as a degenerate case of n = 2k + 1. The choice of $d_1$ amounts
to locating an additional vertex on $V_0V_1$.
For n = 4, the algebraic expression for r as a function of $d_1$ is easily analyzed. However,
as in the case n = 3, r is proportional to area when the side lengths are given. The study
of possible values of r is thus a special case of the problem of finding extremes of area for
convex quadrilaterals with given edge lengths.


Solved also by R. Barbara (Lebanon), V. BoZin (student, Yugoslavia), R. J. Chapman (U. K.), B. N. Cheng (The 
Philippines), S. M. Gagola Jr., I. Kastanas, J. H. Lindsey II, O. P. Lossers (The Netherlands), H. Morris, A. Nijenhuis, 
R. M. Robinson, R. A. Simon (Chile), GCHQ Problem Solving Group (U. K.), and the proposer. 


REVIVALS 


A Permutation on the Cube 


6670 [1991, 862; 1993, 595]. Proposed by R. H. Jeurissen, Toernooiveld, Nijmegen, The 
Netherlands. 


Let $\{0, 1\}^n$ denote the set of n-bit strings of zeros and ones. If $(a_1, \ldots, a_n) \in \{0, 1\}^n$, let
$\pi_n(a_1, \ldots, a_n)$ be the string $(b_1, \ldots, b_n)$ given by $b_1 = a_1$ and $b_k = a_k + a_{k-1} \pmod 2$
for 1 < k ≤ n. Since $(a_1, \ldots, a_n)$ can be retrieved from $(b_1, \ldots, b_n)$, it is clear that $\pi_n$
is a permutation of $\{0, 1\}^n$. Determine the cycle structure of the permutation $\pi_n$, i.e., the
lengths of the cycles that occur and the number of cycles of each length.


Editorial comment. David Singmaster has noted that $\pi_n$ is a famous permutation that
may not be recognized here because of the emphasis on its cycle structure. If we let $B_n(k)$
denote the n-place binary representation of k for $0 \le k < 2^n$, then $G_n(k) = \pi_n(B_n(k)) =
B_n(k) \oplus B_n(\lfloor k/2 \rfloor)$, where ⊕ denotes the vector space addition in $\{0, 1\}^n$. The sequence
$G_n(0), G_n(1), \ldots, G_n(2^n - 1)$ is known as the Gray code. It gives a Hamiltonian circuit
on the n-cube.

Adjacent strings in the Gray code also give the possible moves in the Chinese Rings
puzzle. To solve the puzzle, one must find k from $G_n(k)$. Since k is easily recovered from
$B_n(k)$, it suffices to have a formula to invert $\pi_n$. Observations made in the published solution
give the formula $B_n(k) = G_n(k) \oplus G_n(\lfloor k/2 \rfloor) \oplus \cdots \oplus G_n(\lfloor k/2^{n-1} \rfloor)$. This method of
analyzing the puzzle is attributed to Louis A. Gros who published a pamphlet on the puzzle
in 1872.
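
On integers the permutation is a one-line bit operation, which makes the remarks above easy to test. The sketch below assumes the first coordinate of a string is the most significant bit, so that $\pi_n$ acts as k XOR (k shifted right by one); the function names are illustrative. It checks the one-bit-change property of the Gray code, the XOR formula recovering k, and tabulates cycle lengths for small n.

```python
def gray(k):
    """G(k) = B(k) XOR B(floor(k/2)) on integers."""
    return k ^ (k >> 1)


def cycle_lengths(n):
    """Map cycle length -> number of cycles of k -> gray(k) on the n-bit integers."""
    seen = set()
    counts = {}
    for start in range(2 ** n):
        if start in seen:
            continue
        length, k = 0, start
        while k not in seen:
            seen.add(k)
            k = gray(k)
            length += 1
        counts[length] = counts.get(length, 0) + 1
    return counts


n = 5
for k in range(2 ** n - 1):
    # Consecutive Gray code words differ in exactly one bit.
    assert bin(gray(k) ^ gray(k + 1)).count("1") == 1
for k in range(2 ** n):
    # B(k) = G(k) XOR G(floor(k/2)) XOR ... XOR G(floor(k/2^(n-1)))
    folded = 0
    for j in range(n):
        folded ^= gray(k >> j)
    assert folded == k

print(cycle_lengths(3))
```

For n = 3 the cycles are the fixed points 0 and 1, the 2-cycle (2 3), and the 4-cycle (4 6 5 7).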


Products of Nilpotent Matrices 
10200 [1992, 163; 1993, 807]. Proposed by Daniel Goffinet, St. Etienne, France. 


(a) Prove that a (square) matrix over a field F is singular if and only if it is a product of 
nilpotent matrices. 

(b) If F = C, prove that the number of nilpotent factors can be bounded independently 
of the size of the matrix. 


Editorial comment. It has been shown over an arbitrary field that: for M a singular matrix 
that is not a 2 by 2 nonzero nilpotent matrix, M = AB with A and B nilpotent. This 
result was known over the field of complex numbers, but the best result over a general field 
submitted as a solution of this problem led to a product of four factors. Pei Yuan Wu has 
submitted references to proofs of the general result. 
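
A smallest instance of the two-factor result: the singular matrix diag(1, 0) is a product of two nilpotent matrices. The example below is an illustration chosen here, not one taken from the cited papers, and the helper `matmul` is a plain list-based product written for self-containment.

```python
def matmul(X, Y):
    """Product of two square matrices given as lists of rows."""
    size = len(X)
    return [
        [sum(X[i][k] * Y[k][j] for k in range(size)) for j in range(size)]
        for i in range(size)
    ]


A = [[0, 1], [0, 0]]     # nilpotent: A*A = 0
B = [[0, 0], [1, 0]]     # nilpotent: B*B = 0
M = matmul(A, B)         # singular, but not nilpotent since M*M = M

zero = [[0, 0], [0, 0]]
assert matmul(A, A) == zero and matmul(B, B) == zero
print(M)                 # [[1, 0], [0, 0]]
```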

In T. J. Laffey, “Products of matrices” in Generators and Relations in Groups and Ge- 
ometries (A. Barlotti, E. W. Ellers, P. Plaumann & K. Strambach, eds.), Kluwer, Dordrecht, 
1991, 95-123, the space on which M acts is written as a direct sum of invariant subspaces
Vo ® V; with the action of M being nilpotent on Vo and nonsingular on V;. Using known 
factorizations of nilpotent matrices, and of nonsingular matrices, a factorization of a matrix 
similar to M is given. Several cases must be considered to give a complete construction. 

In A. R. Sourour, "Nilpotent factorizations of matrices", Linear Multilinear Algebra 31
(1992), 303-308, an inductive proof is used. The key is a general proposition on partitioning
matrices over a field F.


Proposition. Let $A \in M_n(F)$. Then A is similar to a matrix of the form


$$\begin{pmatrix} a & c \\ b & D \end{pmatrix}$$


with $a \in F$, $D \in M_{n-1}(F)$, rank D = rank A - 1, b ∈ Range D and c ∈ Range $D^t$ if and
only if $A^2 \ne 0$.


This proposition, combined with a study of matrices with $M^2 = 0$, achieves a factorization
into nilpotent matrices each of which has the same rank as M.


Collaborating editors: David F. Appleyard, Paul T. Bateman, Bruce C. Berndt, Duane
M. Broline, Barry W. Brunson, Frank S. Cater, Gulbank D. Chakerian, Underwood Dudley, 
Gerald A. Edgar, Michael A. Filaseta, Ira M. Gessel, Richard A. Gibbs, Jerrold R. Griggs, 
Douglas A. Hensley, John R. Isbell, Mourad E. H. Ismail, Murray Klamkin, Daniel J. Kleitman, 
Frederick W. Luttmann, Frank B. Miles, Richard Pfiefer, Stephen L. Portnoy, J. O. Shallit, John 
Henry Steelman, Kenneth B. Stolarsky, David E. Tepper, Douglas B. Tyler, Daniel Ullman, and 
William E. Watkins. 


In addition to the Collaborating editors, the following individuals served as Referees or 
Guest Editors for material considered for publication in 1994: George E. Andrews, Richard A. 
Askey, George Baloglou, Gilbert Baumslag, Jozsef Beck, Grahame Bennett, Peter B. Borwein, 
Ezra A. Brown, David G. Cantor, Bille Carlson, Sagun Chanillo, Gregory L. Cherlin, J. Brian 
Conrey, John H. Conway, David A. Cox, Dennis DeTurck, Persi Diaconis, Vladimir Drobot, 
Harold M. Edwards, Noam Elkies, Edgar A. Feldman, Joseph A. Gallian, Fred Galvin, Mur- 
ray Gerstenhaber, Bart Goddard, Gene H. Golub, Richard F. Gundy, Richard K. Guy, Aimo 
Hinkkanen, Roger Horn, James E. Humphreys, Geoffrey A. Kandall, J. H. B. Kemperman, 
Clark Kimberling, Victor Klee, Solomon Leader, James I. Lepowsky, Eugene M. Luks, Feng
Luo, Victor S. Miller, Howard Morris, Benjamin Muckenhoupt, Melvyn B. Nathanson, Roger D. 
Nussbaum, Ingram Olkin, M. J. Pelling, Carl Pomerance, Stanley Rabinowitz, Mizanur Rah- 
man, M. Rychlik, Michael Saks, Lawrence A. Shepp, Richard P. Stanley, Kenneth B. Stolarsky, 
Gilbert Strang, Howard M. Taylor, Charles H. Toll, Jerrold B. Tunnell, Charles L. Vanden Eynden,
Wolmer V. Vasconcelos, Bertram Walsh, Edward T. H. Wang, William C. Waterhouse,
Gregory P. Wene, Richard Wheeden, Herbert S. Wilf, Eduard Wirsing, and Doron Zeilberger. 
The editors wish to thank them for their help in preparing this column. 




REVIEWS 


Edited by Darrell Haile 
Indiana University, Bloomington IN 47405 


Linear Programs and Related Problems, by Evar D. Nering and Albert W. 


Tucker, Academic Press, San Diego, 1993; 578 + pp 


Reviewed by Stephen B. Maurer 


This undergraduate text is a joint effort by the author of several fine texts (Nering) 
and one of the grand old men of mathematics and a former MAA President 
(Tucker). It should be on the shelf of anyone who teaches linear programming, and 
it invites careful consideration for classroom adoption. It provides an approach 
that deserves to be better known, for it combines theory and practice and makes 
key ideas (notably duality) particularly transparent. Most faculty will learn at least 
as much from the book as students will. 

Linear programming (LP) is the study of optimizing linear functions subject to 
linear inequality constraints. The subject is blessed to have both beautiful theory 
and myriad applications. Consequently, there are many LP texts of several varieties.
Most common are books with an Operations Research orientation, such as
Hillier and Lieberman [6] and Bradley, Hax and Magnanti [2]. These books regard 
LP as a tool for modeling. The theory is usually there, but as a sideshow. They 
emphasize how to recognize and interpret a linear program in a real-world 
situation and how to apply algorithmic implementations that are efficient and 
numerically stable. 

In contrast, books more mathematical in flavor tend to emphasize the theory 
and structure of the subject. They often dwell on the connection to n-dimensional 
polyhedra, and either downplay algorithms or highlight the more theoretical 
algorithmic issues from complexity theory. Books written in this vein are typically 
at the graduate level. A fine recent example is Schrijver [12]. Older and less 
advanced is Gass [5]. 

There are also LP books written by economists, which emphasize the economic 
applications and interpretations. Two classics are Dorfman, Samuelson and Solow 
[4] and Baumol [1]. 

This division of texts recapitulates history. When George Dantzig, at the 
Pentagon after World War II, was first able to secure funding to promote this new 
subject, three groups got started: one under Tucker at Princeton to develop the 
theory, another under Koopmans at Chicago to explore connections with eco- 
nomics, and the third under Dantzig to explore algorithms, specifically, the simplex 
algorithm. For more on the history, see [8, 9, 13]. 

Note the schism, deliberately introduced by the players themselves, between 
theory and practice. It doesn’t have to be that way. LP theory can be developed 
from the algorithms, and you can delight in both from the start. A key strength of 
this book is that it shows how. For instance, suppose an algorithm to compute the 
optimum of a function f has the properties that 


—it has only a finite number of states 

—it does not cycle among states 

—if its current state neither provides an optimal value of f nor shows that f is 
unbounded, then the algorithm passes to another state. 


Then it is a mere observation that f must either attain an optimum or be 
unbounded. Yet it is just this observation that one may use with the simplex 
algorithm to show that any feasible linear program either attains an optimum or is 
unbounded. (Feasible means the domain of f is nonempty.) 

As the authors put it (p. 126) 


Many people think of an algorithm as a method for finding a solution to a problem for which one 
already knows, by some other means, that a solution exists. We look on algorithms as much more 
than that and we use them for much more than that. In many cases an algorithm can supply the 
proof that the desired result exists. ... 


In other words, Nering and Tucker are advocates of proof by algorithm, a 
refinement of the old idea of constructive proof. 

Today, proof by algorithm is a well-known concept. Certainly in the field of 
combinatorial optimization it is the preferred method of proof—in this field 
existence proofs are not considered more esthetic. But since this preference is a 
rather drastic change from the mathematical esthetic of mid century, one can ask 
how the change came about. A reasonable assumption is that it came from 
Computer Science. However, it may be that Tucker had a lot to do with it. He 
started turning to the algorithmic viewpoint in the mid 1950s (see the interview 
with him [10]), and his group at Princeton included or interacted with almost all 
the early workers in combinatorial optimization. There is a fine math history 
research project waiting here—how did this new esthetic come about? 

A second great strength of this book is its treatment of duality. Linear programs 
come in pairs. To take the canonical case, if the “primal” problem is to maximize 
cx subject to Ax ≤ b and x ≥ 0, then the dual is to minimize vb subject to vA ≥ c 
and v ≥ 0. (Lowercase letters are vectors, uppercase matrices.) Dual problems 
interact. For instance, if both problems are feasible, then it turns out both attain 
optima and the optimum values are equal. Duality has many important conse- 
quences, e.g., a lazy supervisor test for checking a claimed optimum, alternative 
algorithms, shadow prices in economics. 
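
The “lazy supervisor” test rests on weak duality: for any primal-feasible x and dual-feasible v, cx ≤ (vA)x = v(Ax) ≤ vb, so a feasible pair with cx = vb certifies that both are optimal. A minimal sketch of such a check follows; the function name certifies_optimum and the small example data are illustrative assumptions, not taken from the book.

```python
from fractions import Fraction


def certifies_optimum(A, b, c, x, v):
    """Return True if (x, v) is a feasible primal/dual pair with cx == vb.

    Primal: maximize cx subject to Ax <= b, x >= 0.
    Dual:   minimize vb subject to vA >= c, v >= 0.
    By weak duality, equality of the two objective values proves optimality.
    """
    m, n = len(A), len(c)
    primal_ok = all(xj >= 0 for xj in x) and all(
        sum(A[i][j] * x[j] for j in range(n)) <= b[i] for i in range(m)
    )
    dual_ok = all(vi >= 0 for vi in v) and all(
        sum(v[i] * A[i][j] for i in range(m)) >= c[j] for j in range(n)
    )
    cx = sum(c[j] * x[j] for j in range(n))
    vb = sum(v[i] * b[i] for i in range(m))
    return primal_ok and dual_ok and cx == vb


# Hypothetical example: maximize 3*x1 + 5*x2
# subject to x1 <= 4, 2*x2 <= 12, 3*x1 + 2*x2 <= 18, x >= 0.
A = [[1, 0], [0, 2], [3, 2]]
b = [4, 12, 18]
c = [3, 5]
claimed_x = [2, 6]
claimed_v = [0, Fraction(3, 2), 1]
print(certifies_optimum(A, b, c, claimed_x, claimed_v))
```

The supervisor never has to rerun the algorithm: checking a claimed pair costs a handful of multiplications, whereas finding it may have taken many pivots.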

All LP texts cover duality, but usually as a somewhat mysterious add-on. This 
too mimics history, but there is a better way. Dual programs can be introduced 
simultaneously by the use of a special representation, the condensed or Tucker 
tableau: 


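\[
\begin{array}{r|cc|l}
     & x   & -1  &      \\ \hline
 v   & A   & b   & = -y \\
 -1  & c   & d   & = f  \\ \hline
     & = u & = g &
\end{array}
\tag{1}
\]
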
The variables on the top multiply the columns to obtain the variables on the right, 
i.e., Ax + (−1)b = −y, or equivalently, Ax + y = b, or Ax ≤ b if all variables are 
nonnegative. These row equations are the constraints, and the basement row 
equation, cx − d = f, is the equation for the “objective” function f to be maxi- 
mized. This is the primal problem. The variables on the left and bottom give the 
dual: minimize vb − d = g subject to vA − c = u, or equivalently, vA ≥ c. 

In brief, to develop the theory from (1), one shows by elementary linear algebra 
that the simplex algorithm just exchanges variables on the top and right, so that 
different variables become “basic” when the variables on the top are set to 0. Of 
course, the entries in the tableau must be updated too, so that the equal signs on 
the right are still correct after variables are exchanged. Marvelously, the same 
algebra maintains the equal signs on the bottom as the dual variables are 
exchanged. The conditions that make the basic solution of a tableau optimal are 
simple to understand and describe—the entries in b must be nonnegative and 
those in c nonpositive. By (skew) symmetry these same conditions describe 
optimality for the dual. Thus a condensed tableau can exhibit optimality for both 
problems simultaneously and, since x and v are 0 for a basic solution, the objective 
functions f and g have the same value, −d. 
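
The exchange step described above can be sketched in a few lines. The code below is an illustration under stated assumptions rather than the book's own software: it stores the condensed tableau as one (m+1) × (n+1) array [[A, b], [c, d]], assumes b ≥ 0 so that the starting basic solution is feasible, and uses a smallest-index tie-breaking rule (Bland's rule) to avoid cycling. The names pivot and simplex_tucker are hypothetical. With the sign convention cx − d = f, the common objective value at a basic solution is read from the corner entry as −d.

```python
from fractions import Fraction


def pivot(T, r, s):
    """Exchange the top variable of column s with the right variable of row r."""
    p = T[r][s]
    new = [[None] * len(T[0]) for _ in T]
    for i in range(len(T)):
        for j in range(len(T[0])):
            if i == r and j == s:
                new[i][j] = 1 / p                    # pivot entry
            elif i == r:
                new[i][j] = T[i][j] / p              # rest of pivot row
            elif j == s:
                new[i][j] = -T[i][j] / p             # rest of pivot column
            else:
                new[i][j] = T[i][j] - T[i][s] * T[r][j] / p
    return new


def simplex_tucker(A, b, c):
    """Maximize cx subject to Ax <= b, x >= 0, assuming b >= 0.

    Returns (x, v, value): a primal optimum, a dual optimum, and their
    common objective value. Raises ValueError if the program is unbounded.
    """
    m, n = len(A), len(c)
    T = [[Fraction(a) for a in row] + [Fraction(bi)] for row, bi in zip(A, b)]
    T.append([Fraction(cj) for cj in c] + [Fraction(0)])  # basement row, d = 0
    top = list(range(n))           # ids 0..n-1 stand for x_1..x_n
    right = list(range(n, n + m))  # ids n..n+m-1 stand for y_1..y_m
    while True:
        # Optimal once every entry of c is nonpositive (b stays nonnegative).
        candidates = [j for j in range(n) if T[m][j] > 0]
        if not candidates:
            break
        s = min(candidates, key=lambda j: top[j])
        rows = [i for i in range(m) if T[i][s] > 0]
        if not rows:
            raise ValueError("objective is unbounded")
        r = min(rows, key=lambda i: (T[i][n] / T[i][s], right[i]))
        T = pivot(T, r, s)
        top[s], right[r] = right[r], top[s]
    x = [Fraction(0)] * n
    for i, var in enumerate(right):
        if var < n:
            x[var] = T[i][n]           # basic primal variables read from b
    v = [Fraction(0)] * m
    for j, var in enumerate(top):
        if var >= n:
            v[var - n] = -T[m][j]      # dual values read from the basement row
    return x, v, -T[m][n]


# Hypothetical example: maximize 3*x1 + 5*x2
# subject to x1 <= 4, 2*x2 <= 12, 3*x1 + 2*x2 <= 18, x >= 0.
x, v, value = simplex_tucker([[1, 0], [0, 2], [3, 2]], [4, 12, 18], [3, 5])
print(x, v, value)
```

On this small example the final tableau yields both the primal solution and the dual solution from the same array, which is the point of keeping the left and bottom labels.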

Almost all LP books use tableaus, but most use some version of extended form, 
where A = [a_ij] in (1) gets replaced by [A I] and the “slack” variables y in (1) 
move to the columns of I. But it is much harder to even see a dual, let alone 
develop its theory, in these formats. A few books do use condensed tableaus, but 
without the variables on the left and bottom, which is the key to handling duality. 
To my knowledge, the only other texts with true condensed tableaus are Kemeny, 
Snell and Thompson’s “Finite Mathematics”, third edition only [7], and Rothen- 
berg [11]. 

Part I of Nering and Tucker develops linear programming itself, along the lines 
described above. Part II treats various related problems—related in that most can 
be stated as LPs but their special form allows for special algorithms. The emphasis 
on proof by algorithm continues, and duality is often used to clarify the algorithms 
(but not as often as it could be). For instance, the minimax theorem of matrix 
games is reduced to the existence-duality theorem of LP. Kuhn’s Hungarian 
method for the minimum weight assignment problem is shown to work by increas- 
ing the sum of the dual variables until the primal and dual objective functions 
are equal. The same approach is used for analyzing Dantzig’s algorithm for 
the transportation problem. Other topics include various network problems— 
transshipment, maximum flow, shortest path—and an introduction to nonlinear 
programming—the Karush-Kuhn-Tucker Theorem (still known as the Kuhn-Tucker 
theorem to many) and various quadratic programs. 

Tucker once said that his goal in mathematics has always been to “unify and 
simplify”. The fruits of this attitude appear in this book. It’s a shame that his 
condensed tableau format is not better known. 

These days most talk in LP circles is about completely different approaches: 
Khachian’s 1979 ellipsoid algorithm, Karmarkar’s 1984 interior point method, and 
more recent variations. Khachian’s algorithm was important because it is polyno- 
mial: there is a polynomial P so that his algorithm will solve every LP problem in 
P(n) steps, where n is the amount of input data. However, in practice the ellipsoid 
algorithm is much slower than simplex. (The simplex algorithm takes linear time 
on average but special cases take exponential time.) Karmarkar’s algorithm is not 
only polynomial, but fast in practice on at least some sorts of problems. The jury is 
still out on what is the best commercial method. 
Nering and Tucker are thin, and mostly nontechnical, on the new methods. 
For Khachian’s algorithm, no algebraic description or analysis is given, nor are 
there numerical examples or homework problems. For Karmarkar’s algorithm, 
some algebraic detail is given, one iteration of one problem is shown, and there 
is one homework problem. (This material appears in the final chapter of Part I, 
which also introduces numerical issues and provides ties to other simplex ap- 
proaches.) 

This thinness is understandable. There are no small, complete examples for the 
new methods, and their theory is much more complicated than for the simplex 
method. Also, no one has shown a nice way to develop the existence and duality 
theory of LP from these algorithms. Since building the theory from algorithms is a 
key theme of this book, the authors rightly emphasize simplex. 

Nonetheless, since the new methods are at least part of the future, it would be 
good for students to get more than a passing nod at them even in a first course. 
Software seems called for. A program with an option to hide the numbers and 
show pictures might be best. To my knowledge, no text has appeared that takes 
this approach. 

How will students like this book? The authors have made LP theory simple (but 
still rigorous!) by reducing it to displays and arguments that are carried forward by 
linear equations and numerical calculations. The writing is clear, straightforward 
and leisurely, but also somewhat bland. The book is written more in essay style 
than in textbook chunk style—example, theorem, proof, sidebar, vignette, etc. 
Much as this chunk style has been criticized, students are used to it and it does 
allow them clear touchstones and stopping points in what is usually difficult 
material for them. In Nering and Tucker, important points often appear in the 
middle of paragraphs in pages full of text. 

Based on my experience teaching from a preliminary version some years ago, 
and more recent reports, I would say that students won’t find the book hard to 
understand—the usual math book complaint—but they won’t find it exciting 
either. It can’t be very neat mathematics if it is just about linear equations and a 
lot of number manipulation, can it? Of course it can, but the teacher’s biggest job 
in using this text will be to explain to students why. Alas, if you make something 
sufficiently simple for readers who don’t know that it used to be complicated, they 
won’t appreciate what they have! 

A very nice set of real-world-like examples and problems is introduced in the 
first chapter: problems too simplified to be really applied, but they show the way. 
In later chapters, the problems (though not the examples) are mostly purely 
mathematical. Problems appear at the ends of chapters only. A rather complete set 
of solutions is supplied in the back, an unusual feature. Also, menu-driven 
software is provided with the book—for PCs, but a Macintosh version is due by the 
end of 1993. With this software one can do the arithmetic for all the problems, in 
the format used by the book, either all at once, or step by step. The data for all the 
problems is already read in. I found this software serviceable but not effortless. 
The instructions are terse. 

Readers intrigued by this book might also look at Chvatal [3]. It is like-spirited 
in doing theory, algorithmics and applications simultaneously, for a similar audi- 
ence. It has a somewhat livelier format and style, and a large variety of problems. 
It covers many more topics and is more advanced (however, much advanced 
material is identified with small print and can be skipped). On the other hand, 
Chvatal does not have Tucker tableaus (or any tableaus), and duality, though done 
early, is not present from the start. 
To conclude, a founding father of mathematical programming, involved in it for 
45 years, has put the full maturity of his perspective into this book. His perspective 
is quite personal, and the excitement and beauty seen by both authors may not 
come across fully to you or your students. But if you teach LP you impoverish 
yourself by not taking a look. 


Note: Prof. Maurer was a Ph.D. student of Tucker, and learned LP from him. 
The real danger is not that computers will 
begin to think like men, but that men will 
begin to think like computers. 


— Sydney J. Harris 
Howard W. Eves, Return to Mathematical Circles, 
Boston: Prindle, Weber and Schmidt, 1988. 


1026 REVIEWS [December 


TELEGRAPHIC REVIEWS 


Edited by Arnold Ostebee and Paul Zorn 


with the assistance of the Mathematics Departments of 
Carleton, Macalester, and St. Olaf Colleges 


Telegraphic Reviews are designed to alert readers in a timely manner to new books 
and computer software appropriate to mathematics teaching and research. Special 
codes classify reviews by subject area and appropriate use: 


T: Textbook 
C: Computer Software 
S: Supplementary Reading 
P: Professional Reading 
L: Undergraduate Library 
13: Grade Level 
1-4: Semester 
**: Special Emphasis 
??: Questionable 

Readers are advised that price information is subject to change. Selected books and 
software packages receive a second, more extensive review in the Monthly. 

Books and software submitted for review should be sent to Book Reviews Editor, 
American Mathematical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield, 
MN 55057-1098. 


General, L. Intersections. JoAnne Growney. 
Kadet Pr (Bloomsburg, PA 17815-1758), 1993, 
56 pp, $6.95 (P). [ISBN 0-9637964-0-2] A 
slim book of spare, accessible, sometimes el- 
egant poems. About 1/4 are on mathematical 
subjects, the rest on other aspects of an interest- 
ing, varied life. PZ 


General, L. Helaman Ferguson: Mathematics 
in Stone and Bronze. Claire Ferguson. Merid- 
ian Creative Group (5178 Station Rd, Erie, PA 
16510), 1994, xv + 79 pp, $39.95. [ISBN 0- 
9639121-0-0] Gorgeous photographs, artistic 
commentaries (by sculptor’s wife), and math- 
ematical discussions (by sculptor) of 29 strik- 
ing works by mathematician/sculptor Helaman 
Ferguson. Though inspired mathematically, 
sculptures go beyond illustration, incorporating 
physical and natural forms and materials. PZ 


Mathematics Appreciation, S(13). Fractal 
Worlds. Susan Brendel. J Weston Walch, 1994, 
iii + 4 pp, $14.95 poster set (w/black-line mas- 
ters). [ISBN 0-8251-2464-6] Four 17” x 22” 
color posters on classical fractals, the Mandel- 
brot set, Julia sets, and fractal universes. Only 
the classical fractals poster (Cantor dust, Sier- 
pinski carpet and gasket, Koch island) gives 
enough information for understanding. MW 


Precalculus, T*(13: 1, 2). Precalculus: 
Functions and Graphs: A Graphing Approach. 
Roland E. Larson, Robert P. Hostetler, Bruce 
H. Edwards. DC Heath, 1994, xxxii + 973 pp. 
[ISBN 0-669-35206-3] Visualization empha- 
sis: many graphs and graphical problems; notes 
on technology use. Includes discussion and 
writing problems, sections on exploring data 
and applications. TH 


Education, S(16-18), P*. Learning Activi- 
ties from the History of Mathematics. Frank 
J. Swetz. J Weston Walch, 1994, viii + 269 pp, 
$23.95 (P). [ISBN 0-8251-2264-3] Excellent 
resource for secondary school teachers. Bio- 
graphical sketches (with library investigation 
questions) of 23 mathematicians from Thales 
to Bourbaki. Student activity sheets explore 
historical problems. Wealth of quotations and 
extensive resource list. MW 


Education, S(15-17). Teaching with Student 
Math Notes, Volume 2. Ed: Evan M. Maletsky. 
NCTM, 1993, ix + 123 pp, $14.50 (P). [ISBN 
0-87353-369-0] 20 issues of Student Math 
Notes from 1986-1990 NCTM News Bulletin. 
One theme per 4-page issue. Teacher notes in- 
clude solutions, extensions. Perforated pages 
permit classroom duplication. Classroom- 
tested ideas for secondary teachers. MW 


History, S, L. A History of Vector Analysis: 
The Evolution of the Idea of a Vectorial System. 
Michael J. Crowe. Dover, 1994, xvii + 270 pp, 
$7.95 (P). [ISBN 0-486-67910-1] Corrected 
reprint of 1985 edition (TR, February 1987). LC 


History, P. NBS-INA—The Institute for Nu- 
merical Analysis—UCLA 1947-1954: Mathe- 
maticians Learning To Use Computers. Mag- 
nus R. Hestenes, John Todd. NIST Special Pub., 
No. 730. US Government Printing Office (Supt. 
of Documents, Washington, DC 20402), 1991, 
ix + 471 pp, $12.50 (P). The INA was set up to 
ensure that some mathematicians could handle 
the Turing computer. This history stresses re- 
search and educational aspects of the INA. DH 


History, S(16-18), P, L. A History of Complex 
Dynamics: From Schröder to Fatou and Julia. 
Daniel S. Alexander. Aspects of Math., V. E24. 
Friedr Vieweg & Sohn, 1994, viii + 165 pp, 
$42. [ISBN 3-528-06520-6] History of iter- 
ation of complex maps from Schröder’s 1870 
paper (examining Newton’s method) through 
works of Julia, Fatou, and Montel. Good read- 
ing for graduate students and mathematicians— 
specialists and nonspecialists alike. KS 


History, S(13-16), P, L. George Green: Math- 
ematician and Physicist 1793-1841: The Back- 
ground to his Life and Work. D.M. Cannell. 
Athlone Pr, 1993, xxvi + 265 pp, $70. [ISBN 0- 
485-11433-X] A complete picture of Green’s 
life and education. His contributions to math- 
ematical physics are huge, but relatively little 
known. A miller, untrained in mathematics un- 
til age 40, Green remained a scientific outsider. 
Author has devoted decades to restoring Green’s 
mill and to studying his life and work. KS 


History, P*, L. Möbius and his Band: Math- 
ematics and Astronomy in Nineteenth-century 
Germany. Eds: John Fauvel, Raymond Flood, 
Robin Wilson. Oxford Univ Pr, 1993, 172 pp, 
$29.95. [ISBN 0-19-853969-X] In 1810 Ger- 
many was a scientific backwater; by 1860 it 
led the world. In this period Möbius rose from 
undergraduate to full professor and observatory 
director at Leipzig. The first five essays de- 
scribe the flowering of German science, treat- 
ing Möbius’ life and work as archetype. The 
final essay surveys highlights of modern celes- 
tial mechanics/chaotic dynamics. SK 


Logic, P, L. Metaphysical Myths, Mathemat- 
ical Practice: The Ontology and Epistemology 
of the Exact Sciences. Jody Azzouni. Cam- 
bridge Univ Pr, 1994, ix + 249 pp, $54.95. 
[ISBN 0-521-44223-0] Argues against math- 
ematics imitating practices of the empirical sci- 
ences. Backbone of mathematical practice is 
seen as algorithmic development, governed by 
axioms. Mathematics’ primary tool—proof— 
distinguishes it from other sciences. RM 


Discrete Mathematics, T*(13), C, L. Dis- 
crete Dynamical Modeling. James T. Sandefur. 
Oxford Univ Pr, 1993, xiii + 428 pp, $39.95. 
[ISBN 0-19-508438-1] Elementary introduc- 
tion, suitable for general audiences. Topics 
include first- and higher-order systems, prob- 
abilistic models, and systems. Excellent exam- 
ples and problems from many areas. MPR 


Discrete Mathematics, P. Selected Topics 
in Discrete Mathematics. Ed: A.K. Kelmans. 
Transl., Ser. 2, V. 158. AMS, 1994, xii + 
221 pp, $79. [ISBN 0-8218-7509-4] 19 pa- 
pers, on disparate subjects, from the Moscow 
Discrete Mathematics Seminar (1972-1990). 


Number Theory, S(18), P. Catalan’s Conjec- 
ture: Are 8 and 9 the Only Consecutive Powers? 
Paulo Ribenboim. Academic Pr, 1994, xv + 
364 pp, $64.95. [ISBN 0-12-587170-8] Au- 
thor recaps results on a variety of Diophantine 
problems. Elementary, algebraic, and analytic 
approaches are described. SG 


Number Theory, P. Motives. Eds: Uwe 
Jannsen, Steven Kleiman, Jean-Pierre Serre. 
Proc. of Symp. in Pure Math., V. 55, Parts 1 & 
2. AMS, 1994, $250 set [ISBN 0-8218-1635- 
7]; Part 1, xiv + 747 pp, $140; Part 2, xiv + 
676 pp, $129. 47 papers (introductions, spe- 
cialized surveys, and research reports) from a 
1991 AMS-IMS-SIAM Summer Research Con- 
ference at the University of Washington. 


Number Theory, P. Décomposition Spec- 
trale et Séries d’Eisenstein: Une Paraphrase de 
l’Écriture. Colette Moeglin, Jean-Loup Wald- 
spurger. Prog. in Math., V. 113. Birkhäuser, 
1994, xxix + 320 pp, $98. [ISBN 0-8176-2938- 
6] 

Group Theory, T(17-18: 1, 2), P**. Compu- 
tation with Finitely Presented Groups. Charles 
C. Sims. Ency. of Math. & Its Applic., V. 48. 
Cambridge Univ Pr, 1994, xiii + 604 pp, $99.95. 
[ISBN 0-521-43213-8] Clear, extensive, thor- 
ough, accessible. Treats rewriting systems; 
automata and rational languages; coset enu- 
meration; the Reidemeister-Schreier procedure; 
computations with Abelian groups, polycyclic 
groups, quotient groups. DP 


Group Theory, P. Representations of Solvable 
Groups. Olaf Manz, Thomas R. Wolf. London 
Math. Soc. Lect. Note Ser., V. 185. Cambridge 
Univ Pr, 1993, xi + 302 pp, $39.95 (P). [ISBN 
0-521-39739-1] 

Algebra, P. Commutative Algebra: Syzygies, 
Multiplicities, and Birational Algebra. Eds: 
William J. Heinzer, Craig L. Huneke, Judith 
D. Sally. Contemp. Math., V. 159. AMS, 1994, 
vii + 444 pp, $61 (P). [ISBN 0-8218-5188-8] 
Proceedings of a 1992 AMS-IMS-SIAM Sum- 
mer Research Conference. 

Algebra, T(15-16: 1), C, L. Learning Ab- 
stract Algebra with ISETL. Ed Dubinsky, Uri 
Leron. Springer-Verlag, 1994, xix + 252 pp, 
$49, with disk. [ISBN 0-387-94152-5] Con- 
sistent with the authors’ view “that people learn 
best by doing and thinking about what they do,” 
text treats basic algebraic structures, interwo- 
ven tightly with the use of ISETL (a mathemat- 
ical programming language) to spur discovery 
and understanding. After introducing ISETL, 
works through groups, subgroups, and the fun- 
damental homomorphism theorem; then repeats 
process for rings. Concludes with factorization 
in integral domains, construction of splitting 
fields. Less formal depth than in some other 
currently popular texts. Stresses computer ac- 
tivities and exercises. JS 


Algebra, P. Discriminants, Resultants, and 
Multidimensional Determinants. I.M. Gelfand, 
M.M. Kapranov, A.V. Zelevinsky. Math.: The- 
ory & Applic. Birkhäuser, 1994, x + 523 pp, 
$74.50. [ISBN 0-8176-3660-9] Studies dis- 
criminants and resultants of polynomials in sev- 
eral variables. Main approaches are geomet- 
ric (projective duality and associated hypersur- 
faces), algebraic (homological algebras and de- 
terminants of complexes), combinatorial (New- 
ton polytopes and triangulations). LC 


Calculus, T(13: 1-3). Calculus, Sixth Edi- 
tion. Earl W. Swokowski, et al. PWS, 1994, 
xxiv + 1384 pp. [ISBN 0-534-93624-5] New 
in this edition: more use of graphing technol- 
ogy; “Extended Problems and Group Projects” 
end chapters; biographical/historical sketches. 
(Fifth Edition, TR, May 1991.) AO 


Calculus, T*(14: 1). Basic Multivariable Cal- 
culus. Jerrold E. Marsden, Anthony J. Tromba, 
Alan Weinstein. Springer-Verlag & WH Free- 
man, 1993, xv + 533 pp, $39.95. [ISBN 3- 
540-97976-X; 0-7167-2443-X] Multivariable 
and vector calculus through theorems of Green, 
Gauss, and Stokes. Emphasizes intuitive under- 
standing, computational skill. Careful exposi- 
tion; many physical examples. AO 


Real Analysis, T(17-18: 1), S, P, L. A Concise 
Introduction to the Theory of Integration, Sec- 
ond Edition. Daniel W. Stroock. Birkhäuser, 
1994, viii + 184 pp, $24.50. [ISBN 0-8176- 
3759-1] Treats both modern integration the- 
ory and advanced calculus. To blend the two, 
much space is devoted to ℝⁿ-theory. New to 
this edition: two new sections on aspects of 
Lebesgue theory; minor reordering of material; 
solutions to some exercises. KS 


Real Analysis, T(16-18), P, L. The Rie- 
mann Approach to Integration: Local Geomet- 
ric Theory. Washek F. Pfeffer. Tracts in 
Math., V. 109. Cambridge Univ Pr, 1993, xv 
+ 302 pp, $49.95. [ISBN 0-521-44035-1] A 
detailed account of the McShane and Henstock- 
Kurzweil integrals (both generalized Riemann- 
Stieltjes integrals) and their relation. Classi- 
cal results and recent developments: lipeomor- 
phic change of variables, higher-dimensional 
multipliers, divergence theorem for discontin- 
uously differentiable vector fields, etc. One- 
dimensional theory treated first. Worthwhile 
exercises are integrated into exposition. KS 


Partial Differential Equations, S, P. PLTMG: 
A Software Package for Solving Elliptic Partial 
Differential Equations: Users’ Guide 7.0. Ran- 
dolph E. Bank. Front. in Appl. Math., V. 15. 
SIAM, 1994, xii + 128 pp, $24.50 (P). [ISBN 0- 
89871-330-7] Reference manual for PLTMG, 
a piecewise-linear finite element multigrid 
package, freely available from Netlib. AO 


Partial Differential Equations, P. Nonlinear 
Nonlocal Equations in the Theory of Waves. P.I. 
Naumkin, I.A. Shishmarev. Transl. of Math. 
Mono., V. 133. AMS, 1994, ix + 289 pp, $149. 
[ISBN 0-8218-4573-X] 


Partial Differential Equations, T(18: 1), S, 
P. Microlocal Analysis for Differential Opera- 
tors: An Introduction. Alain Grigis, Johannes 
Sjöstrand. London Math. Soc. Lect. Note Ser., 
V. 196. Cambridge Univ Pr, 1994, 151 pp, 
$29.95 (P). [ISBN 0-521-44986-3] Assumes 
basics of distribution theory, functional analy- 
sis, differential geometry. Topics include pseu- 
dodifferential operators, local symplectic ge- 
ometry, global theory of Fourier integral oper- 
ators, spectral theory, Cauchy problems, wave- 
front sets, propagation of singularities. Concise 
treatment; chapter exercises. HD 


Partial Differential Equations, T(17-18: 2). 
Partial Differential Equations in Classical 
Mathematical Physics. Isaak Rubinstein, Lev 
Rubinstein. Cambridge Univ Pr, 1993, xiv + 
676 pp, $94.95. [ISBN 0-521-42058-4] The- 
ory of PDE’s, viewed as language for describ- 
ing continuous phenomena in physics. Basic 
natural science concepts are a thorough expo- 
sition of standard scientific PDE problems. A 
rigorous, systematic treatment of mathematics 
applied in classical physics. DS 

Dynamical Systems, T(15-17: 2, 3), S, P, L. 
Chaos and Nonlinear Dynamics: An Introduc- 
tion for Scientists and Engineers. Robert C. 
Hilborn. Oxford Univ Pr, 1994, xvii + 654 pp, 
$55. [ISBN 0-19-505760-0] An introduction 
for scientists and engineers, by a physicist. Sur- 
veys and explains what nonlinear dynamics is 
about and what it does; not fully rigorous. As- 
sumes introductory physics, calculus through 
differential equations. Analytic and computer 
exercises end each chapter. KS 


Dynamical Systems, T(17: 1, 2), P, L. Chaos 
in Dynamical Systems. Edward Ott. Cam- 
bridge Univ Pr, 1993, xii + 385 pp, $69.95. 
[ISBN 0-521-43215-4] For science and engi- 
neering students. Besides standard material, 
treats quasiperiodicity, Hamiltonian systems, 
multifractals, quantum chaos. Retains math- 
ematical flavor despite relegating mathematical 
details to appendices. SK 


Numerical Analysis, T(16-17: 1), L*. Nu- 
merical Solution of Ordinary Differential Equa- 
tions. Lawrence F. Shampine. Chapman & 
Hall, 1994, x + 484 pp, $64.95. [ISBN 0-412- 
05151-6] Theory and practice of numerical 
solution of initial value problems for ODE sys- 
tems. Practical focus. Assumes background in 
differential equations, numerical analysis. AO 


Analysis, S(18), P. The Banach-Tarski Para- 
dox. Stan Wagon. Cambridge Univ Pr, 1993, 
xviii + 253 pp, $24.95 (P). [ISBN 0-521-45704- 
1] The paradoxes at issue range from surpris- 
ing (perfect squares and positive integers are 
equally numerous) to astonishing (any bounded 
set in space can be dissected and rearranged 
to form any other bounded set). Book read- 
ably explores such paradoxes and links to other 
areas, especially measure theory. Preface and 
brief addendum mention recent work, e.g., of 
Laczkovich on circle-squaring by translation. 
(1985 hardcover text, TR, March 1986.) PZ 


Analysis, T(17-18: 1, 2), L. Difference Equa- 
tions: Theory and Applications, Second Edition. 
Ronald E. Mickens. Van Nostrand Reinhold, 
1990, xi + 448 pp, $54.95. [ISBN 0-442-00136- 
3] New edition adds exercises and a new chap- 
ter on applications. (First Edition, TR, February 
1989.) AO 


Algebraic Geometry, S(18), P, L. Theory of 
Algebraic Invariants. David Hilbert. Cam- 
bridge Univ Pr, 1993, xiv + 191 pp, $39.95; 
$19.95 (P). [ISBN 0-521-44457-8; 0-521- 
44903-0] Translated and edited notes from a 
course of 51 lectures given by Hilbert in 1897, 
focused largely on his own fundamental work 
in invariant theory. A skillful, readable, coher- 
ent presentation, with a contemporary flavor. 
Main results include Hilbert’s Basis Theorem, 
a Nullstellensatz, and the Syzygy Theorem. JS 


Differential Geometry, P. Harmonic Maps 
and Integrable Systems. Eds: Allan P. Fordy, 
John C. Wood. Aspects of Math., V. E23. Friedr 
Vieweg & Sohn, 1994, 329 pp, $64. [ISBN 3- 
528-06554-0] 12 expository articles explain 
applications of integrable systems to finding 
harmonic maps and related problems. 


Geometry, T**(16-17: 1, 2), L*. Visual 
Geometry and Topology. Anatolij Fomenko. 
Springer-Verlag, 1994, xvi + 324 pp, $89. 
[ISBN 0-387-53361-3] A beautiful textbook. 
Aims to “narrate, in an accessible and fairly 
visual language, some classical and modern 
achievements of geometry in both intrinsic 
mathematical problems and applications.” Con- 
tains 287 illustrative drawings, plus 50 plates of 
the author’s original artwork. Topics include 
polyhedra, simplicial complexes, homologies, 
low-dimensional manifolds, symplectic topol- 
ogy, Hamiltonian mechanics, minimal surfaces, 
fractal geometry, and hyperbolic geometry. DP 


Geometry, T*(14: 1), P, L. College Geometry: 
A Discovery Approach. David C. Kay. Harper- 
Collins, 1994, xvi + 494 pp, $38. [ISBN 0-06- 
500006-4] Guided self-discovery of several 
Euclidean gems precedes axiomatic treatment 
of foundations and transformation approach. 
Concludes with hyperbolic geometry. Includes 
a wealth of figures, historical notes, graded ex- 
ercises (including research projects); useful an- 
cillary materials are available. An appealing 
text for mathematics majors and for prospective 
secondary teachers. JNC 


Algebraic Topology, T(15-16: 1), L*. A 
Geometric Introduction to Topology. C.T.C. 
Wall. Dover, 1993, vi + 168 pp, $6.95 (P). 
[ISBN 0-486-67850-4] A minimal approach, 
assumes no general topology and avoids sim- 
plexes. Parts I and II culminate in a proof 
of Alexander’s duality theorem. Part III illus- 
trates connections with other branches of pure 
mathematics. (1972 Addison-Wesley edition, 
TR, August-September 1972; Extended Re- 
view, December 1975.) DP 


Optimization, T(16-17: 1, 2), L. Linear Pro- 
gramming. James P. Ignizio, Tom M. Cava- 
lier. Ser. in Industrial & Systems Eng. Prentice 
Hall, 1994, xx + 666 pp. [ISBN 0-13-183757- 
5] Introduction to formulation, solution, and 
analysis of linear programming models. Also 
covers network and integer models, multiob- 
jective optimization. Relatively strong empha- 
sis on fundamental concepts and mathematical 
theory; assumes some linear algebra. AO 


Probability, P. Conditional Measures and Ap- 
plications. M.M. Rao. Pure & Appl. Math., 
V. 177. Marcel Dekker, 1993, xiv + 417 pp, 
$135. [ISBN 0-8247-8884-2] Axiomatic de- 
velopment of conditional measures from several 
different directions. Applications include com- 
putational problems associated with condition- 
ing. Uses historical development as motivation. 
No exercises; extensive bibliography. MK 


Probability, P. Counterexamples in Prob- 
ability and Real Analysis. Gary L. Wise, 
Eric B. Hall. Oxford Univ Pr, 1993, xii 
+ 211 pp, $39.95. [ISBN 0-19-507068-2] 
Counter-intuitive examples from real analysis 
and measure theoretic probability. RSK 


Stochastic Processes, P. Simulation and 
Chaotic Behavior of α-stable Stochastic Processes.
Aleksander Janicki, Aleksander Weron.
Pure & Appl. Math., V. 178. Marcel Dekker,
1994, vii + 355 pp, $125. [ISBN 0-8247-8882-6]

Stochastic Processes, T(18: 1, 2), S, C. Nu- 
merical Solution of SDE Through Computer Ex- 
periments. Peter E. Kloeden, Eckhard Platen, 
Henri Schurz. Universitext. Springer-Verlag, 
1994, xiv + 292 pp, $49 (P), with disk. [ISBN 
0-387-57074-8] Stochastic differential equa- 
tions (SDEs) and discrete time approximation 
(strong and weak) of their solutions. Includes 
Turbo Pascal listings and disk. RWJ 


Elementary Statistics, T(13-14: 1). A First 
Course in Probability Models and Statistical 
Inference. J.H.C. Creighton. Texts in Stat. 
Springer-Verlag, 1994, xxxi + 717 pp, $49.95. 
[ISBN 0-387-94114-2] Probability, descrip- 
tive statistics, discrete and continuous probabil- 
ity models, estimation, hypothesis testing, and 
simple linear regression. Over 350 pages on 
hints and solutions of exercises. RWJ 


Statistical Methods, T(15-17: 1), P. Modeling 
Experimental and Observational Data. Clif- 
ford E. Lunneborg. Duxbury Pr, 1994, xv + 
544 pp. [ISBN 0-534-21426-6] For students 
in biological and behavioral sciences; empha- 
sizes generality of the modeling approach in 
handling analysis of variance and regression 
problems, including logistic and Poisson regres- 
sion. Techniques and diagnostic procedures are 
illustrated by meaningful examples. RSK 


Statistical Methods, T(15-17: 1), S. Visu- 
alizing Data. William S. Cleveland. Hobart 
Pr, 1993, 360 pp, $40. [ISBN 0-9634884-0- 
6] From Preface: “The success of a visualiza- 
tion tool should be based solely on the amount 
we learn about the phenomenon under study.” 
By the same criteria, this text is a success. 
Through about 20 analyses (classic studies and 
recent data sets) text exposes various graph- 
ical methods and visualization tools, always 
keeping scientific goals in sight. Some stan- 
dard techniques (Q-Q plots, boxplots, paramet- 
ric and non-parametric fits, slicing, 3-D plots, 
etc.), some less standard (stereograms, loess 
curve fitting, factor-plane methods, multi-way 
dot plots). Readable text, thoughtful examples, 
useful techniques. No exercises, but still a pos- 
sible text for scientific data analysis. MK 


Statistical Methods, T(18), S, P. Statistical 
Models Based on Counting Processes. Per 
Kragh Andersen, et al. Ser. in Stat. Springer- 
Verlag, 1993, xi + 767 pp, $69. [ISBN 0-387- 
97872-0] Theoretical framework (e.g., count- 
ing processes, continuous-time martingales, 
stochastic integration) for event history/survival 
analysis. Covers applications and theory underlying
many models and techniques. High mathematical
level, but many examples and applications.
Amazing variety of topics: from nonparametric
estimation to frailty models. MK


Statistical Methods, T(16-17), C. Statistical
Principles of Research Design and Analysis.
Robert O. Kuehl. Duxbury Pr, 1994,
xvi + 686 pp, with disk. [ISBN 0-534-18804- 
4] Classical introduction to design of experi- 
ments, stressing research design: “the total ef- 
fort in a study that includes development of the 
research hypothesis, the choice of treatment de- 
sign to address the research hypothesis, and 
the experiment design choice to facilitate ef- 
ficient data collection.” A beautifully written, 
well-conceived textbook for practitioners and 
newcomers. Examples based on actual stud- 
ies from life sciences, agriculture, industrial 
and chemical engineering. Good exercises; in- 
cludes disk. MK 


Statistical Methods, P. Model-Free Curve Es- 
timation. Michael E. Tarter, Michael D. Lock. 
Mono. on Stat. & Appl. Prob., V. 56. Chap- 
man & Hall, 1993, x + 290 pp, $49.95. [ISBN 
0-412-04251-7]

Statistics, P, L. The Collected Works of John 
W. Tukey, Volume VIII: Multiple Comparisons: 
1948-1983. Eds: Henry I. Braun, et al. Chap- 
man & Hall, 1994, lxi + 485 pp, $62.95. [ISBN 
0-412-05121-4] 

Statistics, T(17-18: 2), L. Advanced Linear
Models: Theory and Applications. Song-Gui 
Wang, Shein-Chung Chow. Stat.: Textbooks 
& Mono., V. 141. Marcel Dekker, 1994, x + 
537 pp, $165. [ISBN 0-8247-9169-X]  Sta- 
tistical inference for linear models and unified 
coverage of regression, ANOVA, ANCOVA, 
and variance components models. Preliminary 
chapters treat matrix results and the multivariate 
normal and related distributions. RWJ 


Statistics, T(17-18: 1), L. Hilbert Space Meth- 
ods in Probability and Statistical Inference. 
Christopher G. Small, D.L. McLeish. Wiley, 
1994, xi + 252 pp, $59.95. [ISBN 0-471-59281- 
1] A Hilbert space approach to random vari- 
ables. Applications to martingales, stochastic 
integration, interpolation, density estimation. 
Assumes linear algebra, probability, and statis- 
tics, and some knowledge of basic stochastic 
processes. RWJ 


Programming, P, C. The Borland C++ 4.0 
Primer. Keith Weiskamp. Academic Pr, 1994, 
xviii + 568 pp, $39.95 (P), with disk. [ISBN 
0-12-742683-3] 

Programming, C*. Mastering Mathematica: 
Programming Methods and Applications. John 
Gray. Academic Pr, 1994, xx + 644 pp, (P), with 
disk. [ISBN 0-12-296040-8] Core subject is 
different types of Mathematica programming.
Includes introductory tutorials, chapters on rep- 
resenting mathematics in Mathematica, trans- 
lating mathematical problems to Mathematica- 
solvable form. A must for prospective serious 
Mathematica users. DS 

Computer Systems, T(16), S*, C, L*. Applied 
Mathematica: Getting Started, Getting It Done. 
William T. Shaw, Jason Tigg. Addison-Wesley, 
1994, xiv + 432 pp, (P). [ISBN 0-201-54217- 
X] Impressive introduction to Mathematica’s 
power and versatility. Useful to both neophytes 
(though not suitable as a first pass through Math- 
ematica) and experienced users. Topics include 
data visualization in two and three dimensions, 
statistical analysis, time series, digital imaging.
Includes basic Mathematica techniques,
detailed coverage of data handling. No ex- 
ercises, but many well thought-out examples. 
Highly recommended. MPR 


Computer Systems, P. Motif Programming 
Manual: Volume Six A. Dan Heller, Paula M. 
Ferguson. O’Reilly & Assoc, 1994, xlii + 
972 pp, $34.95 (P). [ISBN 1-56592-016-3] 


Applications (Behavioral Science), S(15-16),
P. Theory of Moves. Steven J. Brams. Cam- 
bridge Univ Pr, 1994, xii + 248 pp, $17.95 (P). 
[ISBN 0-521-45867-6; 0-521-45226-0] De- 
scribed as a theory that is based upon and ex- 
tends the classical theory of games, the work 
described in this book bears little resemblance 
to the kind of reasoning and proof mathemati- 
cians expect to see in a book on game theory. 
The effort to build a dynamic theory that an- 
ticipates the next move, and the next, makes 
interesting reading, as do examples that range 
from the Old Testament to the attempted coup 
against Gorbachev, but it won’t do for under- 
standing game theory. AWR 


Applications (Biological Science), P. Frac- 
tals: A User’s Guide for the Natural Sciences. 
Harold M. Hastings, George Sugihara. Oxford 
Univ Pr, 1993, xii + 235 pp, $19.95 (P). [ISBN 
0-19-854597-5] Applications of fractal ge- 
ometry to the natural world. Treats mathematics 
of fractals and modeling patterns; includes case 
studies and user-ready programs. DH 


Applications (Biological Science), P. Cell 
Biology. Eds: Byron Goldstein, Carla Wofsy. 
Lect. on Math. in the Life Sci., V. 24. AMS, 
1994, x + 135 pp, $38 (P). [ISBN 0-8218-1175- 
4] Invited lectures from the 1992 Symposium 
on Some Mathematical Questions in Biology 
held at the annual meeting of the American So- 
ciety for Cell Biology. 


Applications (Fluid Dynamics), P. The 
Couette-Taylor Problem. Pascal Chossat, 
Gérard Iooss. Appl. Math. Sci., V. 102. 
Springer-Verlag, 1994, ix + 233 pp, $44.50.
[ISBN 0-387-94154-1] Local analysis of vor- 
tex flow of fluid between two concentric cylin- 
ders. Provides experimental results and de- 
scribes G. Taylor’s model to study these exper- 
iments. Emphasizes bifurcation and symmetry 
breaking arguments. DS 


Applications (Physics), S(16-18), P, L. The
Special Theory of Relativity: A Mathematical 
Exposition. Anadijiban Das. Universitext. 
Springer-Verlag, 1993, xii + 214 pp, $39.95 (P). 
[ISBN 0-387-94042-1] Strong mathematical 
emphasis—physical situations are seldom men- 
tioned. A useful reference, but a difficult en- 
trance into the subject. MU 


Applications (Physics), T*(15-16: 1), S, L. 
A Brief on Tensor Analysis, Second Edition. 
James G. Simmonds. Undergrad. Texts in Math. 
Springer-Verlag, 1994, xiv + 112 pp, $29.95. 
[ISBN 0-387-94088-X] New edition corrects 
typos, augments exercises; new section treats 
differential geometry. A real gem. (First Edi- 
tion, TR, February 1983.) MU 


Applications, P, L*. Bridging Mind and 
Model: Papers in Applied Mathematics. Ed: 
Peter J. Costa. St. Thomas Tech Pr (Center 
for Appl. Math., Mail #4410, 2115 Summit 
Ave., St. Paul, MN 55105-1096), vi + 306 pp,
$24.95. [ISBN 0-9624229-7-5] 8 expository 
papers on maritime traffic control, path recon- 
struction, biochemistry, textile manufacturing, 
laser physics, and signal processing. 


Applications, T(17-18: 1), P. Mathematical 
Foundations of Elasticity. Jerrold E. Marsden, 
Thomas J.R. Hughes. Dover, 1993, xviii + 
556 pp, $14.95 (P). [ISBN 0-486-67865-2] 
Mathematical foundations of 3-dimensional 
elasticity using modern differential geometry 
and functional analysis. For mathematicians, 
engineers, and physicists. Only advanced cal- 
culus is needed for chapters on kinematics and 
dynamics of continuous media; more back- 
ground needed to study variational principles 
and functional analysis. (1983 Prentice-Hall 
text, TR, October 1983.) DS 

Research Issues in Undergraduate Mathematics Learning
Preliminary Analyses and Reports

James J. Kaput and Ed Dubinsky, Editors


Research in undergraduate mathematics edu- 
cation is important for all college and university 
mathematicians. If our students are to be more 
successful in understanding mathematics, then 
college faculty need to understand how math- 
ematics is learned. This knowledge can guide us 
in curriculum reform and in improving our own 
teaching. It can help us make mathematics 
accessible to all students and it can increase the 
number of graduate students in mathematics. 

This volume of research in undergraduate 
mathematics education informs us about the 
nature of student learning in some of the most 
important topics in the undergraduate curricu- 
lum: sets, functions, calculus, statistics, abstract 
algebra and problem solving. Paying careful 
attention to the trouble students have in learn- 
ing mathematics will help us to work with stu- 
dents so they can deal with those difficulties. 

A survey of the literature begins the volume. 
Becker and Pence have brought together an 
unusually complete list of references on research 
in collegiate mathematics. Their comments will 
guide those attempting to begin or to continue 
a program of research in student learning. 

The sad fact that even good calculus students 
stumble over nonroutine problems is the theme 
of Selden, Selden, and Mason. Their conclu- 
sions point to significant shortcomings in the 
curriculum. This study of student difficulties is 
continued by Ferrini-Mundy and Graham, who
investigate a single student’s interactions with 
the fundamental concepts of the calculus. 
Baxter studies a group of students to learn how 
they acquire the concept of set, while Cuoco 
does the same for the concept of function. 

Cooperative learning does help the student. 
That is the conclusion of Bonsangue, who inves- 
tigates how two carefully matched classes of 
students in a statistics course perform on exams. 
How students learn to write proofs in group 
theory is the subject considered by Hart. 
Rosamond breaks new ground by comparing 
how emotions vary in their effect on the prob- 
lem solving ability of novices and experts. 

All college faculty should read this book to 
find how they can help their students learn 
mathematics. 

Exploring Mathematics With Your Computer

Arthur Engel


A must for academic libraries supporting an undergraduate
major in mathematics. Public libraries with
strong science collections should have this book as a 
resource for traditional mathematics topics and as a 
recreation for the mathematically inclined. Capable 
high school students would also benefit. 

—CHOICE 


Today’s personal computer gives its owner tre- 
mendous power which can be used for experi- 
mental investigations and simulations of un- 
precedented scope, leading to mini-research. 
This book is a first step into this exciting field. 


This is a mathematics book, not a programming 
book, although it explains Pascal to beginners. 
It is aimed at high school students and under- 
graduates with a strong interest in mathematics 
and teachers looking for fresh ideas. It is full of 
diverse mathematical ideas requiring little back- 
ground. It includes a large number of challeng- 
ing problems, many of which illustrate how 
numerical computation leads to conjectures 
which can then be proved by mathematical 
reasoning. 


You will find 65 interesting and substantial 
mathematical topics in this book, and over 360 
problems. Each topic is illustrated with examples 
and corresponding programs. The major goal 
of the book is to use the computer to collect data 
and formulate conjectures suggested by the data. 

Student Research Projects in Calculus

Marcus Cohen, Edward D. Gaughan, Arthur Knoebel,
Douglas S. Kurtz, and David Pengelley


It is yet another workable way to rejuvenate college 
calculus. The monograph is highly recommended to 
those interested in experimenting with mathematics 
curricula. —AAAS, Science Books & Films 


I found the book very readable and thought provoking. 
Whether your students are in engineering, pure or 
applied science, or even liberal arts, this book may 
change the way you teach—and the way they learn— 
calculus for the better! —The Mathematics Teacher 


An important contribution to the growing list of new 
curricular materials becoming available to assist in the 
teaching of calculus. —CHOICE 


Changing the way students learn calculus was 
the goal of the authors of this excellent guide- 
book. In the Spring of 1988, they began work on 
a student project approach to calculus. 


You can use their methods in teaching your own 
calculus courses. Over 100 projects are pre- 
sented, all of them ready to assign to your stu- 
dents in single and multivariable calculus. The 
projects were designed with one goal in mind: to 
get students to think for themselves. Each project 
is a multistep, take home problem, allowing 
students to work both individually and in groups. 
Each project has accompanying notes to the
instructor reporting students’ experiences. The 
notes contain information on prerequisites, list 
the main topics the project explores, and sug- 
gest helpful hints. The authors have also pro- 
vided several introductory chapters to help in- 
structors use projects successfully in their classes 
and begin to create their own. 

Puzzles and Problems in Tiling

George Martin


George Martin has done a truly marvelous job of 
presenting the material in this book in an attractive 
and clear way. —Martin Gardner


It brings together results which are scattered
throughout the literature and will be a standard
reference for many years. . .
—Mathematical Reviews


Martin has an easy style of writing, and the mate- 
rial is effortless to read and understand. 
—Mathematics Teacher 


The writing is clear, with enough formality to make 
the arguments make sense. This book is a neces- 
sary part of a well-stocked library in recreational 
mathematics. —AAAS, Science Books & Films 


Polyominoes will delight not only students and 
teachers of mathematics at all levels, but will be 
appreciated by anyone who likes a good geomet- 
ric challenge. There are no prerequisites. If you 
like jigsaw puzzles or if you hate jigsaw puzzles 
but have ever wondered about the pattern of some
floor tiling, there is much here to interest you. 


A polyomino is a shape cut along the lines from 
square graph paper; the pronunciation of 
polyomino begins as does polygon and ends as
does domino. Tilings, also called tessellations or
mosaic patterns, are older than civilization itself.
Tiling with polyominoes provides challenges that 
range from the popular jigsawlike puzzles to easily 
understood mathematical research problems. You
will find unsolved puzzles and problems of both 
kinds here. Answers are provided for most of the 
problems that have a known solution. 


It is only fair to repeat here the warning stated in 
the preface to this book, “Playing with polyomi- 
noes can be habit forming.” 

Symbolic Computation in Undergraduate Mathematics Education

Zaven Karian, Editor


If you are considering putting a symbolic comput- 
ing system into your curriculum, this is one publi- 
cation you should have. 

—Mathematics Teacher 


This well-written book should be helpful to anyone 
using symbolic computation as an aid in teaching 
undergraduates—The book provides a number of 
examples for presenting probability and statistics 
in a way that removes the tedium and emphasizes
the underlying ideas. 

—AAAS, Science Books & Films


If you have any plans to integrate symbolic computing
into your program, read and study this book
first. Your students will thank you for it.
—AMATYC Review


This volume brings together many of the facets 
associated with the pedagogic uses of symbolic 
computation. 


Part I consists of articles that deal with general
issues of learning mathematics and the role of 
symbolic computation in that process. The articles 
in Part II describe the use of symbolic computation
in teaching calculus. Some of the areas covered 
are the use of symbolic computation in a labora- 
tory calculus course, the uses of Derive in the 
instruction of calculus, antidifferentiation and the 
definite integral, and the experiences and reflections
of teachers who have used symbolic computation
in calculus instruction.


Part III consists of papers on sophomore-level
courses on linear algebra and differential equa- 
tions. The articles in Part IV describe what can be 
done in using symbolic computation in teaching 
combinatorics, probability and statistics courses. 
The articles and references in Part V will help you 
get started in using some of these ideas at your 
own institution.

Mathematical Magic Show


...We visit most of the prime sites of recreational 
mathematics: game theory, factorials, puzzles, 
playing cards, finger arithmetic, Möbius bands,
polyominoes, perfect numbers, the knight’s tour, 
trees, and dice. Gardner always has new facts 
and ideas to add interest to even the most well- 
trodden areas. —Times Literary Supplement 

His startling gift for bringing the sublime to the
people is unabated. Once again, hard math- 
ematical ideas are conveyed with fluency, charm, 
and utter clarity. As a philosopher, I warmly salute
his judgement of when to leave a metaphysical 
question enticingly open. His craftsmanship re- 
mains exquisite. — New Scientist 

Riddles of the Sphinx and Other
Mathematical Puzzle Tales 


This book charms, informs, inspires, puzzles, 
and delights, and the reader can dip in almost 
anywhere and get hooked by the natural lucidity 
of style and the friendly tone which are so charac- 
teristic of Martin Gardner. 

—Mathematical Spectrum 

Mathematical Circus


A circus suggests fun and enjoyment and there is 
plenty of both to be found here. The book should 
certainly be in the school library. It will also be 
a valuable resource for the teacher.

—The Mathematical Gazette 

How are universities around the
world putting Mathematica® on
every computer on campus for
as little as $1 per student?
By taking advantage of
new Mathematica site
license programs. In fact, site
licenses at over 700 universities
have made Mathematica
accessible to millions of students
without breaking the
school budget.

This new series of flexible, 
affordable site license programs 
puts you in charge. You choose
where you want Mathematica, 
what kinds of computers you 
want it on, how you want to net- 
work it in your labs, and how 
much you want to invest. And 
you can save up to 90% off the 
already-reduced academic
price.

If you’re excited 
about preparing more 
students for tomorrow by teach- 
ing with the world’s leading 
technical computing system 
today, talk to us about a Mathematica
site license. We’ll work
together to customize one to fit 
your school’s needs. 


1-800-441-MATH (6284) 


(U.S. and Canada)


Call us about a site license 
for your campus and ask 
for a free copy of this 
booklet, Mathematica 
Flexible Academic 
Purchase Programs. 